Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rcas_1512 |
Symbol | |
ID | 5538988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Roseiflexus castenholzii DSM 13941 |
Kingdom | Bacteria |
Replicon accession | NC_009767 |
Strand | + |
Start bp | 1931195 |
End bp | 1934113 |
Gene Length | 2919 bp |
Protein Length | 972 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640893650 |
Product | DNA polymerase I |
Protein accession | YP_001431623 |
Protein GI | 156741494 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.43872 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0397579 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGCACACA ACCGTCCGCT GCTGTTGCTG ATCGATGGTC ATGCGTTGGC GTATCGCGCC TTCCACGCGC TGGCGGAAGC GGGGTTGCGC TCCTCGACCG GCGAGCCAAC CTATGCGGTC TACGGGTTCA CCTCGGCGAT GCTCAATGCC ATCGAGGAGT ATCAACCGGA ATATGCCGCA GTGGCGTTCG ATGTCGGCAA GACTTTCCGC GACGATCTGT ACGCCGAATA CAAGGCGAAC CGCGCCGAGA CCCCGGCTGA GTTTGAGCAG CAACTGGAAC GCATCAAGCA GGTGCTAGCG GCGTTCGACA TTCCGATCTA TACCGCCGAA GGATACGAAG CCGATGATGT GATCGGCACG CTGGCGCGTC AGGCGACTGA ACGCGGCGTT GATGTGCTGA TCCTCACCGG CGATACCGAT ACGCTGCAAC TGGTCGACGA TCATGTGACC GTGCTCCTCA ACAATCCGTA TCTCCGCGGA CCGAAAAATA CCACGCGCTA CGGCGTCGCG GAGGTGACGG AACGCTACAA AGGGCTGCGC CCCCATCAAC TTGCCGACCT GCGCGGCTTG AAGGGCGATC CGTCCGACAA TATCCCCGGT GTCAAAGGCA TTGGTGAGGC TGGCGCAATT GCGCTCCTCA ATCAGTTTGG CACAATCGAG AACCTGTACG ACCATCTGGA CGAAGCGCCG AAACGCTATC AGAAGCATCT CGAAGGGCAG CGCGACATTG CGCTGTTCAG CAAGAAACTG GCGACCATTG TGTGTGATGC GCCGGTAACG CTCAACCTTG CAGACGCAAC GCTCGCCGAT TACGACCGCA GCCGGGTGAT CGCGGTCTTT CAGGAACTCG AGTTTGGTGC ATCGCTGGTG AAGCGCCTTC CGCCATCCGG CAGTACGACC GTTGCGCAGC CGCTGCCGTC CATACCGCAG GCAGACGCAT CGGCGCCGCT TCAGGCGGAC ATGTTCGCGC CGGAAGCGCC TGCTGTGCGA GAGTCGTCCG CCGAGCCGCA GCAGTTGACC CTGTTTGGCG ATCCAGAGGT TGTGCCGCCG CCGGTCGCGC ATGTCGAGGC GCCGCTGCGC GCAGCGCCGG GTGAATACCG CGCTGCCTGC ACCGATGCCG ACCTGGAAGC GGTGGTCGCC GAACTCGCCA CGGCGCCGCT GGTTGCGTTC GATACCGAAA CGCGCGGGAT GAACCCGCTG CGCGATGATC TCGTCGGTCT GTCGCTGGCG ACGGTCCCCG GTCGAGCCTG GTACATTCCC GTCGGGCATA CCACCGGCGA GATGCAATTG CCGCGCGATC GGGTGATCGC GGCGCTACGC TCATTCTTTG CCGATCCGGC GCGCCCCAAG GTTGCCCACA ACGCCAAGTT TGATATCGAA GTGCTGGAAC GCGCCGGCAT GCCGGTCGCC GGTTTGTCGT TCGATACGAT GCTGGCGGCT GGCTTGCTCG ACAAACGTCG TAATTTGAAA GACCTGGCGT TCTACGAACT GGAACTCGCT GAACCGCTGG ACGCAATTGG CGACCTGATC GGCAAGGGGA AGAACCAGGT CACCTTCGCA GAGGTGCCGA TTGCCCGCGC TACGCGATAT GCTGCTGCCG ACGCCGATAT GACGCTGCGC CTGCAACCGG CGCTCGAAGC GAAACTGCGT GCTGCCGGGA GTCTCGCCGA TATGTTCTAC CGCCTGGAAA TGCCGCTCGT GCCGGTGCTG GTGCGGATGG AGCAGTCCGG CATTCTGCTG GATGTTCCGT ATATGCGCGC TCTGGGGGAG CGCATGGGTC GGGAACTTGC ACAGATCGAG CAGCAGATCT TTGCCATCGC CGGGAAACCG TTCAATATCA ACTCCGGCGA TCAGTTGAGC GAGGTGCTGT TCGGTCCCAA GATCAACCTG CCCACAACGG GTCTCGAACG CACCCGGACA GGGCGCTACT CGCTGACGGC GCAGGCGCTC GAAGAGTTGC AGGCAAGCGA CACTACCGGC ATTATTGAGT TAATCCTGCG CCACCGTCGC CTGAGCAAAC TCAAATCAAC CTATGTTGAT GAACTCCCGG CGCTGGTCAA CCCGGAGACC GGCAGAGTCC ATACCGATTA CAATCAGCTT GGTGCGGCGA CCGGGCGGTT GAGCAGCAAT TCGCCGAACC TGCAAAATAT TCCAACCCGC ACCGAAGAGG GGCGCGAGGT GCGGCGCGGC TTCATCGCTG CCCCCGGTCA TGTGTTGATC GCCGCCGACT ATTCGCAGAT CGAGTTGCGC GTGCTGGCGC ACATCACCGG CGATCCGAAC CTGATCCAGA CGTTTATTGA AGGGCGCGAC ATTCACGCAG CCACCGCCGC ACGTCTGTTC GGCGTCGGTT TCAATGCCGT GGACAAGAAT CAGCGGCGGA TCGCCAAAAC CGTCGTTTTT GGCGTCATCT ACGGCATCAG CCCATTCGGG TTGGCGCAAC GCCTGGGTAT TTCGCGCGAA CAGGCGCGCA ACCTGATCGA TAGCCTGTTC AATCAGTTCC CCCGCATCCG CGACTACATC GACCGCACGC TGGAAACCGG ACGGAATGAA GGGTATGTGC AGTCGCTCTT CGGCCGCCGC CGCCCCATGT TCGACCTGCG CACGTCCGGT CCGCGGCGTC AGGCAGCCGA ACGTGAAGCG ATCAACCATC CGATCCAATC CACAGCGGCG GACATCATGA AACTGGCGAT GATTGCTGTG GATGCCGAAC TGCGTCGCCG CCGGTTGCGC ACCCGCATGT TGCTCCAGGT TCACGACGAA TTGATCTTCG AGGCGCCGGA GGCGGAAGTG GATGAGGTGG TGGCGCTGGT GCGCGAGCGA ATGGAAGGGG TGTTGAGCGA TATGCACCCG CCGTTCGCCG TCCCGCTGCG CGTCGAGATC GAAAAAGGTT CGAATTGGGA AGAACTGACG CCAGTGTGA
|
Protein sequence | MAHNRPLLLL IDGHALAYRA FHALAEAGLR SSTGEPTYAV YGFTSAMLNA IEEYQPEYAA VAFDVGKTFR DDLYAEYKAN RAETPAEFEQ QLERIKQVLA AFDIPIYTAE GYEADDVIGT LARQATERGV DVLILTGDTD TLQLVDDHVT VLLNNPYLRG PKNTTRYGVA EVTERYKGLR PHQLADLRGL KGDPSDNIPG VKGIGEAGAI ALLNQFGTIE NLYDHLDEAP KRYQKHLEGQ RDIALFSKKL ATIVCDAPVT LNLADATLAD YDRSRVIAVF QELEFGASLV KRLPPSGSTT VAQPLPSIPQ ADASAPLQAD MFAPEAPAVR ESSAEPQQLT LFGDPEVVPP PVAHVEAPLR AAPGEYRAAC TDADLEAVVA ELATAPLVAF DTETRGMNPL RDDLVGLSLA TVPGRAWYIP VGHTTGEMQL PRDRVIAALR SFFADPARPK VAHNAKFDIE VLERAGMPVA GLSFDTMLAA GLLDKRRNLK DLAFYELELA EPLDAIGDLI GKGKNQVTFA EVPIARATRY AAADADMTLR LQPALEAKLR AAGSLADMFY RLEMPLVPVL VRMEQSGILL DVPYMRALGE RMGRELAQIE QQIFAIAGKP FNINSGDQLS EVLFGPKINL PTTGLERTRT GRYSLTAQAL EELQASDTTG IIELILRHRR LSKLKSTYVD ELPALVNPET GRVHTDYNQL GAATGRLSSN SPNLQNIPTR TEEGREVRRG FIAAPGHVLI AADYSQIELR VLAHITGDPN LIQTFIEGRD IHAATAARLF GVGFNAVDKN QRRIAKTVVF GVIYGISPFG LAQRLGISRE QARNLIDSLF NQFPRIRDYI DRTLETGRNE GYVQSLFGRR RPMFDLRTSG PRRQAAEREA INHPIQSTAA DIMKLAMIAV DAELRRRRLR TRMLLQVHDE LIFEAPEAEV DEVVALVRER MEGVLSDMHP PFAVPLRVEI EKGSNWEELT PV
|
| |