Gene Information |
Locus tag | Tcr_0044 |
Symbol | |
ID | 3762198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thiomicrospira crunogena XCL-2 |
Kingdom | Bacteria |
Replicon accession | NC_007520 |
Strand | + |
Start bp | 52701 |
End bp | 55520 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637784750 |
Product | DNA polymerase I |
Protein accession | YP_390315 |
Protein GI | 78484390 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI); [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | [TIGR00593] DNA polymerase I; [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
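The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that includes its stop codon; both assumptions are consistent with the values in this record but are not stated by it. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the Start bp / End bp rows of this record.
length = gene_length_bp(52701, 55520)
print(length, protein_length_aa(length))
```

With the record's coordinates this reproduces the listed Gene Length (2820 bp) and Protein Length (939 aa).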
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00167336 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCAAA TCCTTCCTGA ATTCAATCCA GACAGTCCTT TTATCTTAGT TGACGGCTCG TCTTACCTGT TCCGGGCGTT TCATGCCATG CCACCCTTGA CGAACACCAA AGGGCATGCC ACCGGGGCGA TTTTCGGTGT CATCAATATG ATTGGAAAGT TATTAGAGCA ATACCAACCC GAACGCATTG CGGTGGTATT TGATGCCAAA GGAAAAAATT TTCGCCATGA ACTCTATGAG GACTACAAAG CCCATCGCCC GCCAATGCCA GATGAGTTGC GCATTCAGAT TGAACCGATT CATGAAATCA TCAAAGCATT GGGCATTCCT TTGTTAGTGA TTGATGGTGT GGAAGCCGAT GATGTGATGG GCACCCTGGC CCACCAAGCC ACCCAGGCGA AAATGGACGC GCTGCTGTCA ACCGGGGATA AAGACATGGC GCAATTGGTG AATGAACACA TTACCCTTGT TAACACCATG AATGATACGT TGATGACACC TGACAGTGTG TTTGAAAAGT TCCACGTGAA ACCTGAACAG ATTATTGATT ATCTGGCCTT AATGGGCGAT AGCAGTGACA ACATTCCTGG CATTCCCAAA TGTGGTCCCA AAACCGCCGC AAAATGGATT TCCGAATATG GCTCTATTGA TAATTTGATT GAGCATGCGG AGGAAATTAA AGGCAAAATC GGTGAAAATT TACGTGCCAA TTTAGAACAA CTGAAATTAT CACAACAGTT AACCACCATT CGACTGGATT GCGACCTACC AATTGCACTT AATGACATCA AACGCCATGA AGCGGATATG GAGGCACTGG AAGCCCTGTT TTCTGAATAC GACTTACGTA ACTGGTTAAC CCGCGTATTA AAAGGCGAAC TGCCTTTCAG TAAAAGCAGC GGTCGAAAAG CTCATTCCGA AACCGTCACT AACGGCCAGT CAAAAAAAGC CAATACCTCT CCGTCGACAG AGACACCAAA AGCGGATTCG CAACCTTATG AAACCATTGT CGACTGGCAC ACTTTCGATC AATGGCTTAA AAAGCTCGAA GCATCGGATG TTTTTGCGAT TGATACCGAA ACCACTTCCT TAAATGCTAT GGAAGCAAAA ATTGTTGGTG TCAGCTTTGC CTATGCTGAA AAAAAGGACC AGGCCTGGCA GAATTTTGCG GCGTATGTGC CCTTAACCCA CGACTATGAC GGTGCACCGG AACAATTACC GATTCAAGAG GTATTGGCAA AATTGAAACC GCTTCTGGAA AACCCAGCCA TCAAGAAAGT TGGCCAAAAT TTTAAATATG ACTGGCACAT TTTCAAAAAT GCGGGCATTG AAGTCCAAGG CATGGCGTAT GACACCATGC TGGAATCTTA CTGTTTTAAC AGTGTTGCAA CACGCCACAA TATGGATGAT TTGGCGTTGA CGTATCTAAA CCACAGCACC ATTCATTTTA AAGACATTGC CGGTACAGGA AAAAAACAAA AAACCTTTAA TCAGATTGAA CTGGAAACAG CTTCACCTTA TGCCGCAGAA GATGCGGATA TCACGCTGCA ACTGCATCAA ACATTGCTAC CGAAACTGCA AGCCGAACCG ACTTTATATA AGGTCTTTGA AGAAATTGAA ATGCCATTAA TGCCTGTCTT GGCAAAAATG GAACGAAATG GCGTGTTGAT TGATCGTCAA ATGTTGGCAG ACCAGTCTTA TGAACTGGGA CAAAAACTCA CTGAGCTGGA ACAAAAAGCC CATTTAATCG CTGGCACACC GTTTAATTTG AATTCCTCCA AACAATTACA GGAAGTGCTG 
TTTGAACGCC TGGAGCTTCC TATCGTGAAA AAAACCCCGA AAGGTCAACC TTCCACCGCG GAACCGGTGC TGGTGCAGTT GGCGGAAGAC GGTCACGAAA TGCCAAATTT AATTCTCGAA TATCGTAGTT TGGCAAAATT AAAATCAACC TACACCGATT CTTTACCAAA GCAAATCAAT CAACAAACGG GACGCGTCCA TACTTCTTAT CAACAAGCGG TGGCCTCAAC TGGGCGTTTA TCTTCAACAG AGCCCAATTT ACAGAACATT CCGATTCGAA GCGCCGAAGG ACGACGTATC CGTCAGGCAT TTATTGCCCA ACCAGGTTAT CGCTTAATGG CCTCGGATTA CTCGCAAATT GAATTGCGCA TCATGGCGCA TTTATCGGGC GATGCGAGTT TATTAAAGGC TTTTGCCGAG GGCAAAGACA TTCACCAGGC GACCGCGGCC GAAATTTTTA ACATGCCGTT AGAAGAGGTC ACATCCGAAC AACGCCGCAG TGCGAAAGCG GTCAACTTCG GATTGATTTA TGGCATGTCA GCCTTTGGGT TAGCAAAACA ACTCAACATC AGCCGAGGGT TGGCACAAGA ATACATCAAC CTTTATTTTG CGCGTTATCC CGGCGTGGCA AATTATATGG AATCCACAAA AGAAAACGCC AAACAAACTG GCTATGTTGA AACCTTAATG GGTCGTCGAC TGTATTTACC AGATATTAAT GCCAAAAATG GTCAATTAAG ACAATATGCG GAAAGAACCG CCATCAACGC ACCGATGCAA GGCACGGCCG CCGACATCAT CAAAACTGCC ATGGTGAAAA TGCAGCAATG GCTCGATCAA ACCCCGTGTG ATATCAAAAT GTTAATGCAA GTGCACGATG AATTGGTTTT TGAAGTGGCT GAAGCCGATA TGGATCAAGC CAGAAAGGAA ATCAAGACCA TTATGGAAGC CGCTTTAAAG CTCGATGTGC CATTGATTGT TGAAATTGGC GACGGCCTAA ATTGGGATGA AGCACACTGA
|
Protein sequence | MTQILPEFNP DSPFILVDGS SYLFRAFHAM PPLTNTKGHA TGAIFGVINM IGKLLEQYQP ERIAVVFDAK GKNFRHELYE DYKAHRPPMP DELRIQIEPI HEIIKALGIP LLVIDGVEAD DVMGTLAHQA TQAKMDALLS TGDKDMAQLV NEHITLVNTM NDTLMTPDSV FEKFHVKPEQ IIDYLALMGD SSDNIPGIPK CGPKTAAKWI SEYGSIDNLI EHAEEIKGKI GENLRANLEQ LKLSQQLTTI RLDCDLPIAL NDIKRHEADM EALEALFSEY DLRNWLTRVL KGELPFSKSS GRKAHSETVT NGQSKKANTS PSTETPKADS QPYETIVDWH TFDQWLKKLE ASDVFAIDTE TTSLNAMEAK IVGVSFAYAE KKDQAWQNFA AYVPLTHDYD GAPEQLPIQE VLAKLKPLLE NPAIKKVGQN FKYDWHIFKN AGIEVQGMAY DTMLESYCFN SVATRHNMDD LALTYLNHST IHFKDIAGTG KKQKTFNQIE LETASPYAAE DADITLQLHQ TLLPKLQAEP TLYKVFEEIE MPLMPVLAKM ERNGVLIDRQ MLADQSYELG QKLTELEQKA HLIAGTPFNL NSSKQLQEVL FERLELPIVK KTPKGQPSTA EPVLVQLAED GHEMPNLILE YRSLAKLKST YTDSLPKQIN QQTGRVHTSY QQAVASTGRL SSTEPNLQNI PIRSAEGRRI RQAFIAQPGY RLMASDYSQI ELRIMAHLSG DASLLKAFAE GKDIHQATAA EIFNMPLEEV TSEQRRSAKA VNFGLIYGMS AFGLAKQLNI SRGLAQEYIN LYFARYPGVA NYMESTKENA KQTGYVETLM GRRLYLPDIN AKNGQLRQYA ERTAINAPMQ GTAADIIKTA MVKMQQWLDQ TPCDIKMLMQ VHDELVFEVA EADMDQARKE IKTIMEAALK LDVPLIVEIG DGLNWDEAH
|
| |
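The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the sequence is pasted as space-separated 10-base blocks, as displayed above, and uses the standard codon assignments, which translation table 11 shares with the standard code for elongation codons. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tooling.

```python
from itertools import product

# Codon table in TCAG order; the last base varies fastest.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First six blocks of the gene sequence in this record.
head = "ATGACTCAAA TCCTTCCTGA ATTCAATCCA GACAGTCCTT TTATCTTAGT TGACGGCTCG"
print(translate(head))
```

Applied to the first 60 bases, this yields the first 20 residues of the listed protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.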