Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GM21_1003 |
Symbol | |
ID | 8136325 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter sp. M21 |
Kingdom | Bacteria |
Replicon accession | NC_012918 |
Strand | + |
Start bp | 1182621 |
End bp | 1184135 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 644868617 |
Product | putative DNA packaging protein GP17 (terminase) |
Protein accession | YP_003020825 |
Protein GI | 253699636 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 112 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCGA CAGAACTTGA AGAGAAGCTA GGAAGCCGCC TGTGGCGGCT CAACAACCTC TACAAGGTGA TCGACAAGGA CGGCAACCTG ATCCAGTTCC GGCTGAACCA CGTACAGAAA GAGCTTCTGA AAAACCTCTG GTATCTGAAC GTGATCCTGA AGAGCAGGCA GCAGGGTATC ACCACTTTCA TCTGCATCCT ATTCCTCGAC CTTGCCCTTT TCTCCGAGAA CATCCACTGT GGCATCGTGG CGCACCGCCT CCCCGACGCA CAGACATTCT TCGATGACAA GATACGCTTT GCCTACGACA ACCTCCCCGA GGAGATCCGG GAGCGGATCA CGCTGGTCAA GGACACGTCG ATGAGGCTGG TCTTTTCCAA CGGCTCCAAA ATCACTGTGG GGGTATCGCT TCGTTCTGGT ACCTACCAAT ACCTCCACAT TTCCGAGTTG GGTAAGATCT GCGCTCAGTT CCCCAAGAAG GCGCTGGAGA TCAAATCAGG GGCGCTGAAC ACCATCAAGG CTGGAAACTA CATTTTCATC GAGAGTACGG CAGAAGGTCG CAGCGGGGAT TTTTATAGCT ACTGCCAGTT GGCCGAGAAC GGCCTTCGCG AGGGGAAGGA ACTGACCCTG CTTGACTGGC GCTTCCACTT TTTTGGCTGG ACGCTTGACC CTGCGGCTAG GCTCAACCCC GAGGGAGTGA CTATAACTCA GGAGCTCCAC GAGTATTTCC GGGAGCTGGA GGTGAAGCAC GGCCTCGTCA CCGACGACTG GCAGCGGGCT TGGTATGCGA AGAAGCTACT GCAGCAGGGT GTAGACATGA TGTACCGCGA ATACCCCGCC ACCTCCGAGG AAGCCTTCTT CGCGGCGATC CACGGAGCAT ACTACAAGCA GCAAATGCTG AAACTCAGAC AGCAGGTTCC CCCCCGCTTC GCGACGGTTC CCTGGGAGCC CAAGCTCCCG GTAGACACCG CCTGGGATTT AGGCATGGAC GACACCACCT GCATCGTGTT CCGCCAGAGG TACGGCACCC AAAATAGACT CATTGATTAT CATGAGAACA GTGGAGAGGG GCTGCCCCAC TACGTCAAGG TACTGCAGGA CAAGCCATAC ACCTACGGTC GCCATTACCT GCCGCACGAC TCCAAGGTGC GGAGCCTCAA CGATGCAGTT TCCCGCGAAG ACAAGCTGTA CGAGCTAGGA CTGCGCAATC TGGTGATCGT GGAGCGGACC CGCGACATCG AAGACGGAAT CGAGGAGGTG CGGAGCTTCC TCGCCTCCTG TTGGTTCGAC CAGGAGACGT GTCAGCGGCT GATTAACGCC CTAGATGAGT ACAGGAAGAA GTGGAACGAC ACCACAGGAG CCTTTGCCAG CCAGCCCTTG CACAACTGGG CCAGTAACCC CGCCGACGCC TTCCGCTGTC TTGCCTGCGG GATCTCGTCC AATGAACGGG GAGACAGTGG CAACGACTTC CTGGGGCGCG GTCGTGAGCG CGGCGGTAGT TGGAGGACGG CATGA
|
Protein sequence | MKATELEEKL GSRLWRLNNL YKVIDKDGNL IQFRLNHVQK ELLKNLWYLN VILKSRQQGI TTFICILFLD LALFSENIHC GIVAHRLPDA QTFFDDKIRF AYDNLPEEIR ERITLVKDTS MRLVFSNGSK ITVGVSLRSG TYQYLHISEL GKICAQFPKK ALEIKSGALN TIKAGNYIFI ESTAEGRSGD FYSYCQLAEN GLREGKELTL LDWRFHFFGW TLDPAARLNP EGVTITQELH EYFRELEVKH GLVTDDWQRA WYAKKLLQQG VDMMYREYPA TSEEAFFAAI HGAYYKQQML KLRQQVPPRF ATVPWEPKLP VDTAWDLGMD DTTCIVFRQR YGTQNRLIDY HENSGEGLPH YVKVLQDKPY TYGRHYLPHD SKVRSLNDAV SREDKLYELG LRNLVIVERT RDIEDGIEEV RSFLASCWFD QETCQRLINA LDEYRKKWND TTGAFASQPL HNWASNPADA FRCLACGISS NERGDSGNDF LGRGRERGGS WRTA
|
| |