Gene GM21_1003 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1003 
Symbol 
ID8136325 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1182621 
End bp1184135 
Gene Length1515 bp 
Protein Length504 aa 
Translation table11 
GC content57% 
IMG OID644868617 
Productputative DNA packaging protein GP17 (terminase) 
Protein accessionYP_003020825 
Protein GI253699636 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones112 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGCGA CAGAACTTGA AGAGAAGCTA GGAAGCCGCC TGTGGCGGCT CAACAACCTC 
TACAAGGTGA TCGACAAGGA CGGCAACCTG ATCCAGTTCC GGCTGAACCA CGTACAGAAA
GAGCTTCTGA AAAACCTCTG GTATCTGAAC GTGATCCTGA AGAGCAGGCA GCAGGGTATC
ACCACTTTCA TCTGCATCCT ATTCCTCGAC CTTGCCCTTT TCTCCGAGAA CATCCACTGT
GGCATCGTGG CGCACCGCCT CCCCGACGCA CAGACATTCT TCGATGACAA GATACGCTTT
GCCTACGACA ACCTCCCCGA GGAGATCCGG GAGCGGATCA CGCTGGTCAA GGACACGTCG
ATGAGGCTGG TCTTTTCCAA CGGCTCCAAA ATCACTGTGG GGGTATCGCT TCGTTCTGGT
ACCTACCAAT ACCTCCACAT TTCCGAGTTG GGTAAGATCT GCGCTCAGTT CCCCAAGAAG
GCGCTGGAGA TCAAATCAGG GGCGCTGAAC ACCATCAAGG CTGGAAACTA CATTTTCATC
GAGAGTACGG CAGAAGGTCG CAGCGGGGAT TTTTATAGCT ACTGCCAGTT GGCCGAGAAC
GGCCTTCGCG AGGGGAAGGA ACTGACCCTG CTTGACTGGC GCTTCCACTT TTTTGGCTGG
ACGCTTGACC CTGCGGCTAG GCTCAACCCC GAGGGAGTGA CTATAACTCA GGAGCTCCAC
GAGTATTTCC GGGAGCTGGA GGTGAAGCAC GGCCTCGTCA CCGACGACTG GCAGCGGGCT
TGGTATGCGA AGAAGCTACT GCAGCAGGGT GTAGACATGA TGTACCGCGA ATACCCCGCC
ACCTCCGAGG AAGCCTTCTT CGCGGCGATC CACGGAGCAT ACTACAAGCA GCAAATGCTG
AAACTCAGAC AGCAGGTTCC CCCCCGCTTC GCGACGGTTC CCTGGGAGCC CAAGCTCCCG
GTAGACACCG CCTGGGATTT AGGCATGGAC GACACCACCT GCATCGTGTT CCGCCAGAGG
TACGGCACCC AAAATAGACT CATTGATTAT CATGAGAACA GTGGAGAGGG GCTGCCCCAC
TACGTCAAGG TACTGCAGGA CAAGCCATAC ACCTACGGTC GCCATTACCT GCCGCACGAC
TCCAAGGTGC GGAGCCTCAA CGATGCAGTT TCCCGCGAAG ACAAGCTGTA CGAGCTAGGA
CTGCGCAATC TGGTGATCGT GGAGCGGACC CGCGACATCG AAGACGGAAT CGAGGAGGTG
CGGAGCTTCC TCGCCTCCTG TTGGTTCGAC CAGGAGACGT GTCAGCGGCT GATTAACGCC
CTAGATGAGT ACAGGAAGAA GTGGAACGAC ACCACAGGAG CCTTTGCCAG CCAGCCCTTG
CACAACTGGG CCAGTAACCC CGCCGACGCC TTCCGCTGTC TTGCCTGCGG GATCTCGTCC
AATGAACGGG GAGACAGTGG CAACGACTTC CTGGGGCGCG GTCGTGAGCG CGGCGGTAGT
TGGAGGACGG CATGA
 
Protein sequence
MKATELEEKL GSRLWRLNNL YKVIDKDGNL IQFRLNHVQK ELLKNLWYLN VILKSRQQGI 
TTFICILFLD LALFSENIHC GIVAHRLPDA QTFFDDKIRF AYDNLPEEIR ERITLVKDTS
MRLVFSNGSK ITVGVSLRSG TYQYLHISEL GKICAQFPKK ALEIKSGALN TIKAGNYIFI
ESTAEGRSGD FYSYCQLAEN GLREGKELTL LDWRFHFFGW TLDPAARLNP EGVTITQELH
EYFRELEVKH GLVTDDWQRA WYAKKLLQQG VDMMYREYPA TSEEAFFAAI HGAYYKQQML
KLRQQVPPRF ATVPWEPKLP VDTAWDLGMD DTTCIVFRQR YGTQNRLIDY HENSGEGLPH
YVKVLQDKPY TYGRHYLPHD SKVRSLNDAV SREDKLYELG LRNLVIVERT RDIEDGIEEV
RSFLASCWFD QETCQRLINA LDEYRKKWND TTGAFASQPL HNWASNPADA FRCLACGISS
NERGDSGNDF LGRGRERGGS WRTA