Gene GM21_3672 details

Gene Information

Locus tag: GM21_3672
Symbol:
ID: 8139046
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4246150
End bp: 4247667
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 61%
IMG OID: 644871293
Product: hypothetical protein
Protein accession: YP_003023451
Protein GI: 253702262
COG category:
COG ID:
TIGRFAM ID: [TIGR01630] phage uncharacterized protein (putative large terminase), C-terminal domain
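
The coordinates and lengths in the Gene Information table can be cross-checked with a few lines of arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4246150
end_bp = 4247667
gene_length_bp = 1518
protein_length_aa = 505

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# A CDS is read in triplets; the last triplet is assumed to be the
# stop codon, which does not contribute a residue to the protein.
total_codons = span_bp // 3
coding_codons = total_codons - 1

print(span_bp, total_codons, coding_codons)  # 1518 506 505
```

Under these assumptions the span matches the listed gene length of 1518 bp, and 506 codons minus one stop codon gives the listed protein length of 505 aa.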


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 169
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTAAGC GCCCAAACCT CACAGAAGGC CAGTTCGACC AGCAGGTCGA CGAGCTCAAG 
AAGTGGATCC GCGAGAGCGT CTCCCCCTTC GAGAACGACA CGCCGGCCAA GAAGAAGGCG
CGCATCGAGC GCGGCAAGAC GGATCTGCTG TTCTTCTTCC AGACCTATCT GCCGCACTAC
TTCATCTGCG CCTTTGGACC GGAGCTGCAT CCGGAGTGGG AAGAGGCGAC CCAACTGCAG
GACCAGCTCG CCCTGATCGG CGCGTTCCGC GAGGGCGCCA AGTCGACCTT CTTCACCCTG
GGCAATCCGG TGCACAAGAT CTGCTACGGG CTCAAGCGCT TTATCTGGCC CTGCTCCGAT
ACCCATGAGC AGGCAGAGTC TTTCAGCACC CAGATCAAGC TGGAGCTGGA GGAGAATCCG
CGTATCCGTC ACGACTTCGG CAACCTGAAG ACGAAGACCT GGAGCAACGA CGAGTTCGAG
ACCAGCAACG GCGTCAAGGT GCTCGCCCGC GGCCGTGGAG ACAAGGTTAG GGGCATCAGG
TACCGGCAGT ACCGCCCGGA CATGGTCGTA TTCGACGACA TGGAGAACGA CGAGACGGTA
GAGAATCCGC GCACCACCAA GAAGATCTTG AACTGGATTC GCGGCGCCGT GCTCGGCTCC
TTGGGCAAAG GCTACTCGGC CATCATGGTC GGCAACCTGT TCCACCCGCT CTCGGCCATC
TGCCAGTTGA TCGCCGACGT GGATGACGAG GGGGAGAAGC GCTACTTCTC CAAGGTCTAC
GCTTTGATCC TGGACGAGGG GGGGCCGAAC GAGAGATCGG CATGGCCTGC CAACTGGCCC
ATGGAGCGCA TCACCAGGAA GCGCCGCGAC GTAGGCTCCT ACACCTTCAA CAAGGAGTAC
ATGAACAAGG TAGGGACCGA CGACACGCCG TTCCCCGAGG AGCAGGTGAA GTGGTACCAG
AAGATCGAGG TGGTCAACAG AAAGCTCATC TTCTGCACCG CCATCGACCC CTCGGCGACC
GCCACCAGCG GCAGCGACTA TCGCGCCGTG GTCACCTACG GCTTCGACCC GCAGGCCATG
CTTTTCCCCT GCATGCACGC CTGGATCAAG AAGCGCTCCA TCAACGAGAT GTTGGCCGCG
GCCTACCAGC AAAACGACCA GTACCCGGGT GTGGTGGCCA TCGAAGACAA CATGCTGAAA
GACTTCCTGC ACCAGGCGAT TCACAACTAC GCCAAGGAAG TCGGCCGCTA CCTCCCCTGG
GCGCCGATGC AGCACTCGAC CAACAAGATC GGCCGCATCG TAGGTACCTG CAGCTACCTC
TGGGAGCACG GCAAGATGCA ATTCGAGAAG GGGCATAGCG ACCAGGCGAA GCTCATCGAG
CAGTTCGTCT ACATCTACAA CGCCACCGTC AACGACGACG GGCCCGACGC GGCGGAGATG
GCCATCAGCA AACTCCAGGC GGGTCTGGGG ATTAAAACCA CCGACGCCCT TCCGGCGTTC
GCAGGAGCAG CAGCATGA
 
Protein sequence
MRKRPNLTEG QFDQQVDELK KWIRESVSPF ENDTPAKKKA RIERGKTDLL FFFQTYLPHY 
FICAFGPELH PEWEEATQLQ DQLALIGAFR EGAKSTFFTL GNPVHKICYG LKRFIWPCSD
THEQAESFST QIKLELEENP RIRHDFGNLK TKTWSNDEFE TSNGVKVLAR GRGDKVRGIR
YRQYRPDMVV FDDMENDETV ENPRTTKKIL NWIRGAVLGS LGKGYSAIMV GNLFHPLSAI
CQLIADVDDE GEKRYFSKVY ALILDEGGPN ERSAWPANWP MERITRKRRD VGSYTFNKEY
MNKVGTDDTP FPEEQVKWYQ KIEVVNRKLI FCTAIDPSAT ATSGSDYRAV VTYGFDPQAM
LFPCMHAWIK KRSINEMLAA AYQQNDQYPG VVAIEDNMLK DFLHQAIHNY AKEVGRYLPW
APMQHSTNKI GRIVGTCSYL WEHGKMQFEK GHSDQAKLIE QFVYIYNATV NDDGPDAAEM
AISKLQAGLG IKTTDALPAF AGAAA
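
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, written in the conventional TCAG codon order, and it makes one simplification that is labeled in the code: the first codon of a CDS is always reported as M, which is how the GTG initiator here comes to be shown as methionine. The function name `translate_cds` and its `is_start` parameter are hypothetical helpers for this example only.

```python
# Amino-acid assignments for translation table 11, listed in TCAG order
# (first base varies slowest, third base fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str, is_start: bool = True) -> str:
    """Translate a DNA fragment codon by codon (hypothetical helper).

    Whitespace between the 10-base blocks is ignored. If is_start is True,
    the first residue is reported as 'M'. This is a simplification: it does
    not check that the first codon is a valid initiator.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    residues = [CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3)]
    if is_start and residues:
        residues[0] = "M"
    return "".join(residues)


# First and last lines of the gene sequence listed above.
first_line = "GTGCGTAAGC GCCCAAACCT CACAGAAGGC CAGTTCGACC AGCAGGTCGA CGAGCTCAAG"
last_line = "GCAGGAGCAG CAGCATGA"

print(translate_cds(first_line))                 # MRKRPNLTEGQFDQQVDELK
print(translate_cds(last_line, is_start=False))  # AGAAA*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final five residues followed by the stop codon. The last line is in frame because the 25 preceding lines hold 60 bases each, a multiple of three.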