Gene GM21_0077 details


Gene Information

Locus tag: GM21_0077
Symbol:
ID: 8135376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 100819
End bp: 102669
Gene Length: 1851 bp
Protein Length: 616 aa
Translation table: 11
GC content: 60%
IMG OID: 644867694
Product: hypothetical protein
Protein accession: YP_003019922
Protein GI: 253698733
COG category:
COG ID:
TIGRFAM ID:
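
The start, end, gene length, and protein length fields above are consistent with one another, which a short sketch can check. It assumes 1-based inclusive coordinates and a CDS that ends in a stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residues encoded by a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(100819, 102669)
print(length_bp, protein_length(length_bp))  # prints "1851 616"
```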


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 1.09082e-34
Fosmid Hitchhiker: No
Fosmid clonability: unclonable
 

Sequence

Gene sequence
ATGGTCCACG TCTTCACACG CTTCATCCCT CTGCTACTGC TCCTACTGCT AACCCAGCCG 
GCGTTCGCCG CTCAAGGGCT GGCGATCGAC CCGGCTACCT GCCTCGGCTG CCATGGAGAT
AAGATCTCCG CAGAGAAGAT GGCAGTGTCG GTGCACGGCA AAAACGGCTG TACCAGCTGT
CACGTCGAGA TCGTCGAACT CGCCAAGCAC ATGAGGGGCG AGATCAAGGT CGGCAAAGTC
AACTGTGCCC GTTGCCACAA GAAGGAAGCC GCCGAGCATG CTCTGAGCGT CCACACCGAG
AAGGGTGTCC AGTGCGCCGG TTGCCACACC GACATGCACA CCCACACCTC CTGGAACAAG
GACAAGCGCC GGGTGCTCGC CAAGTGCGTG CAGTGCCATG CCGACGAGAA GGGCTTCATC
ACCTCGGTCC ACGGCAAGGG AGTAATCGCC GGAAACCAGG ACTCCGCCGC TTGCAACGAC
TGCCATAACC TGCACGAGAT CAAGGCGCTG GGCGATCCGA ACTCGCACAC CAACCGCGAA
TTCCACACCA AGGTCTGTCT CCGCTGCCAT GCCGACGAGA AGCTGGTCGA GCGCAACCAT
ATCTCCGAAG TGGCCGTGAA AAGCTACATG GAGAGCTACC ACGGCAAGAA CTACCGCCTC
GGCTATCCCG AGAAGGTGGC GGGCTGCGCC GACTGCCACA CGGCGCACGG CATCCTCCCC
TCCAAGGATC CGAACTCCTC GGTCAACGAG AAAAACCTCG TGCAGACCTG TTCCAAGTGC
CACGAGAATA CCAAGACGCC GTTCACCAAG TTCTACGCCC ACGGCGAGCA CGGCGACCGC
GAGAAGTACC CGATCCTTTT CTACACCTTC ATCGCCATGA CCGGCCTGCT GGTCGGCACC
TTCGCCGTAT TCTGGATTCA CACCCTGCTC TGGATGATCC GCGGCTTCGT TGAGAACCGC
GAGAAGGCGG CGGCACTCGA AGAAGGGGTT ATCCTGCACC ACGTGCCCGA AGGGCATAAG
CAGTACCGCC GCTTCAGAAG GGTCCATGTC TTCATGCACC TTCTGGTCAT CATCTCCTTC
CTCGGACTGT CGCTGACCGG TCTGCCGCTT AAGTTCAGCG ACCAGATCTG GGCGAAGTTC
CTGATGGACC TTTACGGCGG GGCCCCCAAC GCAGCCTTCT TCCACAGGGT TTGCGCGGGC
ATCACCTTCG TCTACTTCTC CATGGCGCTG GCCATGAGCA TCCACTTCCT CTTCATCAGG
AAGGACATCA AGGGCAACCC GCTCCAGAGG CTCTTCGGAC CTGACTCCCT CTGCCCGAAC
CTGCGCGACA TAAGCGACGT CGTCGGCATG GTCCGCTGGT TCTTCTTCAA AGGGCCGAAG
CCGGCCTTCG AAAGGTGGAC CTACTGGGAG AAATTCGACT TCATCGCGGT CTTCTGGGGT
ATGTTCGCCA TCGGCGGCTC CGGCCTCATG CTCTGGTTCC CCGAGTTCTT CGGCATGTTC
CTGCCGGGTT GGGCCTTCAA CGTAGCGACC ATCATCCACT CGGACGAGGC GCTCCTGGCG
ACCGGCTTCA TCTTCTCGGT CCACTTCTTC AACACCCACG GGCGTCCGGA GAAGTTCCCG
ATGGACTTCG TCATCTTCAA CGGCCAGATG TCCAAGCACG AGTTCGTCGA AGAGCGTGGC
GATCAGTGGG CACGCTACGA GAAGGAAGGG ATCACCGAGA AGTTCGCAGC CAAGAAGTCT
TCCGGCATCT TCTACGACTT CTGCCTGAAG GCCTTCGGCT TCACGGCGCT CTTCATCGGC
ATCACGCTGC TGATGCTGAT GATCTACGCC TTCATGCACC CGCACCACTA G
 
Protein sequence
MVHVFTRFIP LLLLLLLTQP AFAAQGLAID PATCLGCHGD KISAEKMAVS VHGKNGCTSC 
HVEIVELAKH MRGEIKVGKV NCARCHKKEA AEHALSVHTE KGVQCAGCHT DMHTHTSWNK
DKRRVLAKCV QCHADEKGFI TSVHGKGVIA GNQDSAACND CHNLHEIKAL GDPNSHTNRE
FHTKVCLRCH ADEKLVERNH ISEVAVKSYM ESYHGKNYRL GYPEKVAGCA DCHTAHGILP
SKDPNSSVNE KNLVQTCSKC HENTKTPFTK FYAHGEHGDR EKYPILFYTF IAMTGLLVGT
FAVFWIHTLL WMIRGFVENR EKAAALEEGV ILHHVPEGHK QYRRFRRVHV FMHLLVIISF
LGLSLTGLPL KFSDQIWAKF LMDLYGGAPN AAFFHRVCAG ITFVYFSMAL AMSIHFLFIR
KDIKGNPLQR LFGPDSLCPN LRDISDVVGM VRWFFFKGPK PAFERWTYWE KFDFIAVFWG
MFAIGGSGLM LWFPEFFGMF LPGWAFNVAT IIHSDEALLA TGFIFSVHFF NTHGRPEKFP
MDFVIFNGQM SKHEFVEERG DQWARYEKEG ITEKFAAKKS SGIFYDFCLK AFGFTALFIG
ITLLMLMIYA FMHPHH
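
The protein sequence can be rederived from the gene sequence, and the GC content recomputed, with a minimal sketch like the one below. It assumes the sequences are pasted as shown here, in space-separated blocks of ten, and it uses the standard codon assignments, which translation table 11 shares for internal codons. The helper names `clean`, `translate`, and `gc_percent` are illustrative only.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq):
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate a CDS from its first base up to the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not special-cased here;
    this gene starts with ATG. Characters other than A, C, G, T raise
    ValueError.
    """
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        c1, c2, c3 = dna[i:i + 3]
        index = 16 * BASES.index(c1) + 4 * BASES.index(c2) + BASES.index(c3)
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last codons of the gene sequence above.
print(translate("ATGGTCCACG TCTTCACACG C"))  # prints "MVHVFTR"
print(translate("CACCACTA G"))               # prints "HH"
```

Passing the full gene sequence to `translate` and comparing the result with `clean` applied to the protein sequence gives a direct consistency check between the two blocks.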