Gene GM21_4118 details


Gene Information

Locus tag: GM21_4118
Symbol:
ID: 8139492
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 4701828
End bp: 4703684
Gene Length: 1857 bp
Protein Length: 618 aa
Translation table: 11
GC content: 64%
IMG OID: 644871733
Product: hypothetical protein
Protein accession: YP_003023891
Protein GI: 253702702
COG category:
COG ID:
TIGRFAM ID:
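
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4701828, 4703684)
print(length)                           # 1857
print(expected_protein_length(length))  # 618
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of this record.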


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value: 0.41011
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGGA TGACTCCAGC AGATAACCAT AAGAAAGCGT CGTTGTTCCT TTTGCTGCTC 
GCACTCCTCG CGCTGCCTGG ATGCACCCGC GCCGGCAAGG TCAGCCAATG CGTGGTCTGC
CACCCGAAGA TCGAAAAGGT GTCGAAAAGC CATGCCGACT GCGTCTCCTG CCACGGCGGC
GACGCTTCCA TAAGGAACAA GCACGCCTCC CACCTGGCGA TGTACGGCCC CCGGAACCCG
GCGGCGCCGG AGCACTGGGA AAATACCTGC GGCTCCTGCC ACCAGTACCA GTTGGACCGG
GTGCGTTCCA ACCTCATGTA CACGACGACG GGGATGATCA AAAACATCCA GCTCACCTGG
GAGGGGCCGG AGGGGCTCTA CAGCAGCAGG GGAGGGGACG AGTACGATCC GGCGGGAAAA
CCGCGCCGGC TTAAGCCGGT GGCCGAACTC GACCATATCT CCGGCGAGTT GTACCGGAAG
TTCTGCTCGC AGTGCCACGT GGCCACGGAA AGCGGCGAGG TCTACGGTGC GAGCCACGCC
TCCGGCTGCG CCGCCTGCCA TTTCCCGTAC AACGACCGCG CCACCTACCA GGGGGGGGAC
GCTGCGGTGC GGGGGAAGGG GCCATATGCC GCGAGCCACG CCATGGAGAC GCTCCCGGGG
ACCGAGGTCT GCGCGCGCTG CCACAACAGA AGCGGACGGA TCGCCCTCTC TTACCAAGGG
CTCTACGACG GGAACAACTC GATGGTCCCC ACCAGAAACG GCCGGCCCGG TCCGGTGATG
ACCTCGGGGG GGCGCAACCT CACCCATATC GCCTCCGACG TCCATTTTGC CGCCGGCATG
GAGTGCATCG ACTGCCACAC CTCAAGGGAC ACCATGGGGG ACGGCTACGG CTACGAGAAC
ATGTACATGC AGACCGAGGT CTCCTGCGAG GACTGCCACG GCGGGGCGAG CCCCCCGCGC
TACGAGCGGA TAGCCGGCGA GAGCGACGAG GCCATCCGCG AATCGCGCGG CTACGCCATG
CAGATGCGCC AAGGGATGAA GATGATCCTC ACCGCCAAGG GGCGCAAGTA CTCCAACGTC
TTCTACCGCG ACGGCGCCGT CTGGGTGCTG GGAAAAAGAA GCGGCAAGCT CTTCAAAAGC
CGTGTGATCA CCGGGACCCC CGAGCACAGC GTGGCCGGCC ACGGCAGGAT GGAATGCTAC
TCCTGCCACT CCCGCACCGT TGTCCAGTGC TACGGCTGCC ACACCACCTA CGACAGGAGC
AAGCCGGGGA TGGATTACAT AGCCAAAATG GCGACCCCCG GGCGCTTCAG CGAGAAGGAA
GATTACCGGA TGCTCTACCC CTTCCCGCTG GCCCTGAACC AGCGAGGGAA GATCTCGACG
GTCACCCCCG GGTGCCAGAC CTTCGTCACC GTGGTCGAGC CCGACCTCTC CGTCTCCAAG
GACGAGTACG TCGCCAGGTT CAAGGGGAAA AAGCAGCTGC GCTTCGCCCC CTTTTACTCG
CACAACACCG GAAAGAAGGC GATCGGCTGC GGCGAATGCC ACGGCAACCC CGCCTTTCTA
GGCTTCGGGC AGCACGTGGT CTCGGGGGGG GATATAGAGG GGACCCTGAT CTGCGAGCAG
TCCGCCGACA AGCCCTTGGA CGGCTTCCTC ACCCTGCAGG GGGGTAAGGT GCGCGCCTAT
TCCGCCATCA CCCGGGAGAG CTCGCGGCCG CTGAACGGGG CGGAGGTGCG GCGGGCGCTG
TCGGTGAACC TCTGCCTGGT CTGCCACGAA AAGGCCAAAG ACCCGATCTA TCGAAAGGAG
CTGGATTATC GTGCGCTCAA TGATGCTCTG CATCGTCGCC TGCTTTCTGC TCCTTAG
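
A GC content figure such as the one listed under Gene Information can be recomputed from a sequence given in the blocked layout above. This is a minimal sketch: the helper name gc_content is hypothetical, and it assumes the sequence contains only A, C, G and T once whitespace is removed.

```python
def gc_content(blocked_sequence: str) -> float:
    """Fraction of G and C bases in a sequence that may contain spaces or newlines."""
    seq = "".join(blocked_sequence.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Example on the first 60 bases of the gene sequence; pass the whole
# sequence to get the gene-level value.
first_line = "ATGGACGGGA TGACTCCAGC AGATAACCAT AAGAAAGCGT CGTTGTTCCT TTTGCTGCTC"
print(f"{gc_content(first_line):.0%}")
```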
 
Protein sequence
MDGMTPADNH KKASLFLLLL ALLALPGCTR AGKVSQCVVC HPKIEKVSKS HADCVSCHGG 
DASIRNKHAS HLAMYGPRNP AAPEHWENTC GSCHQYQLDR VRSNLMYTTT GMIKNIQLTW
EGPEGLYSSR GGDEYDPAGK PRRLKPVAEL DHISGELYRK FCSQCHVATE SGEVYGASHA
SGCAACHFPY NDRATYQGGD AAVRGKGPYA ASHAMETLPG TEVCARCHNR SGRIALSYQG
LYDGNNSMVP TRNGRPGPVM TSGGRNLTHI ASDVHFAAGM ECIDCHTSRD TMGDGYGYEN
MYMQTEVSCE DCHGGASPPR YERIAGESDE AIRESRGYAM QMRQGMKMIL TAKGRKYSNV
FYRDGAVWVL GKRSGKLFKS RVITGTPEHS VAGHGRMECY SCHSRTVVQC YGCHTTYDRS
KPGMDYIAKM ATPGRFSEKE DYRMLYPFPL ALNQRGKIST VTPGCQTFVT VVEPDLSVSK
DEYVARFKGK KQLRFAPFYS HNTGKKAIGC GECHGNPAFL GFGQHVVSGG DIEGTLICEQ
SADKPLDGFL TLQGGKVRAY SAITRESSRP LNGAEVRRAL SVNLCLVCHE KAKDPIYRKE
LDYRALNDAL HRRLLSAP