Gene GM21_4012 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4012 
Symbol 
ID8139386 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4597573 
End bp4598610 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content62% 
IMG OID644871628 
ProductGeneral secretory system II protein E domain protein 
Protein accessionYP_003023786 
Protein GI253702597 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value3.66345e-31 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCGGCAC GGCTTGGCGA GATGTTGCTG AAGGTTGGAA CCCTGACGGA GGACCAGCTG 
GAGCAGGTGC TTAACGCGCA GTCCATCTAT GGCGGCAGGC TCGGGACAAA TCTCGTGGAG
ATGGGGTTAG TCGAAGAGGA GGAATTGGCG CGCCTATTGA GCGAGCAGCT TGGTGTACCC
TGCGCCCACC CCTCAGAACT CAGTTCCATT CCGGAATCCC TATTGAAGAT GTTCCCGCTG
GAGCTGGTGC AGCGCTACCG CGTGCTCCCC CTCGCTCTGG ACGGCAAGCG GCTCACCGTG
GCCATGACGA ACCCTTCGGA TTTCAAGGCG CTCGAAGACA TCGCTTTTGT TACCGGGATG
ATCATCATCC CAAGGGTCTG CTCCGAACTC CGGTTGAGCA TCGCGCTGGA GCGAATCTTC
GGGGTAAAGC GCCCCATGCG TTACATCCCT GTGGAGGGAG GTGCTAGGAG CCGCTTCGCC
GCCACCCTCG CTGAGCGGGG GAGCGCGGAC CCCGCCTGGG ATGGCGGCGC AGTTTGCCAT
ACTTCCGAGC GAGTCAGTCT GGAGGATCTG TCCGAACGCC TGGCAAAAGC CGTCGGCGAG
TCCGAGGTTG TCCAGGCTGT TCTGTCCTAT CTGGCGGGTG AATTCGACCG GGGTGCCTTC
CTGAGGCTGA AAGGGGGGTG CGTGCACGGG GTTCAGGCAG TGGAGGCCGG CTCACCGGTG
AAAGGCTTCC CGTTTTTTGC CGCGGCGATG GCTGACACGA GGCAGTTGAA ACGGGTGGTC
GAGGAAAGGC GGCTCTTTCT AGGTGAGCTG GAACCGGATC AGGGCGAAGG GCTGTTGCTG
AGGGCGATGG GGGGTAAGGT TCCCGGGTCG GCGCTGCTGG TGCCTGTGGC GCTTGGCGGG
CAGGTGGTGG GGGTCATCTG CGCCAGCGAT CAGAGGGGGC GACTCGGCGG TGGCGTCTTC
GAGCTGCAGC GGGTCGCGGT GATGGCAGAG TTGAGCTTCG AGATGCTGTC GCTCAAGAAA
AGGATCATGA CCGTGTGA
 
Protein sequence
MSARLGEMLL KVGTLTEDQL EQVLNAQSIY GGRLGTNLVE MGLVEEEELA RLLSEQLGVP 
CAHPSELSSI PESLLKMFPL ELVQRYRVLP LALDGKRLTV AMTNPSDFKA LEDIAFVTGM
IIIPRVCSEL RLSIALERIF GVKRPMRYIP VEGGARSRFA ATLAERGSAD PAWDGGAVCH
TSERVSLEDL SERLAKAVGE SEVVQAVLSY LAGEFDRGAF LRLKGGCVHG VQAVEAGSPV
KGFPFFAAAM ADTRQLKRVV EERRLFLGEL EPDQGEGLLL RAMGGKVPGS ALLVPVALGG
QVVGVICASD QRGRLGGGVF ELQRVAVMAE LSFEMLSLKK RIMTV