Gene GM21_4131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_4131 
Symbol 
ID8139505 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp4721765 
End bp4722955 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content63% 
IMG OID644871746 
Productdomain of unknown function DUF1745 
Protein accessionYP_003023904 
Protein GI253702715 
COG category[S] Function unknown 
COG ID[COG3287] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones80 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGAACTT TCGCAGGCGT TGGCTTCAGC CTTGACAAGA ACCCGGCTGA CGCGGGTAAG 
GAAGCGGCTC TGCAGGCGAT GCAGCGGGGC AGGATGGCCA AACCGGATTT CCTCTTCGTC
TTCGCCACGG TGGGGTACGA CCAGGAGCTT CTGCTACGCT CGGTCCGCGA CGCCACCGCC
GGCGCCCCCT TGAGCGGCTG CTCGGGGGAG GGTGTGATCA CGGCTGGAGC CGCCGCCGAA
ACCAACTTCG GGGTCTGCGT CCTCGCCATA GCCTCGGATG AGGTCCGCTT CGCCAACGCC
TACGTGCAGG GGCTAGATGC CGGGTACGCC CGGGCCGGGG AGTTGCTGGG GGAGCAGGTC
CGCCCGCTGC TGGATGACGA CGCCGTCGCC TGCTTCCTCT TCGCCGACGG CCTCCTCTTC
GACTTCGATC CCTTCCGGGA CGCCTTCGAA ACCGCTCTTG GTTGCGGAAG GGCGCTCCCC
ATGTTCGGAG GGCTTGCCGC CAACAACCTC TCCACCCGCA GGACCTACCA GTACCACGAC
GACCAGGTGA TCTCCGAAGG GATCTGCTGC GTGGTCATGT CCGGCAACGC CAGGCTCGCC
TGGGGGGTGA ACCACGGCTG CGTGCCGGTG GGGACGCGGC GCACCATCAC CCGCTGCCAG
GGCAACATCA TCTACGAGAT CGACGGCATC CCTGCGCTGG AAGCGTTGAA GGAGTACATC
GAGAACGGCT CCGGCAGCGA CTGGAACAAG GTAACCCTCA ACCTCTGCCT CGGCTTCAAG
ACCCCCGAAC ACCTGAGGAA GGAGTACGGC GAGTACATCA TCCGCTACAT GATGGACAAA
AACGACCAAG AGGGGTGGGT AAGCATCCAG TCGGATGTGA CCCAGGGAAG CGCTCTCTGG
ATCATGAGGC GGGACAAGGA GCTGATGCGT GAAGGGCTGC AGACGATCTC GCGGCATATA
CGGGAGCAGT TGGGAGAGAG TCGGCCCAAG CTGGTGATGC AGTTCGAATG CATGGGGCGC
GGACGCGTGG TCTACCGCGA GCAGGAGAAG CTCGAGCTGC TCAAGTCCCT GCAAGATGAC
CTGGGGGCGG AACTACCCTG GATCGGATTC TACTCCTACG CGGAAATCGG CCCCGTCTCC
GGCTACAACT GCATCCACAA TTTCACGTCC GTGCTGTTGG CCGTGTACTG A
 
Protein sequence
MGTFAGVGFS LDKNPADAGK EAALQAMQRG RMAKPDFLFV FATVGYDQEL LLRSVRDATA 
GAPLSGCSGE GVITAGAAAE TNFGVCVLAI ASDEVRFANA YVQGLDAGYA RAGELLGEQV
RPLLDDDAVA CFLFADGLLF DFDPFRDAFE TALGCGRALP MFGGLAANNL STRRTYQYHD
DQVISEGICC VVMSGNARLA WGVNHGCVPV GTRRTITRCQ GNIIYEIDGI PALEALKEYI
ENGSGSDWNK VTLNLCLGFK TPEHLRKEYG EYIIRYMMDK NDQEGWVSIQ SDVTQGSALW
IMRRDKELMR EGLQTISRHI REQLGESRPK LVMQFECMGR GRVVYREQEK LELLKSLQDD
LGAELPWIGF YSYAEIGPVS GYNCIHNFTS VLLAVY