Gene GM21_0120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0120 
Symbol 
ID8135423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp152224 
End bp153288 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content68% 
IMG OID644867740 
Productshort-chain dehydrogenase/reductase SDR 
Protein accessionYP_003019964 
Protein GI253698775 
COG category[R] General function prediction only 
COG ID[COG0300] Short-chain dehydrogenases of various substrate specificities 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones76 
Fosmid unclonability p-value0.718494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTCT CCCGAAAGAA TTCGAATCTC CTCCCCATGC TCCTGCTGGG TCCGAGCGCC 
TTCTTGCTCT GGAGCCTGCG CCGGCGCGCC CGCCGCATGG ACTTCTCCGG CAGGAGCGTG
GTGATCTCGG GGGGCTCCCG CGGGCTCGGG CTGGAACTGG CCCGCCAACT GGGGAGGGAG
GGGGCGAAGC TGGTGCTCCT GGCCCGCAAC CAGGAGGAGC TGGAGCGGGC CCGCGCCGAA
CTCGCGCAAG CAGGCGCCGA CGTCCTCACC CTCCCCTGCG ACGTCGGTAG CCACCAGCAG
GTCGAGGAGG CGGTGACCGC GATCCTTGAG CTGCGCGGCA CCATCGACGT CCTGATCAAC
GTGGCCGGCG TGATCCAGGT GGCGCCGTTC GAGAACCTGG AGTTCAAGGA CTTCCAGGAA
TCGGTCGACG TGCACGCCTG GGGGCCGTAC CACCTGATGC GCGCCGTGGT GCCGCAGATG
CAGCGCCGGC GCACCGGGCG CATCGTGAAC ATCTCCTCGA TAGGGGGACT GGTCGCCGTC
CCGCACCTGT TGGCCTACAC CATGGGGAAG TTCGCCTTGA CCGGGCTCTC CGACGGCTTC
CGCGCCGAGC TTGCCAAGGA CGGCATCTAC GTCACAACCG TGGCGCCCGG GCTGATGCGG
ACCGGCTCCC ACGTCAACGC CCAATTCAAG GGGCAGTACC GCAAGGAGTA CGCCTGGTTC
GCCATTTCCG GCGCCAACCC CATGCTCTCG ACCGCGGCGC CCGCCGCCGC CAAAAGGATC
GTCGAAGGTT GCCGCTACGG CGAAGCCAGA GTCATCATCA ACTGGCCGGC GCGCCTGCTC
CATGCCGCCA ACGCGCTATT CCCCGGCCTC ACCTCCTTCG GCACCGGCAT CGCCGCGCGG
CTGTTGCCGG CCCCCTCGAA GGAACCGGAG GGGAGCGCGC CGCATCCGGG GTGGGAAAGC
CGCTCTCCGC TGGCGCCCTC CATGCTCACC CGCTCAAGCG ACCTGGCTAT CGAGCCGAAT
CACGAAGAGA TCGCCGCACC CCTGCCCCGC AAGGTGGCAG ACTGA
 
Protein sequence
MKFSRKNSNL LPMLLLGPSA FLLWSLRRRA RRMDFSGRSV VISGGSRGLG LELARQLGRE 
GAKLVLLARN QEELERARAE LAQAGADVLT LPCDVGSHQQ VEEAVTAILE LRGTIDVLIN
VAGVIQVAPF ENLEFKDFQE SVDVHAWGPY HLMRAVVPQM QRRRTGRIVN ISSIGGLVAV
PHLLAYTMGK FALTGLSDGF RAELAKDGIY VTTVAPGLMR TGSHVNAQFK GQYRKEYAWF
AISGANPMLS TAAPAAAKRI VEGCRYGEAR VIINWPARLL HAANALFPGL TSFGTGIAAR
LLPAPSKEPE GSAPHPGWES RSPLAPSMLT RSSDLAIEPN HEEIAAPLPR KVAD