Gene GM21_2730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2730 
Symbol 
ID8138073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3178405 
End bp3179616 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content64% 
IMG OID644870335 
Producthypothetical protein 
Protein accessionYP_003022524 
Protein GI253701335 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones118 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACCAGC CTAGCCGAGA ACTACTGACG AAACTGCTGC ACGCCTGGCA GGGGAACCCG 
AAAGGGGCGA GGAAGGTGTC CCTTTCTATC ACCAAGGCGC GCGCCGCCGC CTACTTCCAG
GCAGTTCTCC CCGAAGAAAA GGGTGCGCTG CACGCCGGCC TTGAGGAAGC AGCGGCGTCG
GGGGCCATCG CCTTGGAGTG GGGAAAAGGC TTCGAAAGTC ACATCCTCAG GCGGGTCGTC
TTGGTTGATG GAGCAGTGCT GGCCGAGTAC CTCGGCGTGC CGCTGGCGTC GCAGCAGGCC
GACCAGGCCC GGTCGGCGCT GGAGAGTTGC CTGGAAGAAC GAGAAAGCTG GATTGACCGC
TGGGTGGCAG AGCTATTGGA TCACTGGAGC CGGAACCAGG GGTTCAACGG CATCGCACCC
GGAGAGGTCG CCACGGCCAC CCTCCTGGTC AGGGCTCTTT CGGCAGTGGC CGCCGGACGG
CAGCGCAACC TCGACCTGAG AACCTTCAGC ACCCGCGAAT TGGGAAACTC GAAGGCGATG
GAGTCGATCC TGGCGAAGTT CGCCTCGATC TGGAAGAAGC ATCACGCAGC CGACTATCCC
GCGGAGTTGA CCAACGAAGA ACTCTTCGAG GCTATCGGGC TCGTTAAATT TCCCCAGCCG
CTGCTGTTGC GCGGTCCCCT TACACTCAGG CTTGCCGGTC GCGACGTCGA TTGCGAGGGG
ATCGAGCCGT TCGTGGGGCT TCCGCCCCAA GCCATGCTGG ACGTCCTGGC CGACCAGCGA
CCGGAGTACT GCCTGACCAT TGAAAACCTC GCCAGCTTCA ACCGCTACAC GACGGAGGTC
CGCGACCGTG GGGTGATCGT CTTCACGTCC GGCTTCCCGT CGCCGGGCGT CGCGGATTTC
CTGCGCCTGC TCGACCGGGC TCTGCCGGCG GCAATCCCTT TCTTCCACTG GGGAGACATC
GACGAGGGAG GATTGAAGAT TTTCTTGTAC CTGCAGGGGC TGGTAAAGAG GGGGGTGCAG
CCGCACCGGA TGACCCCGGA ACTCCTGACG GCGAAGGGGC AGCCTTCGCC CGGATTGCGG
CGGCGGGAAG TGGGGCGGCT GATCGCCGAT GACAGGACTG TAGCCCTCCT CGCCGAGGCG
ATCCTCTCAA CGGCCCCTGC CAGAATTCTG GAACAGGAGA ACATAGACCC GGTCGCTCCT
TCCGTGGCCT GA
 
Protein sequence
MNQPSRELLT KLLHAWQGNP KGARKVSLSI TKARAAAYFQ AVLPEEKGAL HAGLEEAAAS 
GAIALEWGKG FESHILRRVV LVDGAVLAEY LGVPLASQQA DQARSALESC LEERESWIDR
WVAELLDHWS RNQGFNGIAP GEVATATLLV RALSAVAAGR QRNLDLRTFS TRELGNSKAM
ESILAKFASI WKKHHAADYP AELTNEELFE AIGLVKFPQP LLLRGPLTLR LAGRDVDCEG
IEPFVGLPPQ AMLDVLADQR PEYCLTIENL ASFNRYTTEV RDRGVIVFTS GFPSPGVADF
LRLLDRALPA AIPFFHWGDI DEGGLKIFLY LQGLVKRGVQ PHRMTPELLT AKGQPSPGLR
RREVGRLIAD DRTVALLAEA ILSTAPARIL EQENIDPVAP SVA