Gene GM21_2131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2131 
Symbol 
ID8137467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2487249 
End bp2489240 
Gene Length1992 bp 
Protein Length663 aa 
Translation table11 
GC content62% 
IMG OID644869746 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003021941 
Protein GI253700752 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value0.000000140494 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAACATA CCTTTTTCGT TGTCCTCTTG TCCCTTTCCA TGGCCGTTTT CGGCTTCAGC 
TGTTACCGCC GGCTGGCGTT GGTGGCGATC GGGAAGTACG AATATCGCTT CGACCAGCCG
ACGGCCCGCC TCAAGGAGAT GCTCGTCTAC GCCCTCGGGC AGAAAAGGGT GGTGAGCCGC
CCCTTCGGCC TCAACCACGG GGTCATCTTC TGGGCCTTCC TGGTGCTAGC CGTGGCGAAC
CTGGAATTCC TGGTATCCGG GATCCTGCCG GGCCTCTCCT TCGCACTTTT GCCCGAACCT
TTGCACGGCG CTCTGGCGTT TGTCTTCGAT ATCTGCTCGC TGGCCGTACT TATCGCGGTG
GCTGTCGCCG CCGTGCGCCG CACAGTAAAG CCGCCGTTTG CCGGCGCGCG TACCTTCGAG
GCCTACTTCA TCCTCTCCAT GATCGCCATC CTGATGACGG CTTACTTCGG GATGAACGGA
GCCCTTATCG CTTCCGGCGC GCGCGCGGCC ACCTCCCTCA CTCCTGTATC CAACTTCTGC
GCGAGCCTGC TCTCTTCGAC ACCGGCCACT GACCATCTGG ATCTCGCCGC GAAGGTGTTT
TGGTGGATCC ACGCCGTGGT GTTGCTGGGG TTCATGAACT TCCTCCCCTA CAGCAAGCAC
ATGCACATCC TCACCGCCAT CCCCAACGTC TTCTTGCGCA CCATGGGCAA GAGCAACACC
CAGCCCCGCG AGGAGTTCAG CGAGGGGAAG AGCTTCGGAG CCGCAACGGT GGACAAGCTC
ACCTGGAAGG ACCTGCTCGA TTCCTTCTCC TGTACCGAAT GCGGCCGATG CCAGGACTCC
TGCCCGGCAG CCGCCACCGG GAAGGCGCTC AACCCGCGGC AGGTGATCCA CGCCATCAAG
GAAAACCTGC TGCAAAACGG GATCGTGCTG GAGAAGCTTC ACGGGGACGC TGAGGCCGAG
CGTTGCGTCT CGCTGATAGG AGAGGGGAAG AAAGGAACGA CTCAGGAATC CGCCCTTTGG
AGCTGCACTT CCTGCGGCGC CTGCATGGAG GCATGCCCCG TTTTCATCGA GCACCTCCCG
AAGATCGTCA AGATGCGGCG GCACCTGGTG CAGATGGAGG CGAAGTTTCC GGAGGAGCTT
TTGAACCTCT TCGAGAATAT GGAGCAGCGC TCCAACCCTT GGGGGATGGC CCCGTCGGAG
CGGAGCAAGT GGTGTTCCCA ACTGGATCTG CGCCCCTTTC AGGCTGGAGA AACCGAATAC
CTCCTCTTCG TCGGCTGCTC GGGCGCCTTT GCTGCACGGA TCAAGCAGGT GAGCGTCGCC
CTGACCCGGG TGCTGGACGC TGCGGGCGTC TCCTATGGCG TCCTGGGCAA GGACGAAAAA
TGCTGCGGCG AGAGCGTGCG CCGGCTGGGC AACGAGTACC TCTTCGACCT GATGGCGCGG
GAAAGCGTGG CGCAGTTCCG GGAAAAGGGA GTCGTCAAGG TCATCACCCA GTGTCCTCAC
TGCTACAACA CGCTCAAGAA CGATTTCCGG CAGTTCGGGC TCGAACTGGA GGTGCTGCAT
CACAGCGAGC TGATCGCGAC ACTTATGGCT TCGGGAAAGC TGCGGATGGA GGGGAAGATG
GGGGATCTGG GCAATCTGGT CCTGCACGAC TCCTGTTATC TGGGGCGTCA TAACGACGTG
TACCAGCCGC CGCGCTCGGT GATACAGGCG GTAACCGGTA CGGCGGCCGG CGAGATGGGG
CGCAGCCTCG ACAAAGCCTT TTGCTGCGGT GCCGGCGGCG GGCGGATGTG GCTCGAGGAG
CACGAGGGAA CGCCGATCAA CCGGGCGCGA GTGGCGGAGG CGCTGGCGTT GAAGCCGGAC
ACCATCTGCG TAAGCTGCCC TTTCTGCATG ACCATGTTCG AAGACGGGGT GAAGGAGGTG
CCTGGCGCCG GGGTTCAGGT GAAGGACTTG GCCGAGGTGG TGGCCCTGGC CCTCCCCATG
AGGTCAACGT AA
 
Protein sequence
MEHTFFVVLL SLSMAVFGFS CYRRLALVAI GKYEYRFDQP TARLKEMLVY ALGQKRVVSR 
PFGLNHGVIF WAFLVLAVAN LEFLVSGILP GLSFALLPEP LHGALAFVFD ICSLAVLIAV
AVAAVRRTVK PPFAGARTFE AYFILSMIAI LMTAYFGMNG ALIASGARAA TSLTPVSNFC
ASLLSSTPAT DHLDLAAKVF WWIHAVVLLG FMNFLPYSKH MHILTAIPNV FLRTMGKSNT
QPREEFSEGK SFGAATVDKL TWKDLLDSFS CTECGRCQDS CPAAATGKAL NPRQVIHAIK
ENLLQNGIVL EKLHGDAEAE RCVSLIGEGK KGTTQESALW SCTSCGACME ACPVFIEHLP
KIVKMRRHLV QMEAKFPEEL LNLFENMEQR SNPWGMAPSE RSKWCSQLDL RPFQAGETEY
LLFVGCSGAF AARIKQVSVA LTRVLDAAGV SYGVLGKDEK CCGESVRRLG NEYLFDLMAR
ESVAQFREKG VVKVITQCPH CYNTLKNDFR QFGLELEVLH HSELIATLMA SGKLRMEGKM
GDLGNLVLHD SCYLGRHNDV YQPPRSVIQA VTGTAAGEMG RSLDKAFCCG AGGGRMWLEE
HEGTPINRAR VAEALALKPD TICVSCPFCM TMFEDGVKEV PGAGVQVKDL AEVVALALPM
RST