Gene GM21_2785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2785 
Symbol 
ID8138128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3234101 
End bp3236071 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content63% 
IMG OID644870388 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003022577 
Protein GI253701388 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones89 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAACCA ATCAAATGTT CTTCATGCCG CTTTTCGTCA TCGCCCTCGT GGCCTTTTGC 
TTCAGCTGCT ATCAGCGCCT GCAACTGGTT GCCGTCGGCA CCCCCGAGGA CCGCTTCGAC
AGGCCCGGCG AGCGGCTGGC CGGCATGTTC CGGTACGCCT TCGGTCAAGA GAGGGTCCTC
GCCAGACCCT ACGGCCTGAA CCACTTCGCG CTCTTCTGGG CCTTTATGCT GCTCCTGGTC
GCGAACGTCT CCTTCCTGGC CGAGGGGCTC TTTCCGGGGT TCACCCTCTC CATTCTCCCG
GCCCCTCTGC ACCACGCCCT GGCGCTTTCC TTCGACCTGG TCTCGGTGGT AGCGCTCGTC
AGCGTCGCGG TAGCGCTCGC GCGCCGGCTC TTCTTCGCGC CGTCTTACCT CGGTAACGAT
TACACCAAGG CCTGCAGCGG AGAGGCGCTC CTGATCCTGG CCCTGATCGC CACCCTTATG
GTCGCCTTCT TCCTGCTGAA CGCAGCCCAG ATCGCTCTCG GCGCCGACCA GGCGTTGAGG
CCCGTCTCCG GTGCGCTAGC CACGCTTTTG CAGGGGATGC CGCAAGCGTC GCTCGAAGGT
ATTGCTTCCG TCTCCTGGTG GGTGCATGCC GTGGTGCTCC TTCTTTTCAT CAACCTTTTG
CCCCGCAGCA AGCACATGCA CATACTCACC GCCATTCCCA ACTGCTACTT CCGCAACCTG
GAGAAGCCCA ACGTGCAGCC GCGCGAGAGC TTCGAGCTCG GCAAGCGTTT CGGCGTGAGC
GAGGTGGCGC AGTTCTCCTG GAAGGATCTC CTCGATTCCT TCTCCTGCAC CGAATGCGGG
CGCTGTCAGG ACCTCTGCCC GGCCCACAAC ACCGGAAAGC CCCTGAACCC GCGCCGGATC
ATCCACGACA TCAAGGTGAA CCTCTTGGAG AACGGCGTCG CCAACGCCGG GAAGGAGCAG
CTCCCGCTCA TCGGCGAGAA AGGAGAGGGG ACGAGCTGCG AGGACGCCCT TTGGTCCTGC
ACCACCTGCG GCGCCTGCCT GTCGGTCTGC CCGGTCCTCA TCGAGCACAT GCCCAAGATC
GTCAAGATGC GCCGCCACCT GGTCCAGGAA AAGGCCCGGT TCCCCGAGGA GCTTTTGAAC
CTCTTCGAGA ACATGGAGCA GCGTTCCAAC CCCTGGGGCA TCGCCCCCTC CGAGCGCGGC
AAGTGGGCGA ACCTCCTGGG GGACAGGGAG TTCACAGCAG GCAAGACCGA ATACCTCTTC
TTCGTAGGGT GCGCCGGTTC CTTCGACAGC CGCGCCAAGC AGACTACCGT GGCTCTCGCC
ACCGTCCTCG ACAAGGCCGG CGTCACCTGG GGCATCCTCG GCAGGGACGA GCTCTGCTGC
GGCGACAGCG TGAGGCGCCT GGGGAACGAA TTCGTCTTCG ATAAGATGGC GCGGGAGAAC
GTGGCCAAGT TCAAGGAGAA AGGGGTCACC AAGATCGTCA CCCAGTGCCC GCACTGCTTC
AGCACGCTCA AGAACGACTA CCGGCAGTAC GGCCTGGAGC TGGAGGTGCT GCACCACAGC
GAGCTGATCG CCGGCCTGGT GCAGGAGGGG AAACTGAGCA CCGCCAAAGG GGTCAACCTG
GGCAAGACCG TCTTCCACGA CTCCTGCTAC CTGGGGCGCC ACAACGACAC GTACGCAGCA
CCCCGCCAGG TGATCGAGGC CGCGACCGGT GTCGCTCCCG GTGAGTTCGA GCGCCGGAAA
GAGAACGGAT TCTGCTGCGG AGCAGGCGGC GGGCGCATGT GGATGGAAGA GCAGATCGGC
ACGAGGATCA ACCACGACCG TGTCAACGAG GCCCTGAAGC AGCAGCCCGA CACCATCTGC
GTCAGCTGTC CCTACTGCAT GACCATGCTG GAGGACGGAC TTAAGGACCA GGGCGCGGAA
AAGGTGAGGG TGAAGGATAT AGCAGAGGTA ATGGCCGAGG CAATCAACTA G
 
Protein sequence
MPTNQMFFMP LFVIALVAFC FSCYQRLQLV AVGTPEDRFD RPGERLAGMF RYAFGQERVL 
ARPYGLNHFA LFWAFMLLLV ANVSFLAEGL FPGFTLSILP APLHHALALS FDLVSVVALV
SVAVALARRL FFAPSYLGND YTKACSGEAL LILALIATLM VAFFLLNAAQ IALGADQALR
PVSGALATLL QGMPQASLEG IASVSWWVHA VVLLLFINLL PRSKHMHILT AIPNCYFRNL
EKPNVQPRES FELGKRFGVS EVAQFSWKDL LDSFSCTECG RCQDLCPAHN TGKPLNPRRI
IHDIKVNLLE NGVANAGKEQ LPLIGEKGEG TSCEDALWSC TTCGACLSVC PVLIEHMPKI
VKMRRHLVQE KARFPEELLN LFENMEQRSN PWGIAPSERG KWANLLGDRE FTAGKTEYLF
FVGCAGSFDS RAKQTTVALA TVLDKAGVTW GILGRDELCC GDSVRRLGNE FVFDKMAREN
VAKFKEKGVT KIVTQCPHCF STLKNDYRQY GLELEVLHHS ELIAGLVQEG KLSTAKGVNL
GKTVFHDSCY LGRHNDTYAA PRQVIEAATG VAPGEFERRK ENGFCCGAGG GRMWMEEQIG
TRINHDRVNE ALKQQPDTIC VSCPYCMTML EDGLKDQGAE KVRVKDIAEV MAEAIN