Gene GM21_1314 details

Gene Information

Locus tag: GM21_1314
Symbol:
ID: 8136641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1541999
End bp: 1544134
Gene Length: 2136 bp
Protein Length: 711 aa
Translation table: 11
GC content: 63%
IMG OID: 644868928
Product: protein of unknown function DUF162
Protein accession: YP_003021132
Protein GI: 253699943
COG category: [C] Energy production and conversion
COG ID: [COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain
TIGRFAM ID: [TIGR00273] iron-sulfur cluster-binding protein
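
The start, end, and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp, end_bp):
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp):
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
start_bp, end_bp = 1541999, 1544134
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "2136 711"
```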


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.000000000000036921
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAAAAAG AGTTCAAGGC ATCGATCGAC CGGGCCCTCA ACGACGCCAA CCTGACCGGC 
GCGCTGGGGA AGTTTTCCGA AGCGTACAAG GTGAACCGCG CCAAGGCCTA CGAGGGAATC
GACTTCGAGG CGCTCCGCTC CACCGTGGCC GATGCGAAGT CCAAGGCGGC CTGCCATCTG
GACGAGGTGG CCGATCTCTT CAAGGCGAAC GCCGAGGCGC TCGGGGCCAA GGTGTTCCGC
ACCCGGGACC CCCAGGAGGT GAAACGCTAC ATCCTGCAAC TGGCCAAGGA GAAGGGGGTC
CGGAGCGTGG TGAAGTCGAA GTCCATGGCG ACCGAGGAGA TCCACCTGAA CCGGGCACTG
CTGGAGGAGG GAATCGCGGT CGCCGAGACC GACCTCGGCG AGTGGATCAT CCAGCTCGCC
GGCCAAACTC CGTCGCACAT GGTCATGCCG GCGATCCACA TGACCAAGGA GGAAGTGGCG
GAGATCTTCA GCAAGGAGGT CGACGAGCGG CTCTCCACCG ACATCCCGAG GCTGGTGAAG
GTGGCCCGTA ACGAACTCCG CCCTAAGTTC CTGGCGGCGG ACATGGGGAT CTCCGGCGGC
AACATCGCCG TCGCCGAGAC CGGAAGCATA GTGCTCGTGA CCAACGAAGG GAACGCGAGG
CTCGTGACCA CCCTTCCCAA AATCCACGTG GCGCTGATCG GCGTCGAGAA ACTGGTGGAG
AAGTTCGAGA GCGTCGCTCC CATCCTGGAC GCGCTCCCCA GGAGCGCCAC GGCGCAGCTT
CTCACCAGCT ACGTCTCCAT CATCACCGGC CCCACGCAGA ACGACGACGG GAGCGACAAG
GAGCTGCACA TCATCCTGAT GGACAACCGG CGCACCGAAA TGGCGCAGGA TCCCAAGTTC
AAGCAGGCGC TGCAATGCAT CCGCTGCGGC TCCTGCCTGA ACGTCTGTCC CATCTTCCGG
CTGGTTGGGG GGCACGTCTT CGGCAGCATC TACACCGGGG GGATCGGCAC CATCCTCACC
GCCTGGTTCG ACGAGCTGAA GAAGTCCGAG GATATACAGG GGCTGTGCAT CCAGTGCGGC
AACTGCAAGG AGGTCTGCCC AGGGAAGCTC GATATCCCCG AGATGATCAT GGAGATCCGG
CGCCGGCTGG TGCTGGAAAA AGGGCAGCCG CTGCTGCAGA AGGCGATCTT CGGCGTGGTG
AACAACAGAA GGCTTTTCCA CGGCATGCTG CGCGCCGCCT CCGTCGCCGC AAAGCCCTTC
AGCACCGCCG GTTTCATCCG CCACCTGCCG CTGTTTCTTG CCGACTTAAC CGACGGCCGC
AGCCTCCCTG CCATCGCGGA GAAGCCGTTC AGGGACATCT TCCCGGAGAT CGTGCAGCCG
CAGGCAAAGG AAAAGGCCGT CTTCTACGCG GGCTGCCTGA TCGACTTCGC CTACCCCGAG
ACGGGTGTCG CGCTGGTGCG GCTTCTCAAT AAGGCGGGGA TCGAGGTGAT CTTCCCCGAG
GAACAGACCT GCTGCGGAGC CCCCGCACTC TACAACGGGG CCTACGAGGT CGCGGCGCAA
AACGCGATCG ACAACATAGA AGTGCTCTTG CAGCACGAGG CGCAGTACGT GGTTTCCGCC
TGCCCCACCT GTACGGTGGC GCTGGCGCAC GAGTTCGGTA AGACCCTGGA AAGCGTGGGA
CAGACCAAGT GGCTGGAGAA GGCGCAGGAA CTGGCCGCGA AGACGGTGGA TCTATCCACA
CTCGTGAAGC GGTTAACGGA TGAAGGAAGG CTGAGCTTTG AGGAAGGGGA AGGGCTTGCG
AAAATCACCT ACCACGACTC CTGCCACCTC AAACGGACGC TCAAGGTGTC GGAAGAGCCG
CGCGAACTGC TGCAAAAGGC GGGCTACCAG TTGGAGGAGA TGTTCGAGTG CGACATGTGC
TGCGGCATGG GAGGCTCCTA CTCCATGAAG CTTCCCGAGA TCTCGGCGCC GATCCTGAAG
CGCAAACTGC AGAACATAAA GGATACGGGG GCGCCGGTAG TGGCGATGGA CTGCCCGGGG
TGCGTGATGC AGATACGCGG CGGGTTCGAC CAGCAGGGCG GAGAGGTGAA GGTGAAGCAC
ACCGCCGAGC TTTTGGCCGA GCGGTTGAAA GGCTAA
 
Protein sequence
MKKEFKASID RALNDANLTG ALGKFSEAYK VNRAKAYEGI DFEALRSTVA DAKSKAACHL 
DEVADLFKAN AEALGAKVFR TRDPQEVKRY ILQLAKEKGV RSVVKSKSMA TEEIHLNRAL
LEEGIAVAET DLGEWIIQLA GQTPSHMVMP AIHMTKEEVA EIFSKEVDER LSTDIPRLVK
VARNELRPKF LAADMGISGG NIAVAETGSI VLVTNEGNAR LVTTLPKIHV ALIGVEKLVE
KFESVAPILD ALPRSATAQL LTSYVSIITG PTQNDDGSDK ELHIILMDNR RTEMAQDPKF
KQALQCIRCG SCLNVCPIFR LVGGHVFGSI YTGGIGTILT AWFDELKKSE DIQGLCIQCG
NCKEVCPGKL DIPEMIMEIR RRLVLEKGQP LLQKAIFGVV NNRRLFHGML RAASVAAKPF
STAGFIRHLP LFLADLTDGR SLPAIAEKPF RDIFPEIVQP QAKEKAVFYA GCLIDFAYPE
TGVALVRLLN KAGIEVIFPE EQTCCGAPAL YNGAYEVAAQ NAIDNIEVLL QHEAQYVVSA
CPTCTVALAH EFGKTLESVG QTKWLEKAQE LAAKTVDLST LVKRLTDEGR LSFEEGEGLA
KITYHDSCHL KRTLKVSEEP RELLQKAGYQ LEEMFECDMC CGMGGSYSMK LPEISAPILK
RKLQNIKDTG APVVAMDCPG CVMQIRGGFD QQGGEVKVKH TAELLAERLK G
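
The sequences above are laid out in space-separated blocks of ten characters. The following is a minimal sketch of how such blocks could be joined, translated, and checked for GC content. It assumes a hand-built codon table whose amino-acid assignments match the standard code (translation table 11 uses the same assignments and differs in which start codons are permitted), and the helper names `clean`, `gc_fraction`, and `translate` are illustrative only.

```python
from itertools import product

# Codon table built from the conventional TCAG ordering.
# Assumption: sense-codon assignments of table 11 match the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_text):
    """Remove the spaces and line breaks used for the 10-character layout."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence above.
dna = clean("ATGAAAAAAG AGTTCAAGGC ATCGATCGAC")
print(translate(dna))  # prints "MKKEFKASID"
```

Applying `clean` to the full gene sequence and passing the result to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and GC content.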