Gene GM21_1298 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1298 
Symbol 
ID8136625 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1522297 
End bp1523532 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content65% 
IMG OID644868912 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003021116 
Protein GI253699927 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.0000000000000177669 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAGA GCACGCCGGT CAGATCCCCA CTCCAGAGTC TTTGCCAAAC GAACATCGAG 
GGGTGCACCA ACTGCGGGAA GTGCGTCCGC GAGTGCGCCT TCCTGCGCAA ATACGGCACC
CCCAAGAAGA TCGCCGCGGA GTTCGACCCG GCCGACTCCA TGTCCTTGCA CCGCGCCTTC
GAGTGCAACC TCTGCGGGCT CTGTTCCGCG GTCTGCCCGG AGAAGCTCGA CGTGGACGGC
ATGTTCCTGG AGATGCGGCG GGAAGCGGTG GACCGCGACT TGGGCGCCTA CCCGGAACAC
AAGCCCCTGC TCAATTACGA GAAGGTCGGG ACCTCGCGGC GGTTCAGCCT CTACCGGCTC
CCCGAGGGAT GCAGGACCAT CTTTTTCCCC GGCTGCTCGC TCCCGGGAAC GCGCCCGGAC
GCGGTGCACA ACCTCTTGGC GCTCATGCAC CAGGCCGACC CGACTGTGGG GGTGGTGTTC
GACTGCTGCC TCAAGCCCTC CTATTCGCTG GGGCGCGAGC AGTACGTGAA TTCGATGTTC
GAGGAGATGA ACGACTGGCT CCTGCGGCAC GGGGTGCGGG AGGTGCTGGT TGCCTGCCCC
AACTGCCAGG TGATGTTCGA GCGCCTGGGG CACGGGATGC GGGTGCGCAC GGTATGGGAG
GCCTTGGCCG AGGCTGGGCT TCAGCCGGAA CGGGCGGCGG GGACGGTCAC GGTGCACGAC
CCTTGCGTCA TCCGCAACTC TGAGCCGGTG CACCAGGCGG TGCGCACCCT TTTGGAGCGG
CAGGGACTGG TGGTCGAAGA GATGAAGCAT GCGGGGAAGA AGACGGTCTG CTGCGGCAAG
GGGGGCGGGG TGAACCTTTT GAACCCATCG TTGGCGGGGG AGTGGGGGGA GCTGCGCAAA
AAGGAGGCCG CCGGCAGGAG GGTGATCACC TACTGCGCCG GGTGCGTCCA GGCGCTGGAA
CAGCACACCC CGACCAACCA CCTGGTGGAC CTGCTCTTCG CGCCGGCACA GACCCTGGCG
GGCAAGAAGA AGGGGGCCAA AGCCCCCATT ACTTACCTGA ACCGGCTGCG TCTCAAGATG
TCGTTCAAAA AGAAGAAGGG GAATGCGGTG TTGAGGGAGC GGAGCTTCGT CGCGCAGCAG
GCACTGCGGA AAAAACGCAG GTGGAAGATC CCTTTCACGC AGATCCTTTG CGGGATAGCC
GCGGCCGCAG CCGGGATGCA TTTGCTATCC CTCTGA
 
Protein sequence
MKQSTPVRSP LQSLCQTNIE GCTNCGKCVR ECAFLRKYGT PKKIAAEFDP ADSMSLHRAF 
ECNLCGLCSA VCPEKLDVDG MFLEMRREAV DRDLGAYPEH KPLLNYEKVG TSRRFSLYRL
PEGCRTIFFP GCSLPGTRPD AVHNLLALMH QADPTVGVVF DCCLKPSYSL GREQYVNSMF
EEMNDWLLRH GVREVLVACP NCQVMFERLG HGMRVRTVWE ALAEAGLQPE RAAGTVTVHD
PCVIRNSEPV HQAVRTLLER QGLVVEEMKH AGKKTVCCGK GGGVNLLNPS LAGEWGELRK
KEAAGRRVIT YCAGCVQALE QHTPTNHLVD LLFAPAQTLA GKKKGAKAPI TYLNRLRLKM
SFKKKKGNAV LRERSFVAQQ ALRKKRRWKI PFTQILCGIA AAAAGMHLLS L