Gene GM21_1320 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1320 
Symbol 
ID8136647 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1549805 
End bp1551046 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content64% 
IMG OID644868934 
Productprotein of unknown function DUF224 cysteine-rich region domain protein 
Protein accessionYP_003021138 
Protein GI253699949 
COG category[C] Energy production and conversion 
COG ID[COG0247] Fe-S oxidoreductase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value1.5522600000000002e-26 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGATTACG TCAAGCTGAT CAACTGCATG CGCTGCGGCA TGTGTCTGCC GCACTGCCCG 
ACCTACAAGG AGACCTTCCT GGAGACAGCC TCGCCGCGCG GACGCGTGGC GCTGGTGCGC
AAGCTCCAGG AAGGGGAGCT GGTGGAAAGC GAGAAGTTCC TGGAATACGT CTCGCTCTGC
CTCGACTGCC AGGCCTGCGC CTCCGCCTGC CCCTGCGGCG TCAACGCCGG GGAGCTGGTC
GCCGAGTTCC GCTGCGAGAG CAAAAAGGAG AAGGGGCTCT CCCTCATGGA GGACCTGGTC
CTCAGAAAGC TGGTGCCGCA CCCCGATCGC CTGGAGGCCG CGACGGCGCC GCTTCGGCTC
TACCAGAAAA CCGGTCTGCA GCAGGTGGTC CGCTCGCTGG GTCTCTTGAA ACTCTTCCCC
GAGGCGCTCG GGCGCATGGA AGGTCTTTTG CCGAGGCTCC CCGATGCGCC GCTGCGCCAG
ACCATCAGCG AGGTGACGCC GGCGGCCGGA AAAGAGAAGG GGACCGCGGG CTTTTTCCTT
GGCTGCGTCA TGAGCCTCAT CTTCAGCGAG GCGAGCCTCG CCACGGTGCA GCTTCTTTCT
GCCTTGGGGT ACCGGGTGGT GACCCCAAAG GCGCAGAAGT GCTGCGGCGC CCCCAACATG
CTGGGGCACG ACCTGGACGG GCTCAAGGAG GCGGCACGCT TCAACACCGG CCTCTTCGAC
TCCTTCGACC TCGATTTCGT GGTCACCGAC TGCGGCGGCT GCGGGGCCGA GCTTAAGAAG
TACGGTCAGC ACCTGGACGG CGAGCTCAAG GCGGTTGAAT TCAGCGCCGG CGTGCGCGAC
ATCTCCGAAG TCCTCTTCGC CGAGTCGGAG GCTTTGGCCG CGAAACTGAA ACCCCTGCCG
CTCAAGGTCA CCTACCACGA CCCCTGCCAC ATCGCTCACT GCCAGGGGAT CCGCAGCCAG
CCCCGGGCGC TGATCAAGCT GATACCGGGA ATCGACTTCC GCGAGCTCCC GGAGGCGGAT
GCCTGCTGCG GGTCGGCGGG TACCTACAAC ATCGAGAAGC CGGAGATGTC GGATCTGGTG
CTGTCGCGCA AGGTGCAAAA CGTGCAGAAG ACGGGGGCCG AATACCTCGT GACCAGCAAC
CCGGGTTGCC TCTTGCAGCT CAAGAAGGCG CTCTCGGAGG CCGAGCCTCA GGTAAGGGTG
ATTCATCTCA CCGAACTGCT CCAGATGAGC ATGAATGCGT GA
 
Protein sequence
MDYVKLINCM RCGMCLPHCP TYKETFLETA SPRGRVALVR KLQEGELVES EKFLEYVSLC 
LDCQACASAC PCGVNAGELV AEFRCESKKE KGLSLMEDLV LRKLVPHPDR LEAATAPLRL
YQKTGLQQVV RSLGLLKLFP EALGRMEGLL PRLPDAPLRQ TISEVTPAAG KEKGTAGFFL
GCVMSLIFSE ASLATVQLLS ALGYRVVTPK AQKCCGAPNM LGHDLDGLKE AARFNTGLFD
SFDLDFVVTD CGGCGAELKK YGQHLDGELK AVEFSAGVRD ISEVLFAESE ALAAKLKPLP
LKVTYHDPCH IAHCQGIRSQ PRALIKLIPG IDFRELPEAD ACCGSAGTYN IEKPEMSDLV
LSRKVQNVQK TGAEYLVTSN PGCLLQLKKA LSEAEPQVRV IHLTELLQMS MNA