Gene GM21_1503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1503 
Symbol 
ID8136832 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp1757379 
End bp1758407 
Gene Length1029 bp 
Protein Length342 aa 
Translation table11 
GC content56% 
IMG OID644869115 
Producthypothetical protein 
Protein accessionYP_003021317 
Protein GI253700128 
COG category[R] General function prediction only 
COG ID[COG1611] Predicted Rossmann fold nucleotide-binding protein 
TIGRFAM ID[TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1
[TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value0.641769 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAACTGC GTTTTAACAG AACAAACGAT GAAATCGACC AGATGATCGA CGCATTGATG 
GAGAAAGCTG GAGGGGTGCA CCACCCGGAC CTGGCACGCG AGATGATCAT CTCCGCACTG
AAGGCGGGGC AGGACACCGA TTATTTAGCC GACCTCAAAC TTCTGAGCAA CACCATGAAG
GAGATGCGCT ACACCACGAA GATCTTCGCT CCCTACCGGC ACAAGAAGAA GGTGACCATC
TTCGGCTCCG CCCGGACCCG CCCCGAAGAG CCGATGTACA AGAAGTGCAT CGACTTCGCC
GCCCTATTGG CGGAGAAGGG ATACATGATC ATCACCGGCG GCGGCGGCGG GATCATGCAG
GCCGGAAACG AGGGAGCCGG CAGCGAATCG TCCTTTGCCG CCAACATACG GCTCCCGTTC
GAGCAGTCCG CGAACCGGGT CATGCTGAAG AACCCGCGAC TCATTACCTA CAAGTACTTC
TTCAACCGCA AGGTGGCCTT CGTGAAAGAA TCCGACGCCA TCGCGGTATT CCCGGGCGGC
TTCGGGACGC TCGACGAGGC GATGGAAGTA TTCACCCTGA TCCAGACCGG GAAGACTTCC
CCCAAACCGC TGGTTCTGGT TGACGACGAG GAAGGGTACT GGGAGCACTT CTTCAGGTTC
ATCAAGGAAA GACTGCTGGT TATGGGGTTC ATCTCCGCAG AGGACTTCTC CATCTTCACC
ATCACCAAGA GCTACGAGGA AGCGGTCCAG GTCATCGAGG AGTTCTATAC CAACTACCAT
TCTATGCGGT TCGTCAACGG CGAGCTCATC ATCCGTGTAA CGAAAATTCT GGCTCCCGAG
CAGATCGAGA TGCTGGAGAA CGAATTCCCC GAATTGAGAT TAAACAACAG CCGGATCGAA
TTAATTAGCG CTCGACCGGA GGAAGCGGAC GAGCCGGATC TCCTTGATTT GCCGAGGATA
GCCTTCCACT TCCACCACCA GCACTACGGG CTGCTGATGG CCTTCATTAG GCGGCTGAAC
ACCTTCTGA
 
Protein sequence
MQLRFNRTND EIDQMIDALM EKAGGVHHPD LAREMIISAL KAGQDTDYLA DLKLLSNTMK 
EMRYTTKIFA PYRHKKKVTI FGSARTRPEE PMYKKCIDFA ALLAEKGYMI ITGGGGGIMQ
AGNEGAGSES SFAANIRLPF EQSANRVMLK NPRLITYKYF FNRKVAFVKE SDAIAVFPGG
FGTLDEAMEV FTLIQTGKTS PKPLVLVDDE EGYWEHFFRF IKERLLVMGF ISAEDFSIFT
ITKSYEEAVQ VIEEFYTNYH SMRFVNGELI IRVTKILAPE QIEMLENEFP ELRLNNSRIE
LISARPEEAD EPDLLDLPRI AFHFHHQHYG LLMAFIRRLN TF