Gene GM21_1721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_1721 
Symbol 
ID8137052 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp2002184 
End bp2003329 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content63% 
IMG OID644869333 
Producthypothetical protein 
Protein accessionYP_003021533 
Protein GI253700344 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value0.00204898 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATCGC CCCTGCACCT CATCATGATT CTGTCGGCCG CCTTGGCGCT TTGCGGCTGC 
TCCGGACCGG AACCCGGCTC CACAGGCCTT GCCGGCTCCT CCGTGAAAGC CCACGCCAAA
AAAGCCGCTT ACGGCAGCTA TCGCTTCGGC ATGACCCAGG GGATCGACAT CGGCGCTCAA
CCTCTGACGC TTCCTGAGTT TTCGGTCGCC GAGCTCATGG CGCGCGACCG CGTGCTGGCC
GAGCGCCTGC ACCGCGGCGG GATGCGACTG CAGATGCTCC CCTTCTACAA CGGTAAGGAC
ATCGGCGATT TCCTCTCCAC CGGTGAACTG GAGGGGGGGA TATTCGCGGA CATGCCGGCC
TTGACGGCTG CCGCCGGCGG AGACGTGGTG CTGCTTGCCA TGCTGAAACA GGGGGCCGCC
TCCATCGTCG CCAGGGCTCC CATGCTGGTG AAGGATCTGG ATGGGAAGAG GGTCGGGGTC
ACCACCGGCA GCGCCGCCCA CTTCACCCTT CTGCGGGCGC TGGGCAACGC AGGGTTGGCC
GAGAAGGACG TCGAGCTGGT GCCGATGGAG GTGAGCGAGA TGGCCCGGGC GCTTGCCGAC
GGCAGGATCG ACGCCTTCTG CGCCTGGGAG CCGACCCCTT CCATAGCCTT TTCCTCCTAT
CCCGATTTCC ACCTGGTCCA CAAGGGGCTC AACTACGGGT TCCTCTGCCT GCGACGCGAC
TTCGTGAACA GTCATCCCGG TGAAACCAGG GAAATCCTCG CCGCCGTCGC CAGGGCATGT
TTCTGGATGC GGGAGGGGGG ACAGATGCGG CAACTGGCCC AGTGGACGAC GCAAGCGGCG
ACGAAGTTTC AAGCTGAGCC CTTTGCCCTG AAGCCTGAGC AGATGATGTC CATCACCCGC
CGCGACCTGC TCGACGTCCA ATCCTCCCCG CGCATACCGG AGATGCTCCT GCGCGAGCAG
GAAGTGCTTT ATCAGAAGTT CCTTTTCCTC AAGAAGATAG GAAGGATACC GGAGACCGCC
TCCTGGGCCA AGGTGCGCGG TTCCATAAAT TTAGCGATGT TGCGGGAGGT CATGGCCGAC
TCCGACAGGT ACGCCCTGAG AGGGTTCGAC TACCGCGGCA ATACGGAAAC GGATGGAACA
AGATGA
 
Protein sequence
MKSPLHLIMI LSAALALCGC SGPEPGSTGL AGSSVKAHAK KAAYGSYRFG MTQGIDIGAQ 
PLTLPEFSVA ELMARDRVLA ERLHRGGMRL QMLPFYNGKD IGDFLSTGEL EGGIFADMPA
LTAAAGGDVV LLAMLKQGAA SIVARAPMLV KDLDGKRVGV TTGSAAHFTL LRALGNAGLA
EKDVELVPME VSEMARALAD GRIDAFCAWE PTPSIAFSSY PDFHLVHKGL NYGFLCLRRD
FVNSHPGETR EILAAVARAC FWMREGGQMR QLAQWTTQAA TKFQAEPFAL KPEQMMSITR
RDLLDVQSSP RIPEMLLREQ EVLYQKFLFL KKIGRIPETA SWAKVRGSIN LAMLREVMAD
SDRYALRGFD YRGNTETDGT R