Gene GM21_1590 details

Gene Information

Locus tag: GM21_1590
Symbol:
ID: 8136921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter sp. M21
Kingdom: Bacteria
Replicon accession: NC_012918
Strand:
Start bp: 1854802
End bp: 1856406
Gene Length: 1605 bp
Protein Length: 534 aa
Translation table: 11
GC content: 63%
IMG OID: 644869203
Product: extracellular solute-binding protein family 5
Protein accession: YP_003021403
Protein GI: 253700214
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
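
The Start bp, End bp, Gene Length and Protein Length values above are related by simple arithmetic. The sketch below shows that relationship. It assumes the coordinates are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted as a residue. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1854802, 1856406)
print(length)                     # 1605
print(protein_length_aa(length))  # 534
```

Both results agree with the Gene Length (1605 bp) and Protein Length (534 aa) fields of this record.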


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 76
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTTTT CCCGCTGTCT CCCCCTCCTT CTTGCCCTCC TAGTCCTTGC CGGTTGCGCC 
AAGGGTGCGC CTGAGGACGC GGCGCGCGCG CAGGGTGGGG GCGCTTCCCG CGGCAGTTCC
TTTGTCACCG GGAGCATCGG GGAGCCTTCC ACGCTGATAC CGATACTCGC CAGCGATTCC
GCCTCCTTCC AGGTGGCCGG TCTGGTCTAC AACGGGTTGG TGCGTTACGA CAAGAACCTG
AAGCTGGAGG GGGATCTGGC CCAATCGTGG GAGGTTTCTC CCGACGGTTT GGGCATCACC
TTCCACCTGC GTCGCGGCGT CAAGTGGCAC GACGGCCACG ACTTCACCTC CCGTGACGTC
CTCTACACCT ACAAGGTGAC CATAGACCCC AAGACGCCGA CCGCGTATGC GGAGGACTTC
AAGCAGGTGC TGTCCGCACA GGCGGTGGAT ACTTACACCT TCAAGGTGCG CTACGCCAAG
CCCTTCGCGC CGGCGCTGGC CTCGTGGGCG TCCATGTCGG TCCTGCCTGC ACATCTTTTG
GAAGGGAAGG ACATCACCAA GAGCCCGCTC TCCAGAAAGC CGGTCGGCAC CGGTCCCTAC
ATCTTCAAGG AGTGGGTAGC GGGACAGCGG GTGATCCTGG AGGCAAACCC GCATTATTAC
GAGGGTGCGC CGCACCTCTC CCCCTACGTC TATCGCATCA TCCCGGACAA CTCCACCATG
TACATGGAGC TCAAGGCGGG CGGCATCGAC ATGATGGGGC TTTCCGCGGT GCAGTACCAG
CGCCAGACCG GAAGCCGAGA GTTCCTGTCC CGCTTCAACA AGTACCGCTA CCCGGCCTCC
GCCTACACCT ATCTCGGCTA CAACCTCAGG CTCCCGATGT TCCAGGACGT CAGGGTGCGC
CGCGCGCTCA CCTGCGCCAT CAACAAGGAA GAGATCATTC AGGGGGTCCT GCTCGGGATG
GGGCAGGTAG CACACGGCCC CTACAAGCCG GGCACCTGGG CCTGGAAGCC CACGATCGGG
GCGGACCCCG GCTACGACCC GGCCCGCGCC GCTGCACTCT TGAAAGAAGC GGGGTACGTC
ATGGGGCAAG ACGGCATCCT GGTCAAGGAC GGCAAGCCGC TGAGCTTCAC CATCATGACC
AACCAGGGAA ACGACGAGAG GCTCAAATGC GCCCAGATCA TACAAAGGCG CCTGAAGCGG
GTCGGCATCG ACGTGAAGAT CCGCGTCATG GAATGGGCCT CCTTCCTCAC CAACTTCATC
GACAAGGGGA GGTTCGAGGC GGTGCTGCTC GGCTGGACCA TCTCGCAGGA CCCCGACCTG
TACGACGTCT GGCACTCTTC CAAGACCGGT CCCAAGGAGC TCAATTTCGT CGGCTACAAG
AACCCGGAGC TGGACCGCCT GATCGTCGAG GGGCGCGGCA CCTTCGATAT GGCCAAACGG
CGCGAGAGCT ACTACCGGCT TCAGGAGATA CTGGCGCAGG ACCAGCCCTA CACCTTCCTT
TACGTCCCGG ACGCGCTGCC GGTGGTCGCC TCCAGGATCA AGGGGATTGA GCCGGCGCCT
GCGGGGATCA GCTACAACTT GATCAAGTGG TATGTAGAAC AATGA
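
The gene sequence above is laid out in space-separated blocks of ten bases, so any calculation on it first has to strip the whitespace. As a minimal sketch, the hypothetical helper below computes a GC percentage from such a block-formatted string. Applied to the full gene sequence, the result should be comparable to the GC content field of this record, assuming that figure was computed over the same sequence.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    # Remove the block/line formatting and normalise case.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# First two blocks of the gene sequence above (20 bases, 10 of them G or C).
print(gc_percent("ATGACCTTTT CCCGCTGTCT"))  # 50.0
```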
 
Protein sequence
MTFSRCLPLL LALLVLAGCA KGAPEDAARA QGGGASRGSS FVTGSIGEPS TLIPILASDS 
ASFQVAGLVY NGLVRYDKNL KLEGDLAQSW EVSPDGLGIT FHLRRGVKWH DGHDFTSRDV
LYTYKVTIDP KTPTAYAEDF KQVLSAQAVD TYTFKVRYAK PFAPALASWA SMSVLPAHLL
EGKDITKSPL SRKPVGTGPY IFKEWVAGQR VILEANPHYY EGAPHLSPYV YRIIPDNSTM
YMELKAGGID MMGLSAVQYQ RQTGSREFLS RFNKYRYPAS AYTYLGYNLR LPMFQDVRVR
RALTCAINKE EIIQGVLLGM GQVAHGPYKP GTWAWKPTIG ADPGYDPARA AALLKEAGYV
MGQDGILVKD GKPLSFTIMT NQGNDERLKC AQIIQRRLKR VGIDVKIRVM EWASFLTNFI
DKGRFEAVLL GWTISQDPDL YDVWHSSKTG PKELNFVGYK NPELDRLIVE GRGTFDMAKR
RESYYRLQEI LAQDQPYTFL YVPDALPVVA SRIKGIEPAP AGISYNLIKW YVEQ
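
The protein sequence above can be related back to the gene sequence by codon-wise translation. The sketch below builds a codon table from the compact base-order string used in NCBI genetic code listings. Translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), so a single table is used here. The function name `translate` and the stop-at-first-stop behaviour are illustrative choices, not the database's own method.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence give the first ten residues.
print(translate("ATGACCTTTT CCCGCTGTCT CCCCCTCCTT"))  # MTFSRCLPLL
# The final blocks end in the stop codon TGA.
print(translate("TATGTAGAAC AATGA"))  # YVEQ
```

Both outputs match the start and end of the protein sequence listed in this record.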