Gene Daro_2881 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_2881 
Symbol 
ID3566286 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3100673 
End bp3101776 
Gene Length1104 bp 
Protein Length367 aa 
Translation table11 
GC content59% 
IMG OID637681350 
Productperiplasmic sugar-binding protein, putative 
Protein accessionYP_286081 
Protein GI71908494 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones47 
Plasmid unclonability p-value0.333959 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGTC TTGCTGGTCT CCTTGCGCTG CTGTTATTTG TTTTCCCAAG TCTGGTCTGG 
GCCATGTCAG TCGCCTTCAT CAATCCGGGA AAGTCGGATG AGGCTTACTG GCTCACTGCG
ACCAAGGCAA TGGAAGCGGC AGCGAAAGAC CTGGATATAC GCTTCGAAGT CTTTTATGCC
GAGCGGCAGC ATCCCCGGGT GTTCGAGCTC GCCCGTCAGA TTGTTGCCCG TCCTGTTGCC
GATCGACCGG ATTACGTGGT GATCACCAAT GATTATGCGA CCGGACCGGA GCTGTTACGG
CTGTTCGACG CTGCCGGAAT CAAGACTTTT CTGGCCTATA GCGGTATTTC AGAGCCGGTA
GAGCGGGCTG TGACCGGCCA GCCGCGTGAG CATTTCAAAG GGTGGTTGGG TTCTCTCGAA
CCGCGGGCCC AGGAGGCAGG TTATCTCACC GCCAAGGCGC TGATCCAGCA GGGGCGTCGG
GCTCAAGCTC AGGCAGCCGA TGGCCGTTTG CACTTTTTGG CGATCAGTGG CGATCGGTCG
ACGCCGGCCT CGAATCGGCG GGGCGAGGGA ATGCGCCGGG CGGTGGCCGA GGCGGGCGAT
GTGGTGCTCG AGCAGGAGAT TTTCTCCGGC TGGAACAGGG CCAAGGCGGC CGAACAGAGT
GAATGGTTGT TTCAGCGCTA TCCGCTGGCC AGACTGGTCT GGGCGGGAAA TGATCAGATG
GCCTTTGGCG CAATGCAGGT CTGGGAGAAA CGCGGCGGCA AGCCGGGTAA GGATGCCTGG
TTCAGTGCAG TGAATGCTTC CCCCGAGGCG ATGGCTGCCC TCAAATCCGG CCGCCTGGCG
GCACTGGCTG GCGGCCACTT CATTTGCGGC GCCTGGGCGT TGGTCATGCT CTACGACTAC
GACCATGGCC GGGATTTTGC AGAGGGAGAG GGCGTAGAGG TGAATCAGTC GATGTTCACG
CTGTTTTCGC AGAAAGATGC GGATCGTTTC ATGGTGCGCT TTGGTCAACT GCACTTCGAT
CAGGTGAATT TTCGCCGTTT CAGCAAGGCG CTGAATCCGA AGTTGAAACG CTACGATTTC
AATTTCCGGC AGCTACTGGA CTAA
 
Protein sequence
MKRLAGLLAL LLFVFPSLVW AMSVAFINPG KSDEAYWLTA TKAMEAAAKD LDIRFEVFYA 
ERQHPRVFEL ARQIVARPVA DRPDYVVITN DYATGPELLR LFDAAGIKTF LAYSGISEPV
ERAVTGQPRE HFKGWLGSLE PRAQEAGYLT AKALIQQGRR AQAQAADGRL HFLAISGDRS
TPASNRRGEG MRRAVAEAGD VVLEQEIFSG WNRAKAAEQS EWLFQRYPLA RLVWAGNDQM
AFGAMQVWEK RGGKPGKDAW FSAVNASPEA MAALKSGRLA ALAGGHFICG AWALVMLYDY
DHGRDFAEGE GVEVNQSMFT LFSQKDADRF MVRFGQLHFD QVNFRRFSKA LNPKLKRYDF
NFRQLLD