Gene Daro_3306 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3306 
Symbol 
ID3566486 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3555203 
End bp3556513 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content60% 
IMG OID637681779 
Productextracellular solute-binding protein 
Protein accessionYP_286506 
Protein GI71908919 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones42 
Plasmid unclonability p-value0.148376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.12789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCATC GTCTTCGTCC TGCTTTCGTC ATCATCACGC TGGCTCTGGC CTGTTCTTCT 
GCCCTGGCCG CCAAGCCTGC CAAGTCGGCA AAGCCTGCTG CAAAGCCGGT TCACGCTCCG
GCGCCTGCTG CCGACTTCGA GTTGGCCCAT AATCTTGGCC CGGACGGTGA AGAGCAACTG
CAGGCAGTGG TTGATCGTTT CAACAAGGAA AACGGTGGCA ACCTGAAATT GGCACGCCTG
GAAAAGGGTG AAAAGCCGGC CGGGCTCAAC CTGATCCGTC GCTATGACAT GAGCGACGTA
CTGGTTCAAC CAAAGGCCTT CGTGCCGCTG TACGAGATGA TGACCAAGGC AGGGCAACCG
CTTCAGGTTG GCGAGTTGTC GGCGGATCTG AAGTCAGGTG CGGTCGATGC CAAGGGACGC
CTGGTCGCTT TGCCGCTGAT CTATTCGACG CCGGTTCTGT TCTACAACAA GAATGCTTTC
CGCAAGGCGA AGCTGGATCC CGAGCAGCCG CCGAAGACCT GGTTTGAAAT GCAGGGCGTA
CTCGACAAGC TGCAGGACGC TGGTTACACC TGTCCTTACA CATCGTCGTG GCCAGTCTGG
GTGCACATTG ATAACGTCAG TGCCGTGTCT GGTGTGCCGG CAGTCAGCGA CAAGGGCACG
CTGAGCTTCA ACGGCCTGCC GCAGGTCAAG CACGTGGCGA TGATGGCGAC CTGGACCAAG
GCCAATTACT TCAAGCTGTT CGGTCGTCGC AACGAAGCCA GCACCAAGTT CCATGACGGC
GAATGCGCAA TGATCACGAC CGATTCGCGC GAACATATTG ATTTCCGTGA TGCCAAGGGC
GTCGAACTAG GCGTTGCCCC GCTGCCCTAT CACGATGATG TTTACGGCGG CCGCCAGAAT
TCGCTGGCCG ATGGGGCGTC GCTGTGGGTT GGTGCGGGCA AGTCGCCTGC GGAGTACAAG
CAGGCAGCAA AATTCGTCTC CTTCCTGCTT TCGCCGGAAA TGCAGATCGA GATGGTGCGC
GTCTATGGCG GGCTGCCGCT GACCGCAGCC GCCCGTGCCG CAGCCCGCAG CAAGCTGCTG
CAGGATGGAG ACAAGACGCT GGAAGTTGCT TATGCCTCGA TGAAAGGCAA GGGGGCTTCG
CATGTTCCCC ATGTGTCCGA TGCCGACCCG GTGCGCATCC TGACCAATGA GGAACTGGAG
GCCGTGTGGT CCGACAAGAA GCCCGCCAAG GCTGCACTGG ATACGGCGGT TTCCCGCGGT
AACGCCATCA TGGCAGCCAA GCCGGCCCTG AAGAAGGCGC AGCCCTTCTA A
 
Protein sequence
MSHRLRPAFV IITLALACSS ALAAKPAKSA KPAAKPVHAP APAADFELAH NLGPDGEEQL 
QAVVDRFNKE NGGNLKLARL EKGEKPAGLN LIRRYDMSDV LVQPKAFVPL YEMMTKAGQP
LQVGELSADL KSGAVDAKGR LVALPLIYST PVLFYNKNAF RKAKLDPEQP PKTWFEMQGV
LDKLQDAGYT CPYTSSWPVW VHIDNVSAVS GVPAVSDKGT LSFNGLPQVK HVAMMATWTK
ANYFKLFGRR NEASTKFHDG ECAMITTDSR EHIDFRDAKG VELGVAPLPY HDDVYGGRQN
SLADGASLWV GAGKSPAEYK QAAKFVSFLL SPEMQIEMVR VYGGLPLTAA ARAAARSKLL
QDGDKTLEVA YASMKGKGAS HVPHVSDADP VRILTNEELE AVWSDKKPAK AALDTAVSRG
NAIMAAKPAL KKAQPF