Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Daro_3306 |
Symbol | |
ID | 3566486 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dechloromonas aromatica RCB |
Kingdom | Bacteria |
Replicon accession | NC_007298 |
Strand | - |
Start bp | 3555203 |
End bp | 3556513 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637681779 |
Product | extracellular solute-binding protein |
Protein accession | YP_286506 |
Protein GI | 71908919 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1653] ABC-type sugar transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 42 |
Plasmid unclonability p-value | 0.148376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.12789 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCATC GTCTTCGTCC TGCTTTCGTC ATCATCACGC TGGCTCTGGC CTGTTCTTCT GCCCTGGCCG CCAAGCCTGC CAAGTCGGCA AAGCCTGCTG CAAAGCCGGT TCACGCTCCG GCGCCTGCTG CCGACTTCGA GTTGGCCCAT AATCTTGGCC CGGACGGTGA AGAGCAACTG CAGGCAGTGG TTGATCGTTT CAACAAGGAA AACGGTGGCA ACCTGAAATT GGCACGCCTG GAAAAGGGTG AAAAGCCGGC CGGGCTCAAC CTGATCCGTC GCTATGACAT GAGCGACGTA CTGGTTCAAC CAAAGGCCTT CGTGCCGCTG TACGAGATGA TGACCAAGGC AGGGCAACCG CTTCAGGTTG GCGAGTTGTC GGCGGATCTG AAGTCAGGTG CGGTCGATGC CAAGGGACGC CTGGTCGCTT TGCCGCTGAT CTATTCGACG CCGGTTCTGT TCTACAACAA GAATGCTTTC CGCAAGGCGA AGCTGGATCC CGAGCAGCCG CCGAAGACCT GGTTTGAAAT GCAGGGCGTA CTCGACAAGC TGCAGGACGC TGGTTACACC TGTCCTTACA CATCGTCGTG GCCAGTCTGG GTGCACATTG ATAACGTCAG TGCCGTGTCT GGTGTGCCGG CAGTCAGCGA CAAGGGCACG CTGAGCTTCA ACGGCCTGCC GCAGGTCAAG CACGTGGCGA TGATGGCGAC CTGGACCAAG GCCAATTACT TCAAGCTGTT CGGTCGTCGC AACGAAGCCA GCACCAAGTT CCATGACGGC GAATGCGCAA TGATCACGAC CGATTCGCGC GAACATATTG ATTTCCGTGA TGCCAAGGGC GTCGAACTAG GCGTTGCCCC GCTGCCCTAT CACGATGATG TTTACGGCGG CCGCCAGAAT TCGCTGGCCG ATGGGGCGTC GCTGTGGGTT GGTGCGGGCA AGTCGCCTGC GGAGTACAAG CAGGCAGCAA AATTCGTCTC CTTCCTGCTT TCGCCGGAAA TGCAGATCGA GATGGTGCGC GTCTATGGCG GGCTGCCGCT GACCGCAGCC GCCCGTGCCG CAGCCCGCAG CAAGCTGCTG CAGGATGGAG ACAAGACGCT GGAAGTTGCT TATGCCTCGA TGAAAGGCAA GGGGGCTTCG CATGTTCCCC ATGTGTCCGA TGCCGACCCG GTGCGCATCC TGACCAATGA GGAACTGGAG GCCGTGTGGT CCGACAAGAA GCCCGCCAAG GCTGCACTGG ATACGGCGGT TTCCCGCGGT AACGCCATCA TGGCAGCCAA GCCGGCCCTG AAGAAGGCGC AGCCCTTCTA A
|
Protein sequence | MSHRLRPAFV IITLALACSS ALAAKPAKSA KPAAKPVHAP APAADFELAH NLGPDGEEQL QAVVDRFNKE NGGNLKLARL EKGEKPAGLN LIRRYDMSDV LVQPKAFVPL YEMMTKAGQP LQVGELSADL KSGAVDAKGR LVALPLIYST PVLFYNKNAF RKAKLDPEQP PKTWFEMQGV LDKLQDAGYT CPYTSSWPVW VHIDNVSAVS GVPAVSDKGT LSFNGLPQVK HVAMMATWTK ANYFKLFGRR NEASTKFHDG ECAMITTDSR EHIDFRDAKG VELGVAPLPY HDDVYGGRQN SLADGASLWV GAGKSPAEYK QAAKFVSFLL SPEMQIEMVR VYGGLPLTAA ARAAARSKLL QDGDKTLEVA YASMKGKGAS HVPHVSDADP VRILTNEELE AVWSDKKPAK AALDTAVSRG NAIMAAKPAL KKAQPF
|
| |