Gene Information |
Locus tag | Dshi_1438 |
Symbol | |
ID | 5712615 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1493812 |
End bp | 1494777 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641267351 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001532781 |
Protein GI | 159043987 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2113] ABC-type proline/glycine betaine transport systems, periplasmic components |
TIGRFAM ID | |
|
|
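The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the reported gene length includes the stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS.

    Assumes the CDS length is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values copied from the record for Dshi_1438.
length_bp = gene_length(1493812, 1494777)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 966 321
```

Both results match the Gene Length and Protein Length fields, which suggests the assumed conventions are the ones used here.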
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.0879571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.595095 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAA CGTTGAAATC CTTCGGACTG GCCACCGCGG TCAGCGTGCT GGCCCTGCCG GCGCTGGCCT CGGAGAAGGT CAGCATCGGC GTGCCGTCCT GGACCGGCGC GCAGGCCATC GCCCATGTGC TGGGCGAGGT CGTCACCTCG CGCATCGGCG GCGAGGTCGA GTACGTGCCC GGCAACAACG CCACGATCTT TCAGGCGATG GACCAGGGCC GCGGCGATAT CGACGTCCAC CCGGATGTCT GGCTGCCCAA CCAGGAGAGC TTCACCAACA AGTATGTGGA CGAGGCCGGC ACCGTCACCC TGTCGTCGAA CCCCTACCAG GGCAACCAGG GCTTCTGCGT GACCCAGGAT TTCGCCGCAG CCCATGACAT TACCGACATC GCCGATCTGG GTCGGCCGGA CGTGGCCGCC CTGATGGACA GCGACGGCAA CGGGCGCGGC GAGATGTGGA TCGGCGCGCC CGGTTGGGCC TCGGCCAATG TCAACGAGGT CAAGGTCCGC GACTACGGCC TGCTGGACTT CATCGAGCCG ATCCGCGCCG AGGAGGCCGT GAAAACCGCG CGCATCAAGG ACTCCATTGC CAAGGGCGAG GGCTATGCGT TCTATTGCTA CGAGCCCCAC GCGGTGTGGT TCATGTTCGA TGTCACGATG TTGACCGAGC CGACCTTTGA CCCGGCGAAA TACGTCATGG TGCAGCCCTC CGATGACGCG GACTGGTACG AGAAGTCCAT GGTCGCGACC AAGGACGCCC TGAAGGACGT GCAGATCGCG TGGTCGAACT CCCTCGTGGA TCGCTCGCCC GCAATTGCGG AGTTCTTCGC CAATTTCCAG CTGAATGCCG AGGATGTCAG CCAGCTTGCC TACCAGATCA GTGCCCAGGG CCGTGATCCG GCCGAGGTCG CGGCGGAATG GGTGAACGCC AACTCCGACC GCGTCGATGG CTGGCTCGGC CTCTGA
|
Protein sequence | MKTTLKSFGL ATAVSVLALP ALASEKVSIG VPSWTGAQAI AHVLGEVVTS RIGGEVEYVP GNNATIFQAM DQGRGDIDVH PDVWLPNQES FTNKYVDEAG TVTLSSNPYQ GNQGFCVTQD FAAAHDITDI ADLGRPDVAA LMDSDGNGRG EMWIGAPGWA SANVNEVKVR DYGLLDFIEP IRAEEAVKTA RIKDSIAKGE GYAFYCYEPH AVWFMFDVTM LTEPTFDPAK YVMVQPSDDA DWYEKSMVAT KDALKDVQIA WSNSLVDRSP AIAEFFANFQ LNAEDVSQLA YQISAQGRDP AEVAAEWVNA NSDRVDGWLG L
|
| |
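The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of that clean-up plus a GC-percentage calculation; the helper names are hypothetical, and how the database rounds its reported GC content is not stated in the record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence, as displayed in the record.
prefix = join_blocks("ATGAAAACAA CGTTGAAATC CTTCGGACTG")
print(len(prefix))          # 30
print(prefix.startswith("ATG"))  # True
```

Applying `join_blocks` to the full Gene sequence field and passing the result to `gc_percent` gives a value that can be compared against the GC content field.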