Gene Information |
Locus tag | Dshi_1437 |
Symbol | |
ID | 5712614 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Dinoroseobacter shibae DFL 12 |
Kingdom | Bacteria |
Replicon accession | NC_009952 |
Strand | - |
Start bp | 1492683 |
End bp | 1493738 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 641267350 |
Product | glycine betaine/L-proline ABC transporter |
Protein accession | YP_001532780 |
Protein GI | 159043986 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG4175] ABC-type proline/glycine betaine transport system, ATPase component |
TIGRFAM ID | [TIGR01186] glycine betaine/L-proline transport ATP binding subunit |
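
The start, end, gene length, and protein length values listed above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record: Start bp 1492683, End bp 1493738
length = gene_length_bp(1492683, 1493738)
print(length, protein_length_aa(length))  # prints "1056 351"
```

Strand does not affect this arithmetic: for a gene on the minus strand the same start and end coordinates still bound the feature on the replicon.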
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.102284 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.596955 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGGCG ATACCGTCAT CGAGATATCG AATGTCTGGA AGATTTTCGG CGCCAACGCC CAGGCAGCCC TTGAGGCGGT CCGCGACCGG GGGCTGAGCA AGGCCGAGAT CCTGGCCGAA TTCAACGCGG TCGTGGGCGT GGCCGATGTC AGCCTGTCGG TGCGGCGCGG CGAGATCTTT TGCATCATGG GGCTGTCGGG CAGCGGCAAG TCCACGCTGG TGCGCCATTT CAACCGCTTG CTGGAGCCGA CCGCGGGCAG GATCGAGATC GAGGGGACCG ATGTCATGGC GCTCGGCACC CAGGAGCTTC AACGCTTCCG CAACCGACAG ATCGGCATGG TGTTCCAGAA CTTCGCGCTG ATGCCGCACC GTTCGGTGCT GGACAACGTG GCGATGCCAC TGGAGATCCG GAAGGTCCCC AAGAACGAGC GCATGCGCCA GGCCGCCGCG ATCCTCGACA TCGTCGAGCT GGGCGCCTGG GGGGCGAAGT TCGCCCATGA ACTGCCGGGC GGGATGCAGC AGCGGGTGGG GCTGGCCCGG GCGCTGGCGG CGAATCCGGA CGTGTTGCTG ATGGACGAGC CCTTCTCGGC ACTCGATCCG CTGATCCGAA GGCAGTTGCA GGACGAATTC ATCCGATTGT CGAAGATCCT CAAGAAAACC ACGATATTCA TCACCCATGA CCTCGACGAG GCGGTGCGCA TCGGCGACCG GATCGCCATC ATGCGCGACG GCAAGGTGGT GCAGATGGGC ACCGCCGAGG ACATCGTGAT GCACCCGGCC GATGACTACG TGGCCGATTT CGTGGCCGGG ATCTCGCGGC TCAAGGTGGT TCATGCCCAC GCGGTGATGC AGCCGCTGGA GGCCTATCTC GCCACTCACG GCCCGCTTCC GGCCGCCGTC CCCAAGGTCG ACGAGGGCGA AACCCTGAGC AACCTGATCA CGCTCGCCAT CGATGACGAG AATCCGATCC TCGTGCAGGA CGCGGGTCGG GACGTCGGTA TCATCACCCG TGCGGACCTG TTGCGCACGG TCATCGAGGG AACGGAAGTC TCATGA
|
Protein sequence | MHGDTVIEIS NVWKIFGANA QAALEAVRDR GLSKAEILAE FNAVVGVADV SLSVRRGEIF CIMGLSGSGK STLVRHFNRL LEPTAGRIEI EGTDVMALGT QELQRFRNRQ IGMVFQNFAL MPHRSVLDNV AMPLEIRKVP KNERMRQAAA ILDIVELGAW GAKFAHELPG GMQQRVGLAR ALAANPDVLL MDEPFSALDP LIRRQLQDEF IRLSKILKKT TIFITHDLDE AVRIGDRIAI MRDGKVVQMG TAEDIVMHPA DDYVADFVAG ISRLKVVHAH AVMQPLEAYL ATHGPLPAAV PKVDEGETLS NLITLAIDDE NPILVQDAGR DVGIITRADL LRTVIEGTEV S
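
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of joining such blocks into one contiguous string and computing a GC fraction follows; the helper names are hypothetical, and only the first three blocks of the gene sequence are used as sample input, so the printed fraction describes that sample and not the whole gene.

```python
def join_blocks(blocked: str) -> str:
    """Remove the whitespace that separates the ten-character display blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string (0.0 for empty input)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Sample only: the first three blocks of the gene sequence in this record.
sample = join_blocks("ATGCACGGCG ATACCGTCAT CGAGATATCG")
print(len(sample), round(gc_fraction(sample), 3))
```

Applied to the full gene sequence, the same two helpers would give a value comparable with the GC content field of the record.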