Gene Information |
Locus tag | Caul_1376 |
Symbol | |
ID | 5898831 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 1457840 |
End bp | 1459387 |
Gene Length | 1548 bp |
Protein Length | 515 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641561863 |
Product | substrate-binding region of ABC-type glycine betaine transport system |
Protein accession | YP_001683004 |
Protein GI | 167645341 |
COG category | [E] Amino acid transport and metabolism; [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG1174] ABC-type proline/glycine betaine transport systems, permease component; [COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) |
TIGRFAM ID | |
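
The coordinate and length fields above can be cross-checked against one another. The sketch below is a minimal consistency check, not part of the original record. It assumes the start and end coordinates are 1-based and inclusive (which agrees with the reported gene length) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1457840
end_bp = 1459387
protein_length_aa = 515

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated,
# so the codon count is one more than the number of amino acids.
codons = gene_length_bp // 3

print(gene_length_bp)      # 1548, matching the "Gene Length" field
print(gene_length_bp % 3)  # 0, the CDS is a whole number of codons
print(codons - 1)          # 515, matching the "Protein Length" field
```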
| |
Plasmid Coverage information |
Num covering plasmid clones | 37 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATAGCG GCCTTCTCTC CCTCCTGCCC GAGCGCCTGG CCTGGCACGT GCTGTTGTCG GCGGCGGCCC TGGCCCTGGG GCTGCTGATC GCCTTGCCGC TGGGCGTACT GGCCGCGCGC AGCCCGCGCC TGCGCTGGCC GTCCCTGGCC TTGGCCGGCC TGGTGCAGAC CATTCCCAGC CTGGCCCTGC TGGCCCTGTT CTATCCGCTG CTGCTGCTGC TCTCGAACCT GGCCAAGACG ACGTTCGGCC ATGGCTTCTC GGCCCTGGGA TTCCTGCCGT CGCTGCTGGC GCTGACGCTC TATTCGATGC TGCCGATCCT GAGGAACACC GTGGCCGGCC TGACCGGGGT CGATCCGGCG GTGGTCGAGG CGGCGCGCGG CGTGGGCATG ACCGACCGCC AGCGGTTGTG GCGGGTGGAA CTGCCGCTGT CGGTCCCGGT GATCATGGCC GGGGTGCGCA CGGCGGCGGT GTGGACCATC GGCGCGGCGA CGCTGTCGAC CCCCGTGGGC CAGACCTCGC TGGGCGACTA CATCTTCTCA GGCCTGCAGA CCGAGAACTG GGCGATGGTG CTGACCGGCT GCGTCGCCTC GGCGGGCCTG GCCCTGGTGG TCGACCAACT GCTGGGCCTG GTCGAGCGCG GGGCCGAGCG GCGCGACCGG CGGATCTGGG GCGCCGGCCT TTTGGGCCTG GCCGTCGGCC TGGCGGTCGC CGTCGCGCCC CTGGCGGCCA ACCTGGCGCC GGGGCCATCC AGTTACGTTA TCGGGGCCAA GAACTTTTCC GAGCAATATA TCCTGGCCGA GCTGATGGCC GACCGGCTGG AAGGGCAGGG CGCGCGGGTC ACCCGCAAGA TCAACCTCGG CTCGGCCGTC GCCTACCGCG CCCTCGCGGC CGGCGAGATC GACGCCTATG TCGACTATTC CGGCACCCTG TGGGCCAACG TCCTGGGCCG CAAGGACAAC CCCGGCCGCG CCGCCGTGCT CGACGGCCTG CGCGCCGAGC TCAGGCGCCG CGACGGCGTG GTGCTGCTGG CGCCCCTGGG CTTCGAGAAC GCCTACGCCC TGGCCATGCG CCGCGACCGC GCCGAGGCGC TGGGAATCCG CACGCTCGCC GACCTGGCCG CCAAGGCCCC GAACCTGACC CTGGGCGGCG ACCTGGAGTT CTTCTCGCGC CCCGAATGGG CCAGCGTCGA GGCGACCTAC GGCCTGCGCT TCAAGACCAA GCGTCAGTTC CAGCCGACGT TCATGTACCG CGCCCTCGGC TCGGGCGAGG CCGACGTGAT CTCGGCCTTC TCCAGCGACG GCCGCATCGC CGCTGACGAC CTGGTGGTGC TGGGCGATCC CAAGGGCGCG TTGCCGCCCT ACGACGCGGT GCTGCTGATC GCGCCAGGGC GGGCCGAGGA CCGGCGACTG CGAGCGGCGC TGGCTGGACT GGACGGCGCG ATCGGTGTCG AGGCCATGCG GGCGGCGAAC TATTCGGTCG ACCGCGACCA GGACAAGCGC TCGCCGGCCG AGGCGGCGCG GGCGTTGGAG AAGGGGCTGA AGCGCTAA
|
Protein sequence | MNSGLLSLLP ERLAWHVLLS AAALALGLLI ALPLGVLAAR SPRLRWPSLA LAGLVQTIPS LALLALFYPL LLLLSNLAKT TFGHGFSALG FLPSLLALTL YSMLPILRNT VAGLTGVDPA VVEAARGVGM TDRQRLWRVE LPLSVPVIMA GVRTAAVWTI GAATLSTPVG QTSLGDYIFS GLQTENWAMV LTGCVASAGL ALVVDQLLGL VERGAERRDR RIWGAGLLGL AVGLAVAVAP LAANLAPGPS SYVIGAKNFS EQYILAELMA DRLEGQGARV TRKINLGSAV AYRALAAGEI DAYVDYSGTL WANVLGRKDN PGRAAVLDGL RAELRRRDGV VLLAPLGFEN AYALAMRRDR AEALGIRTLA DLAAKAPNLT LGGDLEFFSR PEWASVEATY GLRFKTKRQF QPTFMYRALG SGEADVISAF SSDGRIAADD LVVLGDPKGA LPPYDAVLLI APGRAEDRRL RAALAGLDGA IGVEAMRAAN YSVDRDQDKR SPAEAARALE KGLKR
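
Both sequences are printed in blocks of 10 characters separated by spaces, so the spaces have to be removed before the sequences are used. The sketch below illustrates this on the first 30 bases only and checks them against the first 10 residues of the protein. It is a hedged example, not a full translator: the codon dictionary is deliberately partial and covers only the codons in this prefix, and `translate_prefix` is a hypothetical helper name. It assumes the first codon is an initiation codon. GTG is a permitted start in translation table 11 and is read as methionine at that position.

```python
# First three 10-base blocks of the gene sequence, as printed in the record.
gene_prefix_blocks = "GTGAATAGCG GCCTTCTCTC CCTCCTGCCC"
# First 10 residues of the protein sequence from the record.
protein_prefix = "MNSGLLSLLP"

# Partial codon table: only the codons that occur in this 30-base prefix.
CODONS = {
    "GTG": "V", "AAT": "N", "AGC": "S", "GGC": "G",
    "CTT": "L", "CTC": "L", "TCC": "S", "CTG": "L", "CCC": "P",
}

def translate_prefix(blocks: str) -> str:
    """Translate a space-separated DNA prefix, treating codon 1 as a start."""
    # Remove the block separators to recover the contiguous sequence.
    dna = "".join(blocks.split())
    # Split into complete codons, ignoring any trailing partial codon.
    codons = [dna[i:i + 3] for i in range(0, len(dna) - len(dna) % 3, 3)]
    residues = [CODONS[c] for c in codons]
    # Assumption: the first codon initiates translation, so it is read as
    # methionine even though GTG codes for valine at internal positions.
    residues[0] = "M"
    return "".join(residues)

print(translate_prefix(gene_prefix_blocks) == protein_prefix)  # True
```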