Gene Caul_1376 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1376 
Symbol 
ID5898831 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1457840 
End bp1459387 
Gene Length1548 bp 
Protein Length515 aa 
Translation table11 
GC content72% 
IMG OID641561863 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_001683004 
Protein GI167645341 
COG category[E] Amino acid transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1174] ABC-type proline/glycine betaine transport systems, permease component
[COG1732] Periplasmic glycine betaine/choline-binding (lipo)protein of an ABC-type transport system (osmoprotectant binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATAGCG GCCTTCTCTC CCTCCTGCCC GAGCGCCTGG CCTGGCACGT GCTGTTGTCG 
GCGGCGGCCC TGGCCCTGGG GCTGCTGATC GCCTTGCCGC TGGGCGTACT GGCCGCGCGC
AGCCCGCGCC TGCGCTGGCC GTCCCTGGCC TTGGCCGGCC TGGTGCAGAC CATTCCCAGC
CTGGCCCTGC TGGCCCTGTT CTATCCGCTG CTGCTGCTGC TCTCGAACCT GGCCAAGACG
ACGTTCGGCC ATGGCTTCTC GGCCCTGGGA TTCCTGCCGT CGCTGCTGGC GCTGACGCTC
TATTCGATGC TGCCGATCCT GAGGAACACC GTGGCCGGCC TGACCGGGGT CGATCCGGCG
GTGGTCGAGG CGGCGCGCGG CGTGGGCATG ACCGACCGCC AGCGGTTGTG GCGGGTGGAA
CTGCCGCTGT CGGTCCCGGT GATCATGGCC GGGGTGCGCA CGGCGGCGGT GTGGACCATC
GGCGCGGCGA CGCTGTCGAC CCCCGTGGGC CAGACCTCGC TGGGCGACTA CATCTTCTCA
GGCCTGCAGA CCGAGAACTG GGCGATGGTG CTGACCGGCT GCGTCGCCTC GGCGGGCCTG
GCCCTGGTGG TCGACCAACT GCTGGGCCTG GTCGAGCGCG GGGCCGAGCG GCGCGACCGG
CGGATCTGGG GCGCCGGCCT TTTGGGCCTG GCCGTCGGCC TGGCGGTCGC CGTCGCGCCC
CTGGCGGCCA ACCTGGCGCC GGGGCCATCC AGTTACGTTA TCGGGGCCAA GAACTTTTCC
GAGCAATATA TCCTGGCCGA GCTGATGGCC GACCGGCTGG AAGGGCAGGG CGCGCGGGTC
ACCCGCAAGA TCAACCTCGG CTCGGCCGTC GCCTACCGCG CCCTCGCGGC CGGCGAGATC
GACGCCTATG TCGACTATTC CGGCACCCTG TGGGCCAACG TCCTGGGCCG CAAGGACAAC
CCCGGCCGCG CCGCCGTGCT CGACGGCCTG CGCGCCGAGC TCAGGCGCCG CGACGGCGTG
GTGCTGCTGG CGCCCCTGGG CTTCGAGAAC GCCTACGCCC TGGCCATGCG CCGCGACCGC
GCCGAGGCGC TGGGAATCCG CACGCTCGCC GACCTGGCCG CCAAGGCCCC GAACCTGACC
CTGGGCGGCG ACCTGGAGTT CTTCTCGCGC CCCGAATGGG CCAGCGTCGA GGCGACCTAC
GGCCTGCGCT TCAAGACCAA GCGTCAGTTC CAGCCGACGT TCATGTACCG CGCCCTCGGC
TCGGGCGAGG CCGACGTGAT CTCGGCCTTC TCCAGCGACG GCCGCATCGC CGCTGACGAC
CTGGTGGTGC TGGGCGATCC CAAGGGCGCG TTGCCGCCCT ACGACGCGGT GCTGCTGATC
GCGCCAGGGC GGGCCGAGGA CCGGCGACTG CGAGCGGCGC TGGCTGGACT GGACGGCGCG
ATCGGTGTCG AGGCCATGCG GGCGGCGAAC TATTCGGTCG ACCGCGACCA GGACAAGCGC
TCGCCGGCCG AGGCGGCGCG GGCGTTGGAG AAGGGGCTGA AGCGCTAA
 
Protein sequence
MNSGLLSLLP ERLAWHVLLS AAALALGLLI ALPLGVLAAR SPRLRWPSLA LAGLVQTIPS 
LALLALFYPL LLLLSNLAKT TFGHGFSALG FLPSLLALTL YSMLPILRNT VAGLTGVDPA
VVEAARGVGM TDRQRLWRVE LPLSVPVIMA GVRTAAVWTI GAATLSTPVG QTSLGDYIFS
GLQTENWAMV LTGCVASAGL ALVVDQLLGL VERGAERRDR RIWGAGLLGL AVGLAVAVAP
LAANLAPGPS SYVIGAKNFS EQYILAELMA DRLEGQGARV TRKINLGSAV AYRALAAGEI
DAYVDYSGTL WANVLGRKDN PGRAAVLDGL RAELRRRDGV VLLAPLGFEN AYALAMRRDR
AEALGIRTLA DLAAKAPNLT LGGDLEFFSR PEWASVEATY GLRFKTKRQF QPTFMYRALG
SGEADVISAF SSDGRIAADD LVVLGDPKGA LPPYDAVLLI APGRAEDRRL RAALAGLDGA
IGVEAMRAAN YSVDRDQDKR SPAEAARALE KGLKR