Gene Caul_2136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2136 
Symbol 
ID5899591 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2306726 
End bp2308069 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content68% 
IMG OID641562625 
Productsugar transporter 
Protein accessionYP_001683762 
Protein GI167646099 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00879] MFS transporter, sugar porter (SP) family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.104462 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGCA GTTCGCCCGC CCAACGGGGC ATGAGCATGG CGCTTGTCGG CGCGATCGGC 
GCCGCGGCCT TGGCCGGACT GCTGTTTGGC TTCGACACCG CCGTCATCGC CGGCGTGACC
CACGATGTCA GCCGCGTCTA CGATTTGACG CCCGCCACCC TGGGCATGAC GGTCTCCAGC
GCGCTGTGGG GAACGCTGCT GGGCGCCATG GCTTCCGGCA AGCCGGGCGA CCGCTATGGC
GGCCGCGACG GGCTCAGGGC CATGGCGATC CTCTATCTGG TGTCCAGCCT CGGCTGCGCC
CTGGCCTGGT CCTGGCCGGT CCTGCTGACG GCGCGCTTCA TCGGCGGCGT GGCGATCGGC
GGATCCTCGG TGCTGGCGCC CATCTACATG GCGGAAATCT CGCCGGCCCG CCGTCGCGGC
GCCCTCGTCG GCCTGTTCCA GCTCAATATC GTCGTCGGCA TCCTGCTGGC CTATCTGAGC
AACTTCCTTG TGGACCAGCT GCATCTGGGA CCGGACGCCT GGCGCTGGAA ACTGGCGGTC
ACCGCCGCGC CCGCCGCCCT GCTGTGGCTG CTGTTGGCCC GCGCCGTGCA AAGTCCCCGC
TGGCTGCTGG CCCAGGGACG AGAGGCCGAG GCGATGGCCG CCCTGACCCG GCTGGGCGGC
GACCCGCGAA CCGAGCTTTC CGAGCCGCCG AACCCATCGG GCGCGCGACT GAGCTGGCGC
GCGCACCGCA TACCGATCCT GCTGGCCGTG ACCCTGGCGC TGTTCAACCA GCTGACCGGC
ATCAACGCCC TGCTCTATTA TCTCAACGAC ATCTTTGCCG CGGCCGGCTT TGGCCAGGCC
ACCGCGGGCC TGCAGGCCGT GGCGATCGGC GCGACCAACC TGGTCTTCAC TCTCCTGGCA
ATGAGCGTGA TCGACCGTTT CGGACGCAAG CGCCTGCTGC TGGTCGGCTC GGTGGGCATG
GCGATTTGTC TGGGCCTGGC GGCCTGGATC CTGGACGGCG GCCGCCATTC AAGCTGGCTG
CTCTATGTCC TGGTCGGCTT CATCGCAGCC TTCGCCTTCA GCCAGGGCGC GGTTATCTGG
GTCTATATCA GCGAGATCTT CCCTACCCCC GTTCGTGCGC GGGGCCAGGC CCTGGGCAGT
TCGACCCATT GGCTGGCCAA CGCCCTGATC GCGGGCGTCT TTCCGGCGAT CGCCGCCTGG
AAACCCGGCG CCCCATTCGC TGTCTTCGCG GCGATGATGG TCCTGCAGTT CATCGTCGTG
GCCGTGTTCT ATCCCGAGAC CATGGGCGTG CCGTTGGAAA CCATCGCCGA GAGACTCGGA
ACGCGCGAGG ACCGAGTGGC CTGA
 
Protein sequence
MTRSSPAQRG MSMALVGAIG AAALAGLLFG FDTAVIAGVT HDVSRVYDLT PATLGMTVSS 
ALWGTLLGAM ASGKPGDRYG GRDGLRAMAI LYLVSSLGCA LAWSWPVLLT ARFIGGVAIG
GSSVLAPIYM AEISPARRRG ALVGLFQLNI VVGILLAYLS NFLVDQLHLG PDAWRWKLAV
TAAPAALLWL LLARAVQSPR WLLAQGREAE AMAALTRLGG DPRTELSEPP NPSGARLSWR
AHRIPILLAV TLALFNQLTG INALLYYLND IFAAAGFGQA TAGLQAVAIG ATNLVFTLLA
MSVIDRFGRK RLLLVGSVGM AICLGLAAWI LDGGRHSSWL LYVLVGFIAA FAFSQGAVIW
VYISEIFPTP VRARGQALGS STHWLANALI AGVFPAIAAW KPGAPFAVFA AMMVLQFIVV
AVFYPETMGV PLETIAERLG TREDRVA