Gene Caul_1167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1167 
Symbol 
ID5898622 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1231428 
End bp1232879 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content68% 
IMG OID641561650 
Productmajor facilitator transporter 
Protein accessionYP_001682795 
Protein GI167645132 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.181603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0337638 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACA CCGCCGCGAC GGCGACGCCG CAGACCTCTG TCCCTCTGAC CGCCTGGCGC 
CGAACCCGGG CCATCCTCGG CGGGTCGGCC GGCAATCTGG TCGAGTGGTA CGACTGGTTC
GCCTACGCCG CCTTTTCGAT CTATTTCGCC AAGGTCTTCT TCCCCAAGGG CGACCAGACC
GCCCAACTGA TGCAGACGGC AGCCATCTTC GCCGTCGGCT TCGGGGCGCG CCCGGTCGGG
GCCTGGCTGA TGGGCCTCTA CGCCGATCGC AAGGGCCGCA AGGCCGCGCT GACCCTGGCG
GTAGGCCTGA TGTGCGCCGG CTCGCTGATT ATCGCCGTCA CCCCCGGCCA AGCCGCGATC
GGCGACCTAG CCCCGGTGAT CTTGCTGCTG GCCCGCCTGC TGCAGGGCTT GTCGGTAGGC
GGCGAGTACG GGGCCAGCGC CACCTATATG AGCGAGATGG CGGGAAAGAA GCGCCGCGGC
TTCTGGTCCA GCTTTCAGTA CGTGACCCTG ATCATGGGTC AGCTTGTCGC CGCGCTTGTG
CTGGTGATCC TGCAAAACAC CCTGGACAAG GCGCAACTGG CCAACTGGGG TTGGCGCATC
CCGTTCTTCG TCGGCGCGGC CCTGGCCGTG GTGGTGTTCT GGATCCGCAC CGGCATCGAG
GAAAGCGTCT CGCACCAGAA CGTCACCCAG CGCGATCCGA TCAGCAGGCG GCAGGTCGTC
TGGGTCGCCG TCCTGCTGCT GACCACCATC GCGGCGATGG TCGTGGGCTT CACCGAGGCG
CCCTACGCCG CGACCGCCCA GTACGGCGCC GTGCTGGCCT TGCTCCTGAC CTATGTCGCC
CTGGCCGCGC CCCTGGTCTC GCGACACCCC AAGCAGGCCC TGGCGATCAT CGGCCTGACC
GCCGCGGGCT CCTTGGCCTT CTACGCCTAC ACCACCTACA TGCTGAAGTT CCTGACCAAC
ACGGCGGGCT TCGACAAGGC CACGGCCGGG GCGATCAACC TGGCCACCCT GGCCGGCTTC
ATGCTGATCC AGCCGCTGTT CGGCTGGCTG TCGGACAAGG TCGGGCGCAA GCGGATGCTG
GTCTTCGCCT TCGGGGCGGG CGCCCTGATC GCCTGGCCGG TGTTCACCCT GACCGCCAAG
GCGACCAGTC CCTACGTCGC CTTCGGCCTG ATCTTCGCCG CCCTGGTCGT GCAGTCGGGC
TACACCTCGA TCAGCGCCGT GGTGAAGGCC GAGTTGTTCC CCACCCACGT GCGGGCCCTT
GGCGTCGCCC TGCCCTACGC CCTGGGCAAC GCCGCGTTCG GCGGCACCGC CGAATATGTC
GCCCTGTGGT TCAAGCACGA GGGCATGGAG AGCGGCTTCT ACCTCTACGT CGCGGCGATC
ATGGCCGTGG GCCTGACCGT GTCGCTGCTG CTGCGCGACA CCGGCAAGCA CAGCCTGATC
CTCGAGGATT GA
 
Protein sequence
MTDTAATATP QTSVPLTAWR RTRAILGGSA GNLVEWYDWF AYAAFSIYFA KVFFPKGDQT 
AQLMQTAAIF AVGFGARPVG AWLMGLYADR KGRKAALTLA VGLMCAGSLI IAVTPGQAAI
GDLAPVILLL ARLLQGLSVG GEYGASATYM SEMAGKKRRG FWSSFQYVTL IMGQLVAALV
LVILQNTLDK AQLANWGWRI PFFVGAALAV VVFWIRTGIE ESVSHQNVTQ RDPISRRQVV
WVAVLLLTTI AAMVVGFTEA PYAATAQYGA VLALLLTYVA LAAPLVSRHP KQALAIIGLT
AAGSLAFYAY TTYMLKFLTN TAGFDKATAG AINLATLAGF MLIQPLFGWL SDKVGRKRML
VFAFGAGALI AWPVFTLTAK ATSPYVAFGL IFAALVVQSG YTSISAVVKA ELFPTHVRAL
GVALPYALGN AAFGGTAEYV ALWFKHEGME SGFYLYVAAI MAVGLTVSLL LRDTGKHSLI
LED