Gene Information |
Locus tag | Caul_1167 |
Symbol | |
ID | 5898622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 1231428 |
End bp | 1232879 |
Gene Length | 1452 bp |
Protein Length | 483 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641561650 |
Product | major facilitator transporter |
Protein accession | YP_001682795 |
Protein GI | 167645132 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
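The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon
    that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(1231428, 1232879)
print(gene_len, protein_len)  # 1452 483
```

Under those assumptions the result agrees with the Gene Length (1452 bp) and Protein Length (483 aa) rows of this record.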
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.181603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0337638 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGACA CCGCCGCGAC GGCGACGCCG CAGACCTCTG TCCCTCTGAC CGCCTGGCGC CGAACCCGGG CCATCCTCGG CGGGTCGGCC GGCAATCTGG TCGAGTGGTA CGACTGGTTC GCCTACGCCG CCTTTTCGAT CTATTTCGCC AAGGTCTTCT TCCCCAAGGG CGACCAGACC GCCCAACTGA TGCAGACGGC AGCCATCTTC GCCGTCGGCT TCGGGGCGCG CCCGGTCGGG GCCTGGCTGA TGGGCCTCTA CGCCGATCGC AAGGGCCGCA AGGCCGCGCT GACCCTGGCG GTAGGCCTGA TGTGCGCCGG CTCGCTGATT ATCGCCGTCA CCCCCGGCCA AGCCGCGATC GGCGACCTAG CCCCGGTGAT CTTGCTGCTG GCCCGCCTGC TGCAGGGCTT GTCGGTAGGC GGCGAGTACG GGGCCAGCGC CACCTATATG AGCGAGATGG CGGGAAAGAA GCGCCGCGGC TTCTGGTCCA GCTTTCAGTA CGTGACCCTG ATCATGGGTC AGCTTGTCGC CGCGCTTGTG CTGGTGATCC TGCAAAACAC CCTGGACAAG GCGCAACTGG CCAACTGGGG TTGGCGCATC CCGTTCTTCG TCGGCGCGGC CCTGGCCGTG GTGGTGTTCT GGATCCGCAC CGGCATCGAG GAAAGCGTCT CGCACCAGAA CGTCACCCAG CGCGATCCGA TCAGCAGGCG GCAGGTCGTC TGGGTCGCCG TCCTGCTGCT GACCACCATC GCGGCGATGG TCGTGGGCTT CACCGAGGCG CCCTACGCCG CGACCGCCCA GTACGGCGCC GTGCTGGCCT TGCTCCTGAC CTATGTCGCC CTGGCCGCGC CCCTGGTCTC GCGACACCCC AAGCAGGCCC TGGCGATCAT CGGCCTGACC GCCGCGGGCT CCTTGGCCTT CTACGCCTAC ACCACCTACA TGCTGAAGTT CCTGACCAAC ACGGCGGGCT TCGACAAGGC CACGGCCGGG GCGATCAACC TGGCCACCCT GGCCGGCTTC ATGCTGATCC AGCCGCTGTT CGGCTGGCTG TCGGACAAGG TCGGGCGCAA GCGGATGCTG GTCTTCGCCT TCGGGGCGGG CGCCCTGATC GCCTGGCCGG TGTTCACCCT GACCGCCAAG GCGACCAGTC CCTACGTCGC CTTCGGCCTG ATCTTCGCCG CCCTGGTCGT GCAGTCGGGC TACACCTCGA TCAGCGCCGT GGTGAAGGCC GAGTTGTTCC CCACCCACGT GCGGGCCCTT GGCGTCGCCC TGCCCTACGC CCTGGGCAAC GCCGCGTTCG GCGGCACCGC CGAATATGTC GCCCTGTGGT TCAAGCACGA GGGCATGGAG AGCGGCTTCT ACCTCTACGT CGCGGCGATC ATGGCCGTGG GCCTGACCGT GTCGCTGCTG CTGCGCGACA CCGGCAAGCA CAGCCTGATC CTCGAGGATT GA
|
Protein sequence | MTDTAATATP QTSVPLTAWR RTRAILGGSA GNLVEWYDWF AYAAFSIYFA KVFFPKGDQT AQLMQTAAIF AVGFGARPVG AWLMGLYADR KGRKAALTLA VGLMCAGSLI IAVTPGQAAI GDLAPVILLL ARLLQGLSVG GEYGASATYM SEMAGKKRRG FWSSFQYVTL IMGQLVAALV LVILQNTLDK AQLANWGWRI PFFVGAALAV VVFWIRTGIE ESVSHQNVTQ RDPISRRQVV WVAVLLLTTI AAMVVGFTEA PYAATAQYGA VLALLLTYVA LAAPLVSRHP KQALAIIGLT AAGSLAFYAY TTYMLKFLTN TAGFDKATAG AINLATLAGF MLIQPLFGWL SDKVGRKRML VFAFGAGALI AWPVFTLTAK ATSPYVAFGL IFAALVVQSG YTSISAVVKA ELFPTHVRAL GVALPYALGN AAFGGTAEYV ALWFKHEGME SGFYLYVAAI MAVGLTVSLL LRDTGKHSLI LED
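The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a field might be normalized before analysis is shown below, together with a simple GC fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three display blocks of the gene sequence in this record.
prefix = clean_sequence("ATGACCGACA CCGCCGCGAC GGCGACGCCG")
print(len(prefix))
print(round(gc_fraction(prefix), 2))
```

Applying `gc_fraction` to the complete cleaned gene sequence would be one way to check it against the GC content row of the record.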