Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0585 |
Symbol | |
ID | 5898040 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 640273 |
End bp | 641901 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641561067 |
Product | major facilitator transporter |
Protein accession | YP_001682216 |
Protein GI | 167644553 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.895306 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGGCGG GCGCGACACC CAATCGGCTT GGCGGTCGAC CAGCCGATTC CGCAGCTCCC ATGGGGGGGT CCTATGCTTG GTACGCCACA GGCGTGCTGG CGCTTGTCTA CGTCCTGAAT TTCGTCGACC GCCAGATCAT TTCAATTCTC GCCGAAGACA TCAAGCGTGA CCTGCATGTG ACGGACGCGC AGCTGGGCTT CCTGTACGGC ACGGCCTTTG CGATCTTCTA CGCTCTTTTT GGCATCCCCT TTGGCATGCT CGCCGATCGT TGGCGCCGCG GCCGGCTGAT CGCCATTGGA CTGGTGGTCT GGTCTGCGAT GACCGCCGCG TCCGGCTTCG CGTTCAACTT CCTGCAACTG GCCCTCGCGC GGGTCGGCGT CGGCGTTGGA GAAGCAACCG CCTCCCCGGC CGCCTTCTCG ATGCTGGGCG ACTATTTTCC GCGTGAACGC CGCGCGCTGG CGGCCTCGCT CTACTCCACC GGTCTCTACC TTGGCATGGG CCTCAGCCTG CCGATCGGCG GCTGGATCGC CCAGTCTTGG AACGATACCT ACGCCGCCGG CGCGGCGCCC TTCGGCCTGG CGGGTTGGCA GGTCGCCTTC CTCGCCGTCG GCTTACCCGG CCTGGCCATG GCGCTATGGG TGCTGACCCT GCGCGAACCG GTGCGCGGCT GCAACGACGG CGCGCCGCGT CCGCTGGTCA CGCCGGGCGC CGGCAAGCTG TTCTTGGCCG ACCTCGCGGC GATCCTGCCC CCGCTCACCC TGTGGTCGGT GTCGCGCCAG CCCCGCATGC TGGCGGTGAA CCTGGCCGTC GCGGTCCTAG TGGCGGGCGT CGCGACGCTC CTGTGCCGTT TTGTCGGCGA TCCGCCACAG TGGATCGCCT ATGGGGTCGG CGTCTATGCC GTGTTCTCCT GGGTCCAGGT GATCAAGGTC ACCGACCGGC CGATCTACGC CCTGATCTGG GGCGACCCCA GGATGCTGGT CGCGATCGTC GCCTTCGGCA GCCTGTCTGT CTTTGTCTAC AGCTACGGGT TTTGGGTGGC GCCCTACGCC ATCCGCACCT TTGGCGTCAC CAAGGCCATG GCCGGGATCG AGCTTGGCAT ACCCGGAGCC TTCGCCTCGG CGATTGGGGT GCTCATCGGC GGTCGGCTGT CGGATCTATG GAGGGCGCGC GATCCGCGCG GACGGATCTT CGTCTGCATG CTGGCGATCG CTTTGCCCTT GCCCGCCCTG TTGTGGATGT TCACCACGGC GCAGTACGAG ACCTACCGTC TCATCAGTCC GGTGATCTAT CTGGTGAGCA GTTCGTGGGT CGGTTCCGCA GTGGCAAGCT ATCAGGATCT GGTCTTGCCA CGCATGCGGG GGCTCGCCGG TTCGACCTAT CTGCTGGGCG CCACGATGGT GGGCCTGGCC CTTGGCCCCT ACGTCACCGG CAAGGTGGCG ACAGTCACAG GCTCGCTGCA AGCCGGTGTG CTAACGCTGT TTCTGGTGGC CCCCCTGTCG CTCCTGCTTC TCGGGCTCAC CGCGCGTTGG GCTCCGGGTC TGGAGGCCAG CAAGTTCGAC CGCGCGCGCG CCGCCGGAGA GCCGGATGAG CCGGGGCGGG CGACGCCTTT GCCCGCTACG CCCCTCTAG
|
Protein sequence | MKAGATPNRL GGRPADSAAP MGGSYAWYAT GVLALVYVLN FVDRQIISIL AEDIKRDLHV TDAQLGFLYG TAFAIFYALF GIPFGMLADR WRRGRLIAIG LVVWSAMTAA SGFAFNFLQL ALARVGVGVG EATASPAAFS MLGDYFPRER RALAASLYST GLYLGMGLSL PIGGWIAQSW NDTYAAGAAP FGLAGWQVAF LAVGLPGLAM ALWVLTLREP VRGCNDGAPR PLVTPGAGKL FLADLAAILP PLTLWSVSRQ PRMLAVNLAV AVLVAGVATL LCRFVGDPPQ WIAYGVGVYA VFSWVQVIKV TDRPIYALIW GDPRMLVAIV AFGSLSVFVY SYGFWVAPYA IRTFGVTKAM AGIELGIPGA FASAIGVLIG GRLSDLWRAR DPRGRIFVCM LAIALPLPAL LWMFTTAQYE TYRLISPVIY LVSSSWVGSA VASYQDLVLP RMRGLAGSTY LLGATMVGLA LGPYVTGKVA TVTGSLQAGV LTLFLVAPLS LLLLGLTARW APGLEASKFD RARAAGEPDE PGRATPLPAT PL
|
| |