Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_3438 |
Symbol | |
ID | 5900893 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 3718235 |
End bp | 3719755 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641563944 |
Product | major facilitator transporter |
Protein accession | YP_001685063 |
Protein GI | 167647400 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.356572 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCAGG CGATTACGCC GGGCGGCGAC AAGCCGCTGT ATTCCAACGG CTACAAGGCC ACGGTGCTGG GGCTGCTGCT CGCCACCTAC ACCTTTAATT TCATCGACCG CACCATCATC GCCACCATCG GCCAGGCCAT CAAGGTCGAC CTCAAGCTCA CCGACACCCA GCTGGGCCTG CTGGGCGGGC TCTATTTCGC CCTGCTCTAC ACCCTGCTGG GCATTCCGAT CGCCCGGATG GCCGAGCGCT GGAACCGGGT GACGATCATC TCGATCTCGC TGGTCATCTG GTCGGGCTTC ACGGCCCTGT GCGGCAGCGC CTCCAGCTTC GCCCAGCTGG CCCTCTACCG CTTCGGCGTC GGGGTCGGCG AGGCCGGCTG CTCGCCGCCC AGCCACTCGC TGATCAGCGA CTATTACGAG CCCAAGAAGC GCGCCTCGGC GCTGTCGATC TATTCGTTCG GCATCCCCCT GGGTACGATG TTCGGGGCGG TGGCCGGCGG CTGGCTGGCC CAGGAGTTCA GTTGGCGCGT GGCCTTCGTG ATCGTCGGCC TTCCGGGCGT CATCCTGGCC CTGCTGGTCA AACTCCTGGT CAAGGAACCG CCGCGCGGCC ATTCGGAGAT GAAGGAACGG CCGCTGGAAG CCGAAGACCT CGTCATCGAA CCGATCGCTA CGCCGAAGCT CGGCTTTATC GCCTTCATTC ACCGTGAACT CGACGAGCTG GGCGCGGTGA TGAAGGTGCT GTTTGGCAAG TGGCCCGTCC TGCACATGAT GCTGGGCGTG ACCATCGCCT CGTTCGGCTC CTATGGCTCG GGCGCGTTCG TGCCGCCCTA TTTCGTGCGG ACCTATGGCC TGGGCCTGGC CCAGGTGGGC CTGATCGTCG GGCTGATCGG CGGCTTCTCG GCGGGCGTCG GCACCCTGGT CGGCGGCTTC CTGACCGACT GGTCGGGCAA GCGCAGCGCC AAGTGGTACG CCCTGGTGCC GGCCCTGGGC CTGCTGATCG CCACCCCGAT CTACATCGCC GCCTATCTGC AGACCAGCTG GCAGACCACC GCCCTGATCC TGCTGGTCCC GGGGATCTTC CACTACACCT ACCTGGCCCC CACCTTCGGC GTGGTCCAGA ACTCGGTCGA GCCGCGGCGC CGGGCCACCG CCACGGCCCT GCTGTTCTTC TTCCTCAACC TGATCGCCCT GGGCGGCGGG CCGGTGTTCA CCGGCTGGCT GATCGACCAC CTGGCGCGCT TCAACTTCAA CCACCCCGCC TCCACCAGCC TCTTCCAGGC CCTGGTCGGA TCGTTCGCCG ACCCCGGCGC GGCCAGCTTC ACGGCCCAGT GCCCCGGCGG CCTGGCCCCC AAGGGCTCGC CGGTCGACCT GGCCAAGGCC TGCCACGGCG CCATGGCCCG CTCGACCCAG CAGGGCATCA TCGTCTCGCT GTCCTTCTAC GCCTGGGCCG CCCTGCACTA CGCCCTGGCG GCCATCGGCA TGACCAGGCA CATGCGGGAA CGGGCGGTGG CGCAGGCCTA G
|
Protein sequence | MAQAITPGGD KPLYSNGYKA TVLGLLLATY TFNFIDRTII ATIGQAIKVD LKLTDTQLGL LGGLYFALLY TLLGIPIARM AERWNRVTII SISLVIWSGF TALCGSASSF AQLALYRFGV GVGEAGCSPP SHSLISDYYE PKKRASALSI YSFGIPLGTM FGAVAGGWLA QEFSWRVAFV IVGLPGVILA LLVKLLVKEP PRGHSEMKER PLEAEDLVIE PIATPKLGFI AFIHRELDEL GAVMKVLFGK WPVLHMMLGV TIASFGSYGS GAFVPPYFVR TYGLGLAQVG LIVGLIGGFS AGVGTLVGGF LTDWSGKRSA KWYALVPALG LLIATPIYIA AYLQTSWQTT ALILLVPGIF HYTYLAPTFG VVQNSVEPRR RATATALLFF FLNLIALGGG PVFTGWLIDH LARFNFNHPA STSLFQALVG SFADPGAASF TAQCPGGLAP KGSPVDLAKA CHGAMARSTQ QGIIVSLSFY AWAALHYALA AIGMTRHMRE RAVAQA
|
| |