Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_1905 |
Symbol | |
ID | 5899360 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 2042734 |
End bp | 2044014 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 641562395 |
Product | major facilitator transporter |
Protein accession | YP_001683532 |
Protein GI | 167645869 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAGACCT CCTCGTCCCC CGCCCTCGCC AAGGGCGCCT GGTACACCCT GGTCATCCTG ACGCTCGTCT ATGTCTCGAA CTCGATCGAC AGGACGGCGA TGTCGATCCT CATCGAACCG GTGAAGGCGG AGTTCAAGCT TTCGGACAGC CAGCTGGGCC TGCTGACGGG CCTCGCGTTC GGCCTGACCT ATGCCCTGGC GGGCCTGCCG TTGGGGTGGC TGATCGACCG GGTGAACCGC ACCAGGCTGC TCGCGGCGGT GGTCGCGATC TGGAGCCTTT GCACCGCCGT CTGCGGCCTC GCCCAAAGCT ATCCGGCGCT GGTGATGGCG CGGCTGGCGG TCGGCGCGTC GGAATCGGCG GCGGCGCCCA CGGCGATGTC GATGATCGCG GACCTCTTTC CCAAGAACCG GCGCTCGACG GCCATGGGCG TGTTCTGGAC CAGCACGGCT TTCGGCACGG CCATCAGCCT CGTGCTCGGC GGCGTGATCG CCGCCAACTA CGGGTGGCGC GCGGCCTTCT TCGTCGCCGG CGTTCCTGGA CTGATCCTGG CCGTCCTGAT CATCCTGACC GTCCGTGAAC CCGCCCGCGA GCGCGATCTC GGCCAAGGCG ACGCCGGGCC GGCGCCGTCG CTGTTTCAGA CGCTGCGGTT CGTCTGCGCT AATCCGACGG TCTTCCACGC CTTCGTCGGC ATAGGGCTGG CCTCATTGGC CATGTCGGGC GTTCCGGTAT GGGCCGCGTC CTTTCTGGTC CGCACCCAGG GCTTCACCCT GCCGCAGGCC GGCCTGATGG CGGGCCTCGG CGTCGGGCTC TTCGGCGCGC TGGGATCGCT CATGGGCGGT CCGGTCGGCG ACGCCGTGGT TCGTCGTTGG GGCGTCCAGG CCTTGCCGGC CGCGCCGATG GTCGCCTGCG TTCTGGCCTG CGCTTCGGGT CTTGTCTTCG CCCTGGGGTC GTCCCTCGCG GTCGTGGCCC TTGGCTTTAT CGTCTTCGAG ATCGTCTCGC GCGGCTTTAC CGCTCCGGCC TATGCGATCC TCGTCACCGG CGTGGAGCCG CGCATGCGAG GCGTCGTCGT GTCGGCGGTC CAAGCCGTGA CCAATCTCAT CGGTTACGGC GTTGGCCCCC TGGTCGTGGG CGTAGTCAGC GACCGCGTCG GGGGAACCCA CTCCCTTAAG GCCGGCATCG CCGCGGTGAT GATCTTCAGC CTATGGTCGG GCCTGCATTT CTTCGCCGCT TGGGCCGCGG CGCGCCGCTC GGAGCGTTTC GCCGGCGGAG CAACCGCATG A
|
Protein sequence | MQTSSSPALA KGAWYTLVIL TLVYVSNSID RTAMSILIEP VKAEFKLSDS QLGLLTGLAF GLTYALAGLP LGWLIDRVNR TRLLAAVVAI WSLCTAVCGL AQSYPALVMA RLAVGASESA AAPTAMSMIA DLFPKNRRST AMGVFWTSTA FGTAISLVLG GVIAANYGWR AAFFVAGVPG LILAVLIILT VREPARERDL GQGDAGPAPS LFQTLRFVCA NPTVFHAFVG IGLASLAMSG VPVWAASFLV RTQGFTLPQA GLMAGLGVGL FGALGSLMGG PVGDAVVRRW GVQALPAAPM VACVLACASG LVFALGSSLA VVALGFIVFE IVSRGFTAPA YAILVTGVEP RMRGVVVSAV QAVTNLIGYG VGPLVVGVVS DRVGGTHSLK AGIAAVMIFS LWSGLHFFAA WAAARRSERF AGGATA
|
| |