Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_0847 |
Symbol | |
ID | 5898302 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | - |
Start bp | 900157 |
End bp | 901533 |
Gene Length | 1377 bp |
Protein Length | 458 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641561329 |
Product | major facilitator transporter |
Protein accession | YP_001682476 |
Protein GI | 167644813 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.362439 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.458775 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGCCCC CCAACAGCTG GCGCGCCCTG CTGAACGCCG AACTGGCCCC GCGCTTCGCC CTGCTGTGCC TGGGCATCTG GCTGAACGCG GCCGACACCC TGGTGACGGT GACGATCATG CCCAGCGTCG CCAAGGAGAT CGGCGGCTGG CAGTATTTCG GCTGGTCGAT CGCCGCCTTC CTGCTGGGCT CGATCCTGGC GGGGGCCTGC GCGGGCAAGC TGTCGATCCG TTTCGGCCTG AGGCGCGCCA CCGCCCTGGC CGGGGTGATC TACGCGATCG GCTGCGCCAT GGGGGCCTGC GCGCCGGAGT TCCTGACCTT CGTGGCCGGC CGCCTGGTCC AGGGGCTGGG CGCGGGGGCG ATCGTCTCGC TGTGCTACGT GGCGATCAGC GCCCTGTTCC CCGAAACCCT GTGGCCGCGC GTCTATGGCG CGATCGCCGG GGTGTGGGGC GCGGCGACCC TGCTGGGTCC GCTGTGCGGC GGCCTGTTCG CCCAGGCCCA CTTCTGGCGC GGGGCCTTCT GGCTGTTCGC GATCCAGGGC GTGATCTTCG TCGGCGCGGT GCTGGTCATG GTGCCGGCCG CGCCAAGAGC GCCGGATGGC GGACGCATCC CCGGGCGGCA ACTGGCCCTG CTGACCCTGG GCGTCAGCCT GATCGCCGCC GCCGGCGTCG TGCCCAGCGG ACTGGCGGCG GCCCTGTGCG CGGCGTTCGG GACCCTGGCC ATGGCCGCCC TGCTGCTGGT CAACGGCCGC GCCGACAACC GCCTGCTGCC CCGCGCCGCC GGCGACCTGG CCACCGCCAC GGGCCTGGGC CTGATGGTGA TCTTCTTCTG CGAGGCGGCG ACGGTCGGCT TCTCGGTCTA TGGCCCGACC TTCATCCAGG TGCTGCACGG CGCGGGACCC CTGCTCGGCG GCTATGTGAT CGGCGGCATC GCGGCGGGCT GGACGGCCTG CTCGTTCGTG GTGGCGGGGC TGAAGCCCAG GCACGAGGGC CTGGCCATCC GCCTGGGCGC CGCGATCATC GTGGCCGGCG TCGCCTGGGG CGCGGTCGAG ATGGTGCGGG GCGGGTTGAT CGGCATAACC CTGTCGATGG TCCTGCTGGG CAGCGGCTTC GGGATCTGCT GGGCCTTCCT CGCCAAGCGG ACGATCAGCG GAGCCGGCGA GGCCGAGCAG GCCCTGGCCT CGGCCGCCGT GCCGACTACC CAGTTGATCG GCGGCGCGGT CGGCGCGGCT GCGGCCGGCG CCCTGGCCAA CGCCCTGGGC TTCGCGCACG GCGTCACGCC GGAGAGCGGC GCGGCCCGCG GTCTGTGGCT GTTCGCGGCC TTCGTCCCCC TGGCGGTGGT GGGCCTGGCG GCGGCGTGGC GGCTGGGGCA GGATTAG
|
Protein sequence | MTPPNSWRAL LNAELAPRFA LLCLGIWLNA ADTLVTVTIM PSVAKEIGGW QYFGWSIAAF LLGSILAGAC AGKLSIRFGL RRATALAGVI YAIGCAMGAC APEFLTFVAG RLVQGLGAGA IVSLCYVAIS ALFPETLWPR VYGAIAGVWG AATLLGPLCG GLFAQAHFWR GAFWLFAIQG VIFVGAVLVM VPAAPRAPDG GRIPGRQLAL LTLGVSLIAA AGVVPSGLAA ALCAAFGTLA MAALLLVNGR ADNRLLPRAA GDLATATGLG LMVIFFCEAA TVGFSVYGPT FIQVLHGAGP LLGGYVIGGI AAGWTACSFV VAGLKPRHEG LAIRLGAAII VAGVAWGAVE MVRGGLIGIT LSMVLLGSGF GICWAFLAKR TISGAGEAEQ ALASAAVPTT QLIGGAVGAA AAGALANALG FAHGVTPESG AARGLWLFAA FVPLAVVGLA AAWRLGQD
|
| |