Gene Caul_0847 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0847 
Symbol 
ID5898302 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp900157 
End bp901533 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content74% 
IMG OID641561329 
Productmajor facilitator transporter 
Protein accessionYP_001682476 
Protein GI167644813 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value0.362439 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.458775 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCCCC CCAACAGCTG GCGCGCCCTG CTGAACGCCG AACTGGCCCC GCGCTTCGCC 
CTGCTGTGCC TGGGCATCTG GCTGAACGCG GCCGACACCC TGGTGACGGT GACGATCATG
CCCAGCGTCG CCAAGGAGAT CGGCGGCTGG CAGTATTTCG GCTGGTCGAT CGCCGCCTTC
CTGCTGGGCT CGATCCTGGC GGGGGCCTGC GCGGGCAAGC TGTCGATCCG TTTCGGCCTG
AGGCGCGCCA CCGCCCTGGC CGGGGTGATC TACGCGATCG GCTGCGCCAT GGGGGCCTGC
GCGCCGGAGT TCCTGACCTT CGTGGCCGGC CGCCTGGTCC AGGGGCTGGG CGCGGGGGCG
ATCGTCTCGC TGTGCTACGT GGCGATCAGC GCCCTGTTCC CCGAAACCCT GTGGCCGCGC
GTCTATGGCG CGATCGCCGG GGTGTGGGGC GCGGCGACCC TGCTGGGTCC GCTGTGCGGC
GGCCTGTTCG CCCAGGCCCA CTTCTGGCGC GGGGCCTTCT GGCTGTTCGC GATCCAGGGC
GTGATCTTCG TCGGCGCGGT GCTGGTCATG GTGCCGGCCG CGCCAAGAGC GCCGGATGGC
GGACGCATCC CCGGGCGGCA ACTGGCCCTG CTGACCCTGG GCGTCAGCCT GATCGCCGCC
GCCGGCGTCG TGCCCAGCGG ACTGGCGGCG GCCCTGTGCG CGGCGTTCGG GACCCTGGCC
ATGGCCGCCC TGCTGCTGGT CAACGGCCGC GCCGACAACC GCCTGCTGCC CCGCGCCGCC
GGCGACCTGG CCACCGCCAC GGGCCTGGGC CTGATGGTGA TCTTCTTCTG CGAGGCGGCG
ACGGTCGGCT TCTCGGTCTA TGGCCCGACC TTCATCCAGG TGCTGCACGG CGCGGGACCC
CTGCTCGGCG GCTATGTGAT CGGCGGCATC GCGGCGGGCT GGACGGCCTG CTCGTTCGTG
GTGGCGGGGC TGAAGCCCAG GCACGAGGGC CTGGCCATCC GCCTGGGCGC CGCGATCATC
GTGGCCGGCG TCGCCTGGGG CGCGGTCGAG ATGGTGCGGG GCGGGTTGAT CGGCATAACC
CTGTCGATGG TCCTGCTGGG CAGCGGCTTC GGGATCTGCT GGGCCTTCCT CGCCAAGCGG
ACGATCAGCG GAGCCGGCGA GGCCGAGCAG GCCCTGGCCT CGGCCGCCGT GCCGACTACC
CAGTTGATCG GCGGCGCGGT CGGCGCGGCT GCGGCCGGCG CCCTGGCCAA CGCCCTGGGC
TTCGCGCACG GCGTCACGCC GGAGAGCGGC GCGGCCCGCG GTCTGTGGCT GTTCGCGGCC
TTCGTCCCCC TGGCGGTGGT GGGCCTGGCG GCGGCGTGGC GGCTGGGGCA GGATTAG
 
Protein sequence
MTPPNSWRAL LNAELAPRFA LLCLGIWLNA ADTLVTVTIM PSVAKEIGGW QYFGWSIAAF 
LLGSILAGAC AGKLSIRFGL RRATALAGVI YAIGCAMGAC APEFLTFVAG RLVQGLGAGA
IVSLCYVAIS ALFPETLWPR VYGAIAGVWG AATLLGPLCG GLFAQAHFWR GAFWLFAIQG
VIFVGAVLVM VPAAPRAPDG GRIPGRQLAL LTLGVSLIAA AGVVPSGLAA ALCAAFGTLA
MAALLLVNGR ADNRLLPRAA GDLATATGLG LMVIFFCEAA TVGFSVYGPT FIQVLHGAGP
LLGGYVIGGI AAGWTACSFV VAGLKPRHEG LAIRLGAAII VAGVAWGAVE MVRGGLIGIT
LSMVLLGSGF GICWAFLAKR TISGAGEAEQ ALASAAVPTT QLIGGAVGAA AAGALANALG
FAHGVTPESG AARGLWLFAA FVPLAVVGLA AAWRLGQD