Gene Caul_1194 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1194 
Symbol 
ID5898649 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1255864 
End bp1257114 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content71% 
IMG OID641561677 
Productmajor facilitator transporter 
Protein accessionYP_001682822 
Protein GI167645159 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCACGC TTTCCAGTCC CACGACTCCG AACGCCTCGA CGACGCCGGA CGCCAGTCCG 
CGCGCGCTCT ACGTCCTGCT GCTGGTGGTG TTCATCAACC TGGTGGGCTT CGGGTTGGTG
ATCCCGCTGC TGCCCTTCTA CGCCAAGTCG CTGAACGCCA GCCCGTGGCA GGTGACGGCG
CTGTTTTCGG CCTATTCGCT GGGTCAGTTC GTCGCCGAGC CGTTCTGGGG CCGGCTTAGC
GACCGCATCG GCCGGCGACC GGTGCTGATC GTCACCATCC TGGCCAACAC CGTCTCCTAT
GTGGCCCTGG CCTTCGCCCC AAACATCGCC TGGGCGTTCG CCATCCGCCT GGCCAGCGGT
TTCGGCAGCG GCAACATCTC GACCATCCAG GGCTACATGG CCGACGTCAC CCCGCCCGAG
AAGCGGGCCG GACGCATGGG CCTGCTGGGC GCGGCGTTCG GCATGGGCTT CGTCGTCGGC
CCCACCCTGG GCGGCCTGCT GCCCGGCCTC GCCAAGGTCT TCGGCCATTC CGACACCGGC
CGCCTGGCCT TCCAGATCCC GCTGCTGACC GCCGCTGTCC TGGCCGCCAT CGCCTCGCTG
GGGGTGTTCC TGTTCGTGGT CGAGAGCCGC GCGCCCAGCG CCAAGGACGC GCCCCGGCCG
CACCGCCGCG AGGCCCTGGA GATGGCGCGC GCCCACCCCG TGCTGTCGCG GGTGCTGCTG
GTCACCCTGA TCTCGACCGC CGCCTTCGCC GGCATGGAGG CGGTCTTCGG CCTGTGGACC
CAGGCCCGGT TCGACTGGGG ACCCAGGCAG GTCGGCCTGT GCTTCGCGGT GATCGGGATC
ATCGCCTCGA TCGGCCAGGG CCTGATCACC GGTCGGCTGG CGCGCCGCTT CGGCGAGGCC
AAGGTGCTGA CCGCAGGCCT GTCGATCATC GCCGTCAGCC TGGCCCTGAC GCCGTTCGTG
CCGACCAGCG CCTTCGTGCC GGTGGTCGTG GGCTGCACGG CGTTCGGCCA GTCGCTGGTG
TTTCCCTGCG TCGCCGCCCT GATCTCGCGC GCCACCCCGC CCGACAAGCA GGGCGCCATG
CTGGGCCTGA ACATGGCCGC GGGCTCGCTG GCCCGCATGG CCGGCCCGAT GCTGGCCGGC
CCGCTGTTCG GCCTGGCGAT CGGCGGCCCC TACTGGCTGG GAGCCGTCTT GATGATCCCC
GCCATCGCCT TCGCCCTGAC GATCGAGCAC CGGGCCAAGG CGGCGGCGTA G
 
Protein sequence
MTTLSSPTTP NASTTPDASP RALYVLLLVV FINLVGFGLV IPLLPFYAKS LNASPWQVTA 
LFSAYSLGQF VAEPFWGRLS DRIGRRPVLI VTILANTVSY VALAFAPNIA WAFAIRLASG
FGSGNISTIQ GYMADVTPPE KRAGRMGLLG AAFGMGFVVG PTLGGLLPGL AKVFGHSDTG
RLAFQIPLLT AAVLAAIASL GVFLFVVESR APSAKDAPRP HRREALEMAR AHPVLSRVLL
VTLISTAAFA GMEAVFGLWT QARFDWGPRQ VGLCFAVIGI IASIGQGLIT GRLARRFGEA
KVLTAGLSII AVSLALTPFV PTSAFVPVVV GCTAFGQSLV FPCVAALISR ATPPDKQGAM
LGLNMAAGSL ARMAGPMLAG PLFGLAIGGP YWLGAVLMIP AIAFALTIEH RAKAAA