Gene Caul_3250 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3250 
Symbol 
ID5900705 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3514359 
End bp3515660 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content68% 
IMG OID641563755 
Productmajor facilitator transporter 
Protein accessionYP_001684875 
Protein GI167647212 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.613026 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCCGACAT TGCCCCCCGC TGGATCCGCC CAACAGGAAA GCACGCGCTA CCGATACGTC 
GTGGTGGTCT CCCTGGCCGT CGTCTACATG TTCAACTTCA TCGATCGGCA GCTGCTGTCG
ATCCTTGCCG AACCGGTCAA ACGCGACCTG GGGCTCTCCG ACACTCAGCT GGGGATGTTG
ACGGGACTGA TGTTCGCCCT GTTCTACACG GTGTTCGGCA TTCCCGTGGC GCTCCTGGCT
GACCGCTGGC GTCGTGTCCG CTTGATCGCC CTGGCGTGCG GCCTTTGGAG CCTGTTCACC
GCCTCCTCGG GCCTGGCGGT CAACTTCTTC ACCCTGGCGC TGGCTCGGGT CGGGGTGGGG
ATCGGCGAGG CCGGCTGCTC GCCGCCGTCC TATGCGATCA TTTCCGACTA CTTCCCGCCC
GAGCGTCGCG GTCGCGCCCT GGCGATCTAC GTGCTGGGCG TGCCGGCCGG CAGTTTCGTC
GGCGCCCTGG CCGGCGGGTG GATCGCCGCC CACTATGGCT GGCGCGCCGC GTTCTTCGCC
GTCGGGCTGG CTGGTCTCCT GATCACGCCC CTGATTCCGC TGGTCGTCCG CGAGCCGCGC
AGGGGCCGCT ACGACCTCGA AGCCGCTCCG GTCGGCCCCG CCGCGACCTC CAGCGAAACC
CTGTGGGGCG CCTTCGGTTT CTTCTGGCGT TCGCCGACGC TGGTGCTCAG CGCCCTGGCC
TCTGGCGTGA CGGCCTTCGT CAGCTACGGA TTGATCAACT GGTCACCGGC CTTTCTGACC
CGGGTCCAAG GGATGAACCT GAGCCAGGTC GCCGGCTATT TCGGTCTCTC CATTGCCGGC
GCCATGGTCA TCGGCGCCTG GCTCGGCGGC CTGATCTCCG ACCGCGCCGG GGCCCGCAAT
CCGATCTTCT ACGCCTTGTT GCCGGGTCTT GGCCTGCTGA GCATCACACC CTTCCTCTTC
GCCTTCACCA CGGCCGCCAC CTGGCAGGCG TCGCTGGGCC TGCTTATCAT TCCGCTGATC
GCTACCTCCA CCTACCTGGT GCCGGCCCTG GCGCTGCTGC AGAACCGCAC GCCGGCCCGC
TATCGCGCGA CGACCAGTTC GATCCTGCTC TTCCTGATCA ACCTGACGGG ACTGGGGTGC
GGCCCCTTGT TCGTCGGCGC CGTCAGCGAC GCCCTGCAAC CGCGCTATGG CGTGCACGCC
CTGGGCCATG CGTTGCAGTG GCTGACGCCA TTCATCGTCC TGGCCTTCGG CCTGCAATGC
GCCGCCGCCT GGACCCTTCG GCGCAAGGTC GCGGCGGTCT GA
 
Protein sequence
MPTLPPAGSA QQESTRYRYV VVVSLAVVYM FNFIDRQLLS ILAEPVKRDL GLSDTQLGML 
TGLMFALFYT VFGIPVALLA DRWRRVRLIA LACGLWSLFT ASSGLAVNFF TLALARVGVG
IGEAGCSPPS YAIISDYFPP ERRGRALAIY VLGVPAGSFV GALAGGWIAA HYGWRAAFFA
VGLAGLLITP LIPLVVREPR RGRYDLEAAP VGPAATSSET LWGAFGFFWR SPTLVLSALA
SGVTAFVSYG LINWSPAFLT RVQGMNLSQV AGYFGLSIAG AMVIGAWLGG LISDRAGARN
PIFYALLPGL GLLSITPFLF AFTTAATWQA SLGLLIIPLI ATSTYLVPAL ALLQNRTPAR
YRATTSSILL FLINLTGLGC GPLFVGAVSD ALQPRYGVHA LGHALQWLTP FIVLAFGLQC
AAAWTLRRKV AAV