Gene Caul_0585 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_0585 
Symbol 
ID5898040 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp640273 
End bp641901 
Gene Length1629 bp 
Protein Length542 aa 
Translation table11 
GC content67% 
IMG OID641561067 
Productmajor facilitator transporter 
Protein accessionYP_001682216 
Protein GI167644553 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.895306 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAAGGCGG GCGCGACACC CAATCGGCTT GGCGGTCGAC CAGCCGATTC CGCAGCTCCC 
ATGGGGGGGT CCTATGCTTG GTACGCCACA GGCGTGCTGG CGCTTGTCTA CGTCCTGAAT
TTCGTCGACC GCCAGATCAT TTCAATTCTC GCCGAAGACA TCAAGCGTGA CCTGCATGTG
ACGGACGCGC AGCTGGGCTT CCTGTACGGC ACGGCCTTTG CGATCTTCTA CGCTCTTTTT
GGCATCCCCT TTGGCATGCT CGCCGATCGT TGGCGCCGCG GCCGGCTGAT CGCCATTGGA
CTGGTGGTCT GGTCTGCGAT GACCGCCGCG TCCGGCTTCG CGTTCAACTT CCTGCAACTG
GCCCTCGCGC GGGTCGGCGT CGGCGTTGGA GAAGCAACCG CCTCCCCGGC CGCCTTCTCG
ATGCTGGGCG ACTATTTTCC GCGTGAACGC CGCGCGCTGG CGGCCTCGCT CTACTCCACC
GGTCTCTACC TTGGCATGGG CCTCAGCCTG CCGATCGGCG GCTGGATCGC CCAGTCTTGG
AACGATACCT ACGCCGCCGG CGCGGCGCCC TTCGGCCTGG CGGGTTGGCA GGTCGCCTTC
CTCGCCGTCG GCTTACCCGG CCTGGCCATG GCGCTATGGG TGCTGACCCT GCGCGAACCG
GTGCGCGGCT GCAACGACGG CGCGCCGCGT CCGCTGGTCA CGCCGGGCGC CGGCAAGCTG
TTCTTGGCCG ACCTCGCGGC GATCCTGCCC CCGCTCACCC TGTGGTCGGT GTCGCGCCAG
CCCCGCATGC TGGCGGTGAA CCTGGCCGTC GCGGTCCTAG TGGCGGGCGT CGCGACGCTC
CTGTGCCGTT TTGTCGGCGA TCCGCCACAG TGGATCGCCT ATGGGGTCGG CGTCTATGCC
GTGTTCTCCT GGGTCCAGGT GATCAAGGTC ACCGACCGGC CGATCTACGC CCTGATCTGG
GGCGACCCCA GGATGCTGGT CGCGATCGTC GCCTTCGGCA GCCTGTCTGT CTTTGTCTAC
AGCTACGGGT TTTGGGTGGC GCCCTACGCC ATCCGCACCT TTGGCGTCAC CAAGGCCATG
GCCGGGATCG AGCTTGGCAT ACCCGGAGCC TTCGCCTCGG CGATTGGGGT GCTCATCGGC
GGTCGGCTGT CGGATCTATG GAGGGCGCGC GATCCGCGCG GACGGATCTT CGTCTGCATG
CTGGCGATCG CTTTGCCCTT GCCCGCCCTG TTGTGGATGT TCACCACGGC GCAGTACGAG
ACCTACCGTC TCATCAGTCC GGTGATCTAT CTGGTGAGCA GTTCGTGGGT CGGTTCCGCA
GTGGCAAGCT ATCAGGATCT GGTCTTGCCA CGCATGCGGG GGCTCGCCGG TTCGACCTAT
CTGCTGGGCG CCACGATGGT GGGCCTGGCC CTTGGCCCCT ACGTCACCGG CAAGGTGGCG
ACAGTCACAG GCTCGCTGCA AGCCGGTGTG CTAACGCTGT TTCTGGTGGC CCCCCTGTCG
CTCCTGCTTC TCGGGCTCAC CGCGCGTTGG GCTCCGGGTC TGGAGGCCAG CAAGTTCGAC
CGCGCGCGCG CCGCCGGAGA GCCGGATGAG CCGGGGCGGG CGACGCCTTT GCCCGCTACG
CCCCTCTAG
 
Protein sequence
MKAGATPNRL GGRPADSAAP MGGSYAWYAT GVLALVYVLN FVDRQIISIL AEDIKRDLHV 
TDAQLGFLYG TAFAIFYALF GIPFGMLADR WRRGRLIAIG LVVWSAMTAA SGFAFNFLQL
ALARVGVGVG EATASPAAFS MLGDYFPRER RALAASLYST GLYLGMGLSL PIGGWIAQSW
NDTYAAGAAP FGLAGWQVAF LAVGLPGLAM ALWVLTLREP VRGCNDGAPR PLVTPGAGKL
FLADLAAILP PLTLWSVSRQ PRMLAVNLAV AVLVAGVATL LCRFVGDPPQ WIAYGVGVYA
VFSWVQVIKV TDRPIYALIW GDPRMLVAIV AFGSLSVFVY SYGFWVAPYA IRTFGVTKAM
AGIELGIPGA FASAIGVLIG GRLSDLWRAR DPRGRIFVCM LAIALPLPAL LWMFTTAQYE
TYRLISPVIY LVSSSWVGSA VASYQDLVLP RMRGLAGSTY LLGATMVGLA LGPYVTGKVA
TVTGSLQAGV LTLFLVAPLS LLLLGLTARW APGLEASKFD RARAAGEPDE PGRATPLPAT
PL