Gene Caul_3438 details


Gene Information

Locus tag: Caul_3438
Symbol:
ID: 5900893
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 3718235
End bp: 3719755
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 68%
IMG OID: 641563944
Product: major facilitator transporter
Protein accession: YP_001685063
Protein GI: 167647400
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
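
The gene length and protein length reported above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length (the function names are hypothetical, chosen for illustration):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Coordinates taken from the record for Caul_3438.
length = gene_length_bp(3718235, 3719755)
print(length)                      # 1521
print(protein_length_aa(length))   # 506
```

Both values agree with the Gene Length and Protein Length fields of the record.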


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.356572
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGCAGG CGATTACGCC GGGCGGCGAC AAGCCGCTGT ATTCCAACGG CTACAAGGCC 
ACGGTGCTGG GGCTGCTGCT CGCCACCTAC ACCTTTAATT TCATCGACCG CACCATCATC
GCCACCATCG GCCAGGCCAT CAAGGTCGAC CTCAAGCTCA CCGACACCCA GCTGGGCCTG
CTGGGCGGGC TCTATTTCGC CCTGCTCTAC ACCCTGCTGG GCATTCCGAT CGCCCGGATG
GCCGAGCGCT GGAACCGGGT GACGATCATC TCGATCTCGC TGGTCATCTG GTCGGGCTTC
ACGGCCCTGT GCGGCAGCGC CTCCAGCTTC GCCCAGCTGG CCCTCTACCG CTTCGGCGTC
GGGGTCGGCG AGGCCGGCTG CTCGCCGCCC AGCCACTCGC TGATCAGCGA CTATTACGAG
CCCAAGAAGC GCGCCTCGGC GCTGTCGATC TATTCGTTCG GCATCCCCCT GGGTACGATG
TTCGGGGCGG TGGCCGGCGG CTGGCTGGCC CAGGAGTTCA GTTGGCGCGT GGCCTTCGTG
ATCGTCGGCC TTCCGGGCGT CATCCTGGCC CTGCTGGTCA AACTCCTGGT CAAGGAACCG
CCGCGCGGCC ATTCGGAGAT GAAGGAACGG CCGCTGGAAG CCGAAGACCT CGTCATCGAA
CCGATCGCTA CGCCGAAGCT CGGCTTTATC GCCTTCATTC ACCGTGAACT CGACGAGCTG
GGCGCGGTGA TGAAGGTGCT GTTTGGCAAG TGGCCCGTCC TGCACATGAT GCTGGGCGTG
ACCATCGCCT CGTTCGGCTC CTATGGCTCG GGCGCGTTCG TGCCGCCCTA TTTCGTGCGG
ACCTATGGCC TGGGCCTGGC CCAGGTGGGC CTGATCGTCG GGCTGATCGG CGGCTTCTCG
GCGGGCGTCG GCACCCTGGT CGGCGGCTTC CTGACCGACT GGTCGGGCAA GCGCAGCGCC
AAGTGGTACG CCCTGGTGCC GGCCCTGGGC CTGCTGATCG CCACCCCGAT CTACATCGCC
GCCTATCTGC AGACCAGCTG GCAGACCACC GCCCTGATCC TGCTGGTCCC GGGGATCTTC
CACTACACCT ACCTGGCCCC CACCTTCGGC GTGGTCCAGA ACTCGGTCGA GCCGCGGCGC
CGGGCCACCG CCACGGCCCT GCTGTTCTTC TTCCTCAACC TGATCGCCCT GGGCGGCGGG
CCGGTGTTCA CCGGCTGGCT GATCGACCAC CTGGCGCGCT TCAACTTCAA CCACCCCGCC
TCCACCAGCC TCTTCCAGGC CCTGGTCGGA TCGTTCGCCG ACCCCGGCGC GGCCAGCTTC
ACGGCCCAGT GCCCCGGCGG CCTGGCCCCC AAGGGCTCGC CGGTCGACCT GGCCAAGGCC
TGCCACGGCG CCATGGCCCG CTCGACCCAG CAGGGCATCA TCGTCTCGCT GTCCTTCTAC
GCCTGGGCCG CCCTGCACTA CGCCCTGGCG GCCATCGGCA TGACCAGGCA CATGCGGGAA
CGGGCGGTGG CGCAGGCCTA G
 
Protein sequence
MAQAITPGGD KPLYSNGYKA TVLGLLLATY TFNFIDRTII ATIGQAIKVD LKLTDTQLGL 
LGGLYFALLY TLLGIPIARM AERWNRVTII SISLVIWSGF TALCGSASSF AQLALYRFGV
GVGEAGCSPP SHSLISDYYE PKKRASALSI YSFGIPLGTM FGAVAGGWLA QEFSWRVAFV
IVGLPGVILA LLVKLLVKEP PRGHSEMKER PLEAEDLVIE PIATPKLGFI AFIHRELDEL
GAVMKVLFGK WPVLHMMLGV TIASFGSYGS GAFVPPYFVR TYGLGLAQVG LIVGLIGGFS
AGVGTLVGGF LTDWSGKRSA KWYALVPALG LLIATPIYIA AYLQTSWQTT ALILLVPGIF
HYTYLAPTFG VVQNSVEPRR RATATALLFF FLNLIALGGG PVFTGWLIDH LARFNFNHPA
STSLFQALVG SFADPGAASF TAQCPGGLAP KGSPVDLAKA CHGAMARSTQ QGIIVSLSFY
AWAALHYALA AIGMTRHMRE RAVAQA
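
The protein sequence above can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration only, not the tool that produced this record. It assumes the sequence is given 5' to 3' on the coding strand and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons). The function name `translate` is hypothetical.

```python
# Codon table built in the conventional TCAG order; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the bases in blocks of 10.
    seq = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence, as formatted in the record.
print(translate("ATGGCGCAGG CGATTACGCC GGGCGGCGAC"))  # MAQAITPGGD

# Last 21 bases of the gene sequence, ending in the TAG stop codon.
print(translate("CGGGCGGTGGCGCAGGCCTAG"))  # RAVAQA
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence.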