Gene Caul_0047 details

Gene Information

Locus tag: Caul_0047
Symbol:
ID: 5897759
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 59376
End bp: 61037
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 68%
IMG OID: 641560530
Product: general substrate transporter
Protein accession: YP_001681683
Protein GI: 167644020
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
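
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with one stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(59376, 61037)
print(length)                  # 1662
print(protein_length(length))  # 553
```

Both results agree with the Gene Length and Protein Length fields of this record.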


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.252827
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.746127
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGACCC AGGCTCAGGG CGAGGATCGC GCCTCGATGC GTACGGTGGT GGCGGCCTCG 
TCGGCGGGCA CCGCCTTCGA ATGGTACGAC TTCTTCATCT TCGGCAGCCT GGCCCAGGTG
ATCTCCAAGA CCTTCTTCAC GGGCCTGTCG GAGACGGCCG GCTATGTCGC GGCCCTGGCC
CTGTTCGGGG TGGGTTTCGC CTTCCGGCCG CTGGGCGCGC TGGTGTTCGG CAAGATCGGC
GACCAGGCCG GCCGCAAGGG CGCGTTCCTG GCCACCGTGC TGCTGATGGG CGGCGCCACC
TTCGCCATCG CCCTGCTGCC GACCTACGGG CAGGCCGGAA TCATTTCGCC GATCCTGCTG
ATCCTGCTGC GCTGCGTGCA GGGCTTCGCC CTGGGCGGCG AGTATGGCGG GGCGGCGATC
TATGTCGCCG AGCATTCGCC GCCCAACGAG CGCGGCTGGT CGACCTCCTG GGTGCAGACT
TCGGCGGCGT TCGGCCTGTT CGGCGCCCTG CTGGTGATCC TGCTGACCCG CTGGCTGCTG
GGCGTCCAAT TCGGCCCCGA GGCTTTCGAC GCCTGGGGCT GGCGCGTCCC GTTCGCCGTC
TCGATCGGCC TGCTGGGCGT CTCGGTCTGG ATGCGCCTGA AGCTCAGCGA AAGCCCGGCC
TTCGCTAAGA TGAAGGAAGA GGGCGAGGCC TCCAAGGCGC CCTATGCCGA GGCCTTCGGC
CAGTGGAAAA ACCTCAAGCT GGTGCTGCTG GCCTTCTTCG CCATGATGTC GGCCCAGGGG
GCGGTCTGGT ACACCAGCTT CTTCTACGTC CAGACCTTCA TGGAGAAGTT CCTCAAGGTC
TCGCCCACCA CGATCAACGG CCTGATGATG GCGGCCACGG CGGTCAGCGC CGTCTTCTAC
GTCGTGTTCG GCTGGCTGTC GGACAAGGTC GGCCGCAAGC CGGTGATGCT GGGCGGCATG
ACCTTGGCCC TGGTCTTCTA TTTCCCTGGC TTCCACCTGC TCGAGCGCGC CGCCAACCCG
GCCCTGGCCG AGGCCACGGT CCGGGCGCCG GTCACCGTGA CCGCCGACCC CAGGGACTGC
GCCCTGCAGT TCGATCCGGT CGGCAAGGCC GCCTTCGTCT CGTCCTGCGA CATCGCCAAG
AGCGTCCTGG CCAACGCCGG CATATCCTAC GCCAACCATG CCGGCCCCGC GGCCTCGGCC
GCGGTGGTCC AGGTCGGCGA CACGCGAATC GTCTCCCAGA GCGCCAAGGG ACTGCCCCCC
AAGGAGGCCA AGGCCGTGAA GACCGCCGGC GAGGCCGCGA TCAAGGCCGC CTTGGCCAAG
GCCGGCTACC CCACCAAGGC CGACCCGGCG CGGATGAACT GGTGGGGCAT GTTCGGCGTG
CTGTTCATCT TCGTGGTCGC CGCCACCGCC CTGTTCGGCC CCCTGGCCGC CTGCCTGGTC
GAACTGTTCC CCACCCGAGT GCGCTACACT GCCCTGTCGC TGCCCTACCA CATCGGAACG
GGGTGGATCG GCGGCTTCGT GCCGTTCAGC GCCTTCGCCA TCGTGGCGGC GGTGGGCGAT
ATCTATGCCG GGCTTTGGTA CCCGGTGTTC TTCACCCTGA TCAGCGTGCT GACCACGCTG
TTCCTGCTGC CCGAGACCAA GAACCGGTCC TTGGATCAGT GA
 
Protein sequence
MTTQAQGEDR ASMRTVVAAS SAGTAFEWYD FFIFGSLAQV ISKTFFTGLS ETAGYVAALA 
LFGVGFAFRP LGALVFGKIG DQAGRKGAFL ATVLLMGGAT FAIALLPTYG QAGIISPILL
ILLRCVQGFA LGGEYGGAAI YVAEHSPPNE RGWSTSWVQT SAAFGLFGAL LVILLTRWLL
GVQFGPEAFD AWGWRVPFAV SIGLLGVSVW MRLKLSESPA FAKMKEEGEA SKAPYAEAFG
QWKNLKLVLL AFFAMMSAQG AVWYTSFFYV QTFMEKFLKV SPTTINGLMM AATAVSAVFY
VVFGWLSDKV GRKPVMLGGM TLALVFYFPG FHLLERAANP ALAEATVRAP VTVTADPRDC
ALQFDPVGKA AFVSSCDIAK SVLANAGISY ANHAGPAASA AVVQVGDTRI VSQSAKGLPP
KEAKAVKTAG EAAIKAALAK AGYPTKADPA RMNWWGMFGV LFIFVVAATA LFGPLAACLV
ELFPTRVRYT ALSLPYHIGT GWIGGFVPFS AFAIVAAVGD IYAGLWYPVF FTLISVLTTL
FLLPETKNRS LDQ
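
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of doing that and of computing a GC fraction, which could be compared against the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the input here is only the first row of the gene sequence, not the full record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spaces, line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; returns 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence, copied from the record above.
first_row = "ATGACGACCC AGGCTCAGGG CGAGGATCGC GCCTCGATGC GTACGGTGGT GGCGGCCTCG"
seq = clean_sequence(first_row)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.2%}")
```

Passing the complete gene sequence text through `clean_sequence` should yield a string whose length matches the Gene Length field.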