Gene Caul_1648 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1648 
Symbol 
ID5899103 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1727596 
End bp1728843 
Gene Length1248 bp 
Protein Length415 aa 
Translation table11 
GC content67% 
IMG OID641562137 
Productmajor facilitator transporter 
Protein accessionYP_001683275 
Protein GI167645612 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.0811808 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCAAGG CCATCAGCAG CAACCAACGC CAGGCGGCGC TCGGCTTCAT CTTCGTCACC 
GCCTGCATGG ACGTGCTGTC GCTGGGCGTG ATGATCCCGG TGCTGCCCGA ACTGATGAAG
CGCTTCAACG GCGGCGACAC CGCCGCGACC GCCCTGTGGA TGGTGCTGTT CGCCACCACC
TGGGGCGTGA TGCAGTTCTT CTGCGGACCC ATCCTGGGCC TGATGTCGGA CCGCTTCGGC
CGCCGGCCGG TGATCCTGAC CTCGATCTTC GGCCTGGGCG TCGATTTCCT GTTCATGGCC
TTCGCGCCGA CGATCTGGTG GCTGTTCGTC GGCCGGGTGT TCAACGGCAT GACCGCCGCC
AGCTTCTCGA CCGCCGGGGC CTATGTGGCC GACGTCACCA AGCCGGAGGA CCGGGCCAAG
GGCTTTGGCC TGATGGGCGC GGCGTTCGGC GTGGGCTTCA CCTTTGGCCC GGCGCTGGGG
GCCGTGCTGT GGGGTTTCGA CCATCGCCTG CCGTTCCTGG TCTGCGCGGG CCTGGCCCTG
TGCAACTGGC TCTATGGCTT CTTCGTGCTG CCGGAATCCC TGCCGCCGGA GAAGCGAATC
GCGCGTTTTG ACTGGAAGAA AGCCAACCCC GTCGGCTCGC TGAACCTGCT TAGGAGCAAG
CCCAACCTGC TGGGCCTGGC CGGCGTCGGC TTCCTGTTCC AACTGGCGCA CAACGTCCTG
CCCAGCGTCT TCGTCCTCTA TATGGGCTAT CGCTATCACT GGCCGGTGCT GATCATCGGC
CTGACCCTGA TGGGTAGCGG GATGGCGGGG ATCCTGCTGC AGAGCCTGCT GGTCGGCCCG
ATCGTCAAGA AGGTCGGCGA GCGCGGCGCG CTGTTGATCG GCCTGTTCTC TGGCTGCGTC
GGCTTCATGA TCTATGGGCT GGCCCCTGTC GGTTGGCTCT ATCTATGCGG CCTGCCGATC
TTCGCCTTCT CGGGCCTGAT CCAACCCGGC TTGCAAGGGC TGATGACCCG GCGGGTCCAG
CCGTGGGAGC AGGGCCAGCT CCAGGGCGCG AACGCCGCGA TGATGGGCGT CACCGCCATC
GTCGGACCGA CGCTCTACCT GCTGCCGTTC GCTTGGGCCA TCCGCCACGA CGCCAGCCTG
CACATGCCCG GCCTGCCGGT GCTGATCGCC GCCCTGCTGC TGCTGGCGGC CACGGTGTTG
GCGATCCGCG TGGCGCGGCC CGTGGCGGTG GAACCCAGCG TCGCCTGA
 
Protein sequence
MIKAISSNQR QAALGFIFVT ACMDVLSLGV MIPVLPELMK RFNGGDTAAT ALWMVLFATT 
WGVMQFFCGP ILGLMSDRFG RRPVILTSIF GLGVDFLFMA FAPTIWWLFV GRVFNGMTAA
SFSTAGAYVA DVTKPEDRAK GFGLMGAAFG VGFTFGPALG AVLWGFDHRL PFLVCAGLAL
CNWLYGFFVL PESLPPEKRI ARFDWKKANP VGSLNLLRSK PNLLGLAGVG FLFQLAHNVL
PSVFVLYMGY RYHWPVLIIG LTLMGSGMAG ILLQSLLVGP IVKKVGERGA LLIGLFSGCV
GFMIYGLAPV GWLYLCGLPI FAFSGLIQPG LQGLMTRRVQ PWEQGQLQGA NAAMMGVTAI
VGPTLYLLPF AWAIRHDASL HMPGLPVLIA ALLLLAATVL AIRVARPVAV EPSVA