Gene Caul_1602 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1602 
Symbol 
ID5899057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1691510 
End bp1692736 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content71% 
IMG OID641562089 
ProductBcr/CflA subfamily drug resistance transporter 
Protein accessionYP_001683229 
Protein GI167645566 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00710] drug resistance transporter, Bcr/CflA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.940845 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACG CCGCCCAGCC CGTTCCCGCC GTCATCCCCT GGCGGCTCGT CCTGATGCTC 
GGCGCTCTGA CCGCCTTCGC CCCGATGTCG ATCGACATGT ACCTGTCGAG CATGCCCGAG
ATCGGCCGGC GGCTGCACGC CGGGGCCGAC GACGTCCAGG CCACCCTGGC GGCCTTCTTC
GCCGGCATGG CCATCGGGCA ATTCCTCTAT GGACCGGCGT CGGACCGTTT CGGTCGCCGG
CCGCCGTTGC TGCTGGGCAT CGGCATCTAT GTCGCCGCCT CCGTGGTTTG CGCCCTGGCC
CCCTCCATCG AGGTGCTGAT CGCCGCCCGC TTCGTCCAGG CCCTGGGCGG CTGCGCGGGG
GCGGTGGTGG CGCGGGCGGT GGTCCGTGAC CGCTTCAACC ACGCCGACAC GGCCCGCGTG
CTGTCGCTGA TGACCTTGAT CATGGGCCTG GCCCCGGTGC TGGCCCCGCA GCTGGGCGGG
GTGATCCAGT TCTTCGCCGG CTGGCGGGGC GTGTTCTGGT CGCTCGTGGT GTTCGGCCTG
CTGATCGGCC TGTGGATCGC CCTGGGCCTG AGCGAGAGCC GCTCCGAGGC CACCGCCGTC
CAGGCCCGCT CGGAGAACCC GTTCAAGGCC TATGGCGCGC TGCTGAGCCA GAAGCGGCTG
GTCGGCTACG GCCTGGCGGG GGCCCTGAAC GGCGCGACCC TGTTCACCTA CATCTCGACC
GCCCCGGACC TGGTGATGGG GACCTATGGC CACACGCCGC TGGTGTTCAA CCTGATCTTC
GCCTTCAACG CCGTGGGCAT CATCGGGGCC AGCCAGGTCA ACCGGCTGCT GCTGCGTCGC
GCGACGCCGG ACAGGGTGCT GGTGCGGGCC AGCATCGCCT CGATCGTCGC CGCCTTCCTG
CTGGCCGCCG CCGCCTGGAC CGGAGTGGGC GGACAGTTCA CGGTCCTGCC GCTGCTGTTC
GCCGCCCTGT CGAGCTACGG CCTGATGGCC GGCAACACGA TGGCCGGGGC GCTCAGCGTC
GATCCCAAGC GCGCCGGTTC GATCTCGGCC CTGATGGGCG GAGCCTCGTT CGCGGCCGGC
GCCCTGGCCG CGTGGATCGG CGGCCTGCTG CATGACGGCA CGGCCCGCCC CGTGGCGGCG
GTGATGTTCG CCTGCCTGAT CGGCTCCAGC CTGGCGATCT TCGGCCTGGC GGTCCCGAAG
GGGTTGCGGG GCAAGGCGAG GGTTTGA
 
Protein sequence
MTDAAQPVPA VIPWRLVLML GALTAFAPMS IDMYLSSMPE IGRRLHAGAD DVQATLAAFF 
AGMAIGQFLY GPASDRFGRR PPLLLGIGIY VAASVVCALA PSIEVLIAAR FVQALGGCAG
AVVARAVVRD RFNHADTARV LSLMTLIMGL APVLAPQLGG VIQFFAGWRG VFWSLVVFGL
LIGLWIALGL SESRSEATAV QARSENPFKA YGALLSQKRL VGYGLAGALN GATLFTYIST
APDLVMGTYG HTPLVFNLIF AFNAVGIIGA SQVNRLLLRR ATPDRVLVRA SIASIVAAFL
LAAAAWTGVG GQFTVLPLLF AALSSYGLMA GNTMAGALSV DPKRAGSISA LMGGASFAAG
ALAAWIGGLL HDGTARPVAA VMFACLIGSS LAIFGLAVPK GLRGKARV