Gene Caul_1281 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_1281 
Symbol 
ID5898736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp1345811 
End bp1346971 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content73% 
IMG OID641561766 
ProductRND family efflux transporter MFP subunit 
Protein accessionYP_001682909 
Protein GI167645246 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0104305 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGTGC ATCAACGGAT CAATAGACGC GGCCTGCGTT CGACTGCGTT ACCGGTGGTG 
CTTTGCATCG GCGCTCTGAG CCTGGCCGCC TGCGACCACA ACGACAAGGC CAAGGCCAAG
GCGACGAAGC CTTCGCAGGC CGCCAGCCAG ACCGTCGGAG TGGCCGTGGT CACCGTCCAG
GCCCTGCCCC GGATCATCAA CGCCTCCGGC ACCGTCACCC CCTGGGAGGA AGTTCCCGTC
GGCGCCGAGA CCGGCGGCCT GACCGCCGTG TCGGTCAACG CCGAGGAAGG CCAGACCGTG
CGCCAGGGCC AAGTCCTGGT GGCGATGAAC GACACCATGC TGCGCGCCCA GGCGCGTCAG
CAGGAGGCCT CCGTGGCCAG CGCCCGCGCC ACCCTGGCCG AGGCGCAGTC CGCCCTGGCC
CGCTCGCGCG AACTGCAGGC CAAGGGTTAT CTGGCCGCCT CGGCGCTCGA CACCGCGAAC
ATGCGCCAGC AGACCGCCAG CGCCCAGGTG GCCGCGGCCG AGGCCGCGCG CGGCGAGACC
CTGGCCCGCC TGGGCCAGGC CGTGGTCCGC GCCCCGGTCT CGGGCCTGAT CAGCCGTCGC
AGCGTCACCA AAGGCCAGAT CATCTCGCCC GGAACCGAGC TGTTCCGCAT CGTCCGCGAC
GGCCGCCTTG AGCTGGACGC CGAGATTCCC GAATCCGACC TGTCGGCGCT GCGCGCCGGC
ATGCCCGCCA CGGTCACCTC CGACCAGGTC GGCCAGACCA CGGGGACGAT CCGCATCGTC
ACCTCCGAGG TCAACACCCA GACCCGCGTC GGCCTGGCCC GCATCAGCCT GGCGCCGGGC
AGCGGCTTCC GCTCCGGCAT GTTCGCCCGC GCCCAGATCG CGGCGGGCTC CCAGCCGGCC
CCGACCATCC CGACCGCCGC GATCCTCTAT CGCCAGAACC AGGCGGGCGT GTTCGTCGTT
GGCGCCAACA ATCGCGCCCA GTTCCGGCGC ATCGACATCC TGGCTCGCAA CGCCGACCGC
ACCGCCGCGG GCGGGCTGAA CCCCGGCGAG CGAGTGGTGG TCGAGGGAGC CGGCTTCCTG
GGCGACGGCG ACGCCGTGCG CGTCGCCCCG ACCTCCGGCA AGGCCCCGGC GCCCGCCGTG
GCCGTCGCGG CGAAACCCTA G
 
Protein sequence
MVVHQRINRR GLRSTALPVV LCIGALSLAA CDHNDKAKAK ATKPSQAASQ TVGVAVVTVQ 
ALPRIINASG TVTPWEEVPV GAETGGLTAV SVNAEEGQTV RQGQVLVAMN DTMLRAQARQ
QEASVASARA TLAEAQSALA RSRELQAKGY LAASALDTAN MRQQTASAQV AAAEAARGET
LARLGQAVVR APVSGLISRR SVTKGQIISP GTELFRIVRD GRLELDAEIP ESDLSALRAG
MPATVTSDQV GQTTGTIRIV TSEVNTQTRV GLARISLAPG SGFRSGMFAR AQIAAGSQPA
PTIPTAAILY RQNQAGVFVV GANNRAQFRR IDILARNADR TAAGGLNPGE RVVVEGAGFL
GDGDAVRVAP TSGKAPAPAV AVAAKP