Gene Caul_3081 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_3081 
Symbol 
ID5900536 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp3343338 
End bp3344453 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content69% 
IMG OID641563584 
ProductRND family efflux transporter MFP subunit 
Protein accessionYP_001684706 
Protein GI167647043 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.153012 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATCCGCA GACACTTCTT CCTCGTCGCG GCGGTCGTCG CGGTCCTCCT CATGCTCCTA 
GTTGGCGGGC TGAAGCTCGC CTTCGGCTCC AAGGCGCCGG GAGCGGGGGG ACCTGGCGGC
GGCGGACGGG CCACGGTTGT TTCGCAGGTT GTCGTCCAGC CACGCGCCTT CACTGACCGT
GTCGAGGTGC TGGGCGTGGC CAAGGGGCGC CAGTCGGTGA CCATCACCTC CAACACCGCC
GAACTGATCA CCGCCGTTCA TTTCAGCGAC GGTCAGCTGG TGTCCAAGGG CCAGGTGCTG
GTCGAACTCA AGGCTGACCA GGAGACCGCC GGCATCGCGG AGGCCCAGGC CCAGCTGGCC
CAGGCCGAGC GGGAATACGC GCGCTGGAAG ACGCTGGCTG ATCGCGGCGT CGCCCCGCGC
GCCTCGGCGG AGCAGTACAT GGCCGCCCGC GACACCGCCC GCGCCGCCCT GGCCTCGGCT
AGCGCTCAGA AGCTGGACAA GGTGATCCGC GCGCCGTTCT CCGGCCGCGT CGGCATCTCG
GACATCGCGC CGGGCACGCT GATCAGCCCA GGAACCCCGA TCGTCAGCCT CGACGACGTC
TCGCTGATCC GCGTCGATTT CTCGGTGCCT GACCGCTACT TGCCGATCCT GAGCCAGGGC
CTGACCATCA GCGCCGCGCC GGACGCCCTG CCGGGCCAGA TCTTCACCGG CCGCATCGCC
CAGATCGACA CCCGCATCGA CCCGGCCACC CGCGCGATCA AGGCTCGGGC CGAGTTCCCC
AACGCCGACG GGCGTCTCAA GCCGGGCATG CTGATCAAGG TCGGTATCGA CCAGGGCCAG
CGTCAGGCCG TGGCGGCGCC TGAGGCGGCG ATTCAGTTCG AGGGAACCCA GGCCTCGGTA
TTCCTTGTCG CCGACGGACC CAAGGGCAAG ATCGCCCGTC GCACCACGGT GCAGACGGGG
TTGTCGTCGG GCGGCTATGT CGAGATCGTC TCGGGTCTTA AGGCCGGCGA CAGGATCGTC
GCCGATGGTC TCAACCGGGT GCGGGACGGC GCGCCGATCG GCGCTGGCGG TGCGGGCGGC
GCTCAAAAGG GCGGCAACCA GAAGAAGGCC GGCTGA
 
Protein sequence
MIRRHFFLVA AVVAVLLMLL VGGLKLAFGS KAPGAGGPGG GGRATVVSQV VVQPRAFTDR 
VEVLGVAKGR QSVTITSNTA ELITAVHFSD GQLVSKGQVL VELKADQETA GIAEAQAQLA
QAEREYARWK TLADRGVAPR ASAEQYMAAR DTARAALASA SAQKLDKVIR APFSGRVGIS
DIAPGTLISP GTPIVSLDDV SLIRVDFSVP DRYLPILSQG LTISAAPDAL PGQIFTGRIA
QIDTRIDPAT RAIKARAEFP NADGRLKPGM LIKVGIDQGQ RQAVAAPEAA IQFEGTQASV
FLVADGPKGK IARRTTVQTG LSSGGYVEIV SGLKAGDRIV ADGLNRVRDG APIGAGGAGG
AQKGGNQKKA G