Gene Caul_5105 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5105 
Symbol 
ID5897297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010335 
Strand
Start bp21425 
End bp22471 
Gene Length1047 bp 
Protein Length348 aa 
Translation table11 
GC content70% 
IMG OID641555208 
ProductRND family efflux transporter MFP subunit 
Protein accessionYP_001676539 
Protein GI167621754 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0845] Membrane-fusion protein 
TIGRFAM ID[TIGR01730] RND family efflux transporter, MFP subunit 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGTATC GATCCAAATT CACTGTCGCC GCCGTGCTAG GAAGCTTGGC GCTCCCCCTC 
GCGGCCTGCG GCGGCCACAA GGAAGCGGCT GATCCGCGCA CGGAGGTCCC GCTGGTTCGC
GTGGCGACCG CCGGCGGCGC GGCCAAGGCC GAGCGCGGCT TCTCCGGCGT CGTCACCGCC
CGCGTCCAGA GCGATCTGGG CTTCCGTGTC CCAGGCAAGG TCATCGAGCG TCTGGTCGAC
GCCGGCCAGA CCGTGCGCCG CGGCCAACCG CTGATGCGGA TCGACCGCAC CGATTACGCC
CTTGCCCTGG ACGCCATTCG CGCGCGGGCC CAACAAACCG CCGCCGATGA AAAGCGCTAT
CGCGATCTGG TCGCGGTCGG CGCGGTCTCG GCTTCGGCCT ACGACCAGAT CAAGGCCGCC
GCCGACGCGG CTCAAGCCCA GATGCGCGCC GCGCAAAACG AGGCCAACTA CGCCCTGCTC
GTGGCCGACG CCGACGGCGT CGTCGTCGAG ACCCTGGCCG AGCCCGGCCA GGTCGTGACG
GCCGGTCAAA CCGTTGTGCG CCTGGCGCAC GCCGGCGCCC GCGAAGCCAA GATCAGCTTG
CCGGAAACCG TGCGCCCGCA GATCGGCTCG TCGGCGACGG CGGTGCTGTT CGATGGCGGT
CAGATCAGCT CCGCGCACCT GCGACTGCTC TCGGACGCGG CCGACCCTCA GACCCGCACC
TTCGAAGCGC GCTATGTCCT GGAGGGCCAG GCGGCCAGCG CTCCACTCGG CGCGACCGTG
CAGATTTCCC TTCCCGACAC CCGCTCTGGC GCGGCGCTGC AGGTGCCGCT GGCGGCCATC
TATGACACCG GCAAGGGCGC CGGGGTTTGG GTTGTCGATC CGGCCAAGTC GGTTGTCGTC
TGGCGTCCCG TGCAACTGAC CGGCCTGTCG GAAGAAACCG CGACCGTCGC CAGCGGCCTG
TCGTCGGGCG AACGTTTCGT GGCCTTGGGC GCGCACCTCC TGCATGCCGG CCAGAAGGTC
CGCGTCCAAA CCGGAGCGGC CCAATGA
 
Protein sequence
MSYRSKFTVA AVLGSLALPL AACGGHKEAA DPRTEVPLVR VATAGGAAKA ERGFSGVVTA 
RVQSDLGFRV PGKVIERLVD AGQTVRRGQP LMRIDRTDYA LALDAIRARA QQTAADEKRY
RDLVAVGAVS ASAYDQIKAA ADAAQAQMRA AQNEANYALL VADADGVVVE TLAEPGQVVT
AGQTVVRLAH AGAREAKISL PETVRPQIGS SATAVLFDGG QISSAHLRLL SDAADPQTRT
FEARYVLEGQ AASAPLGATV QISLPDTRSG AALQVPLAAI YDTGKGAGVW VVDPAKSVVV
WRPVQLTGLS EETATVASGL SSGERFVALG AHLLHAGQKV RVQTGAAQ