Gene Caul_5379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_5379 
Symbol 
ID5897112 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010333 
Strand
Start bp88389 
End bp89837 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content66% 
IMG OID641550669 
ProductRND efflux system outer membrane lipoprotein 
Protein accessionYP_001672155 
Protein GI167621647 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01845] efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.320478 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.694464 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCATGC GCACCTATTG GCTCTTCGCC CTGCTTCCTG TGCTGTCCGC CTGCACGGTC 
GGTCCCGACT ATGTCGGCCC ACCGGACAAG GGGATTTCAT CCCGCTTCGT GCGCATGGAG
GACACCGCCG TCATCGCAAC GCCTCTCGTT GCCCGGTGGT GGGAGACGCT GGATGACCCT
ATCCTTGACG GAATTGAAGA GCGGGCGCTG CGTGCAAATC CGAACATGGC GGCTTCCCAG
GCTCGGTTGC GTCAAGCGCG CGCCATGCTG CGCATCGAGC GAGCCAACGC GGCGCCGAAC
GTCGCCGCCG GTGCGCTGGC TGGCCATGGC CGCGTGCCGG AGAGCGGCTC CGGCGCTGCA
GGCGCCATGA TACCGGCTCT GGTCGGAACC GACGAGAGAT CCTTCGATCT ATATACCGCC
GCCTTCGACG CCAGTTGGGA GATCGATCTG TTCGGCGGGC GTCGGCGCGG AGTCGAAGCC
GCAACGGCTA CCGCGCAAGC CGCAGAGGCC ACACTGGCCG ACGCGCAGGT GAGCCTGACC
GCCGAGGTCG CCCATGCCTA TATCGGCCTT CGCGACACAC AGAAGCGGAT CGTATTGGCG
AAGCGGTCGG TGGAACTCCA GCAGCAGATG CTCGACCTCA CGCGCCAGCA GTTCGAAAGG
GGCGTCGCTT CCGCGCTCGA TGTCGAACGG CTCAGCATGC AATTTGAGAA CACCAATGCC
AGGGTGGCTC CACTCGTGGC CCAGGCCGAG GGCTTTATGA ACGCCCTCGC CTGGCTCTGC
GGAGAGCAGC CCGGCGCCCT TGATGCAATA CTCGATCCGC CCCGGCCAGC GCCACTGCCC
CCCCCTGCCG TTGCCATCGG CGATCCGGAA GCCATGCTCA GGCGGAGGCC CGACGTGCGT
GCCGCTGAAC GCCGGCTCGC GGCCGATACC GCCCGTATCG GGGTGGCGGA AGCCGCCAGG
TTCCCGCGGC TCAGTTGGAT GGGCGTCATC GGGATCGGCG GTACTCAACC TTCCGACTTG
ACGCATCTCG ATGATTTCGT CGCACTGGGG GCACCGACGT TGCAGTGGAG CGTGCTGAAT
TTTGGTCGCG TGCAAGGGCA GATCAGGCAG CGCGAGTCGG CGCGAGACGA AGCCGAAGCC
CTGTATGACG CGGCGGTCCT CGGGGCATTA CGAGACACGG AAGACGCGTT GGCGCGGTTC
CGAGCCGGTC GTGCAACCGT CGCGATTCTC GCGCGGGCGA AGGCGTCGGC CGATCGGTCA
GCCAATTTGA CGCAGCAGAA CTACCGAGCC GGCAGGGCAT CGCTCATCGA TGCGCTCGGC
GCAGAGCGAC ATCGGATCGA TGCTGAGGAC GGTCTCTCCG CGGCCACGGC CGGTTTGACG
GCAGACTATG TTGCGCTTCA GAAGGCGCTG GGCCTCGGAT GGCTCGACGG GGGAGTATCT
GCCGGCTGA
 
Protein sequence
MPMRTYWLFA LLPVLSACTV GPDYVGPPDK GISSRFVRME DTAVIATPLV ARWWETLDDP 
ILDGIEERAL RANPNMAASQ ARLRQARAML RIERANAAPN VAAGALAGHG RVPESGSGAA
GAMIPALVGT DERSFDLYTA AFDASWEIDL FGGRRRGVEA ATATAQAAEA TLADAQVSLT
AEVAHAYIGL RDTQKRIVLA KRSVELQQQM LDLTRQQFER GVASALDVER LSMQFENTNA
RVAPLVAQAE GFMNALAWLC GEQPGALDAI LDPPRPAPLP PPAVAIGDPE AMLRRRPDVR
AAERRLAADT ARIGVAEAAR FPRLSWMGVI GIGGTQPSDL THLDDFVALG APTLQWSVLN
FGRVQGQIRQ RESARDEAEA LYDAAVLGAL RDTEDALARF RAGRATVAIL ARAKASADRS
ANLTQQNYRA GRASLIDALG AERHRIDAED GLSAATAGLT ADYVALQKAL GLGWLDGGVS
AG