Gene Caul_5103 details

Gene Information

Locus tag: Caul_5103
Symbol:
ID: 5897295
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010335
Strand:
Start bp: 19138
End bp: 20646
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 70%
IMG OID: 641555206
Product: RND efflux system outer membrane lipoprotein
Protein accession: YP_001676537
Protein GI: 167621752
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID: [TIGR01845] efflux transporter, outer membrane factor (OMF) lipoprotein, NodT family
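
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the stop codon. The record does not state either assumption, although both are consistent with the values it reports. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Caul_5103.
length = gene_length_bp(19138, 20646)
print(length)                     # 1509
print(protein_length_aa(length))  # 502
```

Both results match the Gene Length and Protein Length fields of the record.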


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCAG TTCCCGGGAC GCTTTCTCTC CTGCTCTCGA CGAGCCTGCT TGCCGGCTGC 
GCCGTGGGTC CCAACTATGT TCGACCGACC TTGGCGGCGC CGGCCGCTTT CATGGGGCAA
GCGGCGCTGG ACCAGCGCCA CGCGCCCGCG CCGGTCGAAC AGGTCGCCGA CCAGACGGCT
TGGTGGCGCC AGTTCGACGA TCCGGTTCTG ACGCGCCTGG TGGGCGTGGC GCTCGACCAG
AACCTGGACC TGGCCCAGTC GGTGGCGCGC GTGTCCCAGG CCCGCGCCTC GCTGCGCTCG
GCCAACGCCG CCCTGCTCCC GTCGGGCAAC ATCTCCGCCC AGGGCACCAA GGCCCATCTG
TCGACCCAGA CGCCGTTGGG GCGTCTGCTC AACGCCGACC CGGGCTTTGA CCGCAATGGC
TCGCTCTACG AGGCCAATAT CGACGCCAGC TGGGAGATCG ACGTGTTCGG CGGTCTTCGT
CGCGACCAGG AAGCCGCGCG AGCCGAGTAT CAGGCTGCTC AAGCCGATGT CGCCGCGGCG
CGGCTGGCGA TCGCGGCCCA GACCGCAGAC ACCTATGTGG TCATTCGAGG CCTGCAGTCG
CGCCTGAAGA TCGCTCGCGA CCAGGTGCAG ACCCAGACCA AGCTTCTGTC GACTGTGAAC
TACCAATTTG GCAAGGGCGT CGCGGCCGAA CTCCAGGTGC GCCAGGCCGA GGGCGCGTTG
GCCGAGACCC AGGCAGGCGT GCCGATGCTG GAAAACGGCC TGGAAGCGGC GCTGAACGCG
CTTGACGTCC TTCTGGGCGC GCAGCCGGGG ACCTATCGCG CCGAACTGAG CGCGGCCTCG
CCGGTGCCGA TCGCTCCCAC GATCGCCAAC GCCAGCGGTC CCGCCGACCT GATCCGGCGG
CGTCCGGACC TGGTGGTGGC CGAACGGCGT CTGGCTGCGT CGAACGCGAT GATCGGCTCG
GCGCTGTCGG AATACTACCC GAAGTTCTCC TTGAGCGGGC TGGCCGGTAC GGCCACGACC
GCTCGGAGCG GCGTCTTCGA CAACGGCGCC AATCAAGCCC AAGGGGTCCT TGGTCTGCGC
TGGCGGCTGT TCGACTTTGG CCGCGTCGGC GCTGAGGTCA AGGCGGCCAA GGGGCGCAAC
GCCGAGGAAC TGGCTCGCTA CCAGCAAGCC GTCCTGCAGG CGACCGAGGA CGTCGAGAAC
GCGTTCTCCA GCCTGGTCAA GCGCGAGAGC CAAGAGCAAA CCCTGGCGGG TGGGGAGACC
TCGCTCACCA AGGCGCGAGA CGCCTCGCTG GCCGCCTACA AGGGCGGCGT GGTCAGCCTG
ATCGAGGTTC TGGACGCCGA CAATCGGCTC CTGGCCACCC GCGATGCGCG CGCTCAGGCC
CAAACGGAAT CGGCGCGCGC CGCCATCGCG TCGTTCCGGG CGTTGGGCGG TGGCTGGGAC
GCTCAAGCGG CGGTCGTGGC CGCGGGGCAG GGGCAAGGCG CAACCGGCGC CCCGGCGCCA
CGCGGCTAG
 
Protein sequence
MRAVPGTLSL LLSTSLLAGC AVGPNYVRPT LAAPAAFMGQ AALDQRHAPA PVEQVADQTA 
WWRQFDDPVL TRLVGVALDQ NLDLAQSVAR VSQARASLRS ANAALLPSGN ISAQGTKAHL
STQTPLGRLL NADPGFDRNG SLYEANIDAS WEIDVFGGLR RDQEAARAEY QAAQADVAAA
RLAIAAQTAD TYVVIRGLQS RLKIARDQVQ TQTKLLSTVN YQFGKGVAAE LQVRQAEGAL
AETQAGVPML ENGLEAALNA LDVLLGAQPG TYRAELSAAS PVPIAPTIAN ASGPADLIRR
RPDLVVAERR LAASNAMIGS ALSEYYPKFS LSGLAGTATT ARSGVFDNGA NQAQGVLGLR
WRLFDFGRVG AEVKAAKGRN AEELARYQQA VLQATEDVEN AFSSLVKRES QEQTLAGGET
SLTKARDASL AAYKGGVVSL IEVLDADNRL LATRDARAQA QTESARAAIA SFRALGGGWD
AQAAVVAAGQ GQGATGAPAP RG
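
The sequences above are displayed in blocks of 10 characters separated by spaces and line breaks, so they need to be joined before any analysis. The following is a minimal sketch of how that might be done, together with a GC-fraction calculation and a basic check that a coding sequence ends in a stop codon. All function names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). Only short fragments of the gene sequence are used here, not the full sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# First two blocks and the final line of the gene sequence shown above.
first_blocks = clean_sequence("ATGCGTGCAG TTCCCGGGAC")
last_line = clean_sequence("CGCGGCTAG")

print(first_blocks)               # ATGCGTGCAGTTCCCGGGAC
print(gc_fraction(first_blocks))  # 0.65
print(ends_with_stop(last_line))  # True
```

Applying `gc_fraction` to the full cleaned gene sequence would give the value to compare against the GC content field of the record.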