Gene Caul_4662 details


Gene Information

Locus tag: Caul_4662
Symbol:
ID: 5902124
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 5039668
End bp: 5040873
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 67%
IMG OID: 641565181
Product: OmpA/MotB domain-containing protein
Protein accession: YP_001686280
Protein GI: 167648617
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
        [COG3637] Opacity protein and related surface antigens
TIGRFAM ID:
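
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 5039668
end_bp = 5040873

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, "bp")      # compare with the Gene Length field
print(protein_length, "aa")   # compare with the Protein Length field
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.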


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.287057
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTTC AACTCCTGGC AGGGGTGGCT CTGGCCGCGG TGTTCGCCGC TGGCGCCGCT 
TCGGCCCAAG AGACCGGCTG GTACGGCGCG GTCGACCTTG GCTATCACTG GCCCGAGGGC
ATCAACACCG AATCGAGTCA GAACCTGCCG GACGGCGCCC ACGCTCACTG GACGTGGTCG
ACCGACGACG ACTGGGCCGG CTTCGTTCGC CTGGGCTATC AGTTCACCCC GAACTGGCGC
GCCGAACTCG AGGGCGGCTA TCGCCCCGGT GATCTGGTGG GCGTCCGCGG CAACCCGGTC
CGTCAGCAGC CGATCGGCCT CTGCACCCCC GGCGTGACCC GCACCACGGC CGCTCCCAAG
TGCGGCAGCC CGTCGGGCTC GATCGACTCC TGGTCGCTGA TGGTCAACGT CCTGTACGAT
TTCGCGCCGG ACTCGTGGAT CAACCCCTAT CTGGGCGCCG GTGTCGGCAT CAATCGCCTG
GATGTCAGCG CCCTGGGTCA ATTCAGCGGC GTTCCCGGTC CGATTAACGC CGGCAACCCG
GCGATCCAGA ATCTGACCGT TGACGACAAC GACATGGCCG TCGCCTGGCA AGCCATCGCC
GGCGCTTCGA TCAAGGCGAC CGACAAGCTG AAGGTCGACG TCACCTACCG CTGGTTCGCC
ACCCAGGATC AAGCCTGGAA CGCGACCGGT TCGCACCTGC TTCAGACGGG CAACTTCGAG
GGCCAGTACA AGGATCAATC GCTGACCGTC GGTCTGCGGT ATTCCTTCGC TTCGCCGCCC
CCGCCCCCGC CGCCGCCTCC TCCGCCCCCG CCTCCCCCGC CGCCGCCTCC CCCGCCTCCC
CCGCCTCCCC CGCCGCCTCC GCCGGCGTAT GAGGCTCGCG AGTTCATCGT CTACTTCCCG
TTCGATCAGT ACGTGCTGAC GCCGGAAGCC CAGTCGGTGG TTTCGGAAGC CGCGAACTAC
GCGACCTCGG GTCACGCCAC CAAGCTGGTG GTTGTCGGTC ACACCGACAC CTCGGGTTCG
CCGAAGTACA ACGCCAAGCT CTCGGAGCGT CGTGCGAAGG CCGTGGCTGA CGCCCTGGTC
GGCGCCGGTG TCGCCGCTGA CGCCCTGGCC GTTGATTGGA AGGGCGAAAG CGCCCCCGCC
GTCGCCACCG GCGATGGTGT GAAGGAACCG CTGAACCGCC GTTCCACGAT CTCGATCAAC
TTCTAA
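
The gene sequence is printed in space-separated blocks of 10 bases, so it needs light cleaning before any computation. The following is a minimal sketch of how a GC percentage could be recomputed from such text. The helper names `clean` and `gc_percent` are hypothetical, and only the first row of the sequence is used here; pasting all rows into the string would give the figure for the whole gene, which can then be compared with the GC content field.

```python
def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied as printed (six blocks of 10 bases).
first_row = "ATGAAACTTC AACTCCTGGC AGGGGTGGCT CTGGCCGCGG TGTTCGCCGC TGGCGCCGCT"

row = clean(first_row)
print(len(row), "bases,", round(gc_percent(row), 1), "% GC")
```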
 
Protein sequence
MKLQLLAGVA LAAVFAAGAA SAQETGWYGA VDLGYHWPEG INTESSQNLP DGAHAHWTWS 
TDDDWAGFVR LGYQFTPNWR AELEGGYRPG DLVGVRGNPV RQQPIGLCTP GVTRTTAAPK
CGSPSGSIDS WSLMVNVLYD FAPDSWINPY LGAGVGINRL DVSALGQFSG VPGPINAGNP
AIQNLTVDDN DMAVAWQAIA GASIKATDKL KVDVTYRWFA TQDQAWNATG SHLLQTGNFE
GQYKDQSLTV GLRYSFASPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPAY EAREFIVYFP
FDQYVLTPEA QSVVSEAANY ATSGHATKLV VVGHTDTSGS PKYNAKLSER RAKAVADALV
GAGVAADALA VDWKGESAPA VATGDGVKEP LNRRSTISIN F
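
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (the two tables differ in which start codons are permitted). The function name `translate` is hypothetical, and only the first 60 bases of the gene are translated here.

```python
# Build the codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":  # stop codon: end of the protein
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence (60 bases, i.e. 20 codons).
first_row = "ATGAAACTTC AACTCCTGGC AGGGGTGGCT CTGGCCGCGG TGTTCGCCGC TGGCGCCGCT"
print(translate(first_row))
```

The output can be compared with the first two 10-residue blocks of the protein sequence above.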