Gene Caul_2199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCaul_2199 
Symbol 
ID5899654 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaulobacter sp. K31 
KingdomBacteria 
Replicon accessionNC_010338 
Strand
Start bp2392071 
End bp2393858 
Gene Length1788 bp 
Protein Length595 aa 
Translation table11 
GC content68% 
IMG OID641562691 
Productsurface antigen (D15) 
Protein accessionYP_001683825 
Protein GI167646162 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0729] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.230672 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGTGCGTA AGAGATTTGC CCTTGGCGTA GCCGCGGCGG CCCTATTGGC CGCCGGCCAC 
GGCAATGCAG CCGAGCCCGC GGCGAGCATC ACCGGTGTGG AGGATCGCGC GCTGCGCGAG
GCGATTCAGC GCGCCATGAC CGAATCCAAG GCTCCGCCTC GCAGCCGTTC GGAGGCCCGC
CGCCGAGCCC GGGACGCCGG AGACGACGCG GTCCAGGTGC TGCGGTCCGA AGGCTACTAC
GCCTACGTCG TCGAGCCGGA TGTCAGCGAA TCCGATCCCC CGCGCGCCAT TCTCAAGATC
ACCCCCGGTC CGGTCTTCGT GATCGCCGAC CCCCGGTTGG CTTGGAGCGG CGCTCCGCCG
GACGAAGGCG TCGTCCAGCG CGCTCAGGAC GTCATCCGCC TGACCCCGGG GGAACCGGGG
CGCTCGGTGG ACGTCCTGGC GGCCGAGGGA CGCGTCGTCT CCCAGGTCCA GCAGTTGGGC
TACGCCGACG CTATGGCCGA ACCGCGCGAG GTGATCGTCG ATCACGCCGA CCACACGCTG
CGCCCGACGT TCCGCATCGC GGCCGGTGAC CTCACCCGTG TCGATGGGAT CGAAGTGGTC
ACCCAAGGGC GGTCGAATCC CGCCTGGGTG CAGCATCTGG CGCCCTGGAA GTCGGGCGAT
ATTTACGAGC CGGACGACAT CGCCGAGTTG GAGCGCCGCC TGCGCGACAC CGGCGTCTAC
GACACGGTCT CAGTGTCGCT CGCGCCCAAG GAGAAGGTTC GCCCAGACGG GCTGCGACCC
GTGGTCGTGA CTCTCTCGGA TCGCAAAGCT CACACCTTGG AACTGGGCGC GGGCTATTCG
AGCACCGACG GCCCCGGGGT TGACGCCAAG TGGATCCGCT ACAACCGCCT GCACCGGGCC
GACACCACGA CCTTGACGGC GCGCCTGTCC AGGCTGGACA GCCGGCTCGA GGCCGAACTG
GCCCTTCCGC ACTGGCGGCG CAGCCAGCAG ACCCTGAAGC TCAACACCGC CATTTTCCGT
GACAACACCG ACGCCTATGT CGAGACCGGC GCCCATGTCG GCGCCGACCT GACCCGACGT
CTGCGTCCGA CCGTCTACCG GACCTACGGC GTCTCGCTGG ACGTGTCGCA GATCGATTCT
CCGATCACCA CCAACGGCGT GACCGTCGAT GAGCGGCAGA ACTTCGCGAC CTTCACGCTG
CTCGGCGCCC AGGCCTGGGA TCGGTCCGAC AACGTGCTGG ATCCAACGAA GGGCTGGCGC
CTGGAGGTGC GGGGCGAGCC GACCGCCATC ACCGGCGACC TGACCATGGC GTTCTTCAAG
GTCCAGGCCC AGAGCACGGC CTATCTGCCG TTCGGCAAGG GCGCGCGGAC CGTGCTGGCG
GGCCGCTTCA AGGCTGGACA GATCCTTGGC GGAACCATGC CCGAAGTCCC GGCCTCCAAT
CGCTTCTACG CGGGCGGCGG CGGTTCGGTG CGAGGCTATT CCTATCAGGC GATCGGACCG
CGGATCGGCG ACACCACGAC GCCGCAGGGC GGCCTGTCGC TGTTGGAGAC CTCGATCGAG
ATCCGCCACA AGTTCACCGA GAAATGGGGG GGCGTGGCCT TTATCGACGC CGGCGGCGTG
GGCGTCGACA AATGGCCGAA CGGCGATGAT TTCGGAGTGG GCGTGGGCGT CGGCGTCCGC
TACGACCTGG GTTTCGGACC GATCCGAGCC GACATCGCGG TGCCGATCAG CCGCCGAGAG
GGAGACCCGG CCTTCCAGAT CTACATCAGC ATCGGGCAGA GCTTTTGA
 
Protein sequence
MVRKRFALGV AAAALLAAGH GNAAEPAASI TGVEDRALRE AIQRAMTESK APPRSRSEAR 
RRARDAGDDA VQVLRSEGYY AYVVEPDVSE SDPPRAILKI TPGPVFVIAD PRLAWSGAPP
DEGVVQRAQD VIRLTPGEPG RSVDVLAAEG RVVSQVQQLG YADAMAEPRE VIVDHADHTL
RPTFRIAAGD LTRVDGIEVV TQGRSNPAWV QHLAPWKSGD IYEPDDIAEL ERRLRDTGVY
DTVSVSLAPK EKVRPDGLRP VVVTLSDRKA HTLELGAGYS STDGPGVDAK WIRYNRLHRA
DTTTLTARLS RLDSRLEAEL ALPHWRRSQQ TLKLNTAIFR DNTDAYVETG AHVGADLTRR
LRPTVYRTYG VSLDVSQIDS PITTNGVTVD ERQNFATFTL LGAQAWDRSD NVLDPTKGWR
LEVRGEPTAI TGDLTMAFFK VQAQSTAYLP FGKGARTVLA GRFKAGQILG GTMPEVPASN
RFYAGGGGSV RGYSYQAIGP RIGDTTTPQG GLSLLETSIE IRHKFTEKWG GVAFIDAGGV
GVDKWPNGDD FGVGVGVGVR YDLGFGPIRA DIAVPISRRE GDPAFQIYIS IGQSF