Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Caul_2199 |
Symbol | |
ID | 5899654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Caulobacter sp. K31 |
Kingdom | Bacteria |
Replicon accession | NC_010338 |
Strand | + |
Start bp | 2392071 |
End bp | 2393858 |
Gene Length | 1788 bp |
Protein Length | 595 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641562691 |
Product | surface antigen (D15) |
Protein accession | YP_001683825 |
Protein GI | 167646162 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0729] Outer membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.230672 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGTGCGTA AGAGATTTGC CCTTGGCGTA GCCGCGGCGG CCCTATTGGC CGCCGGCCAC GGCAATGCAG CCGAGCCCGC GGCGAGCATC ACCGGTGTGG AGGATCGCGC GCTGCGCGAG GCGATTCAGC GCGCCATGAC CGAATCCAAG GCTCCGCCTC GCAGCCGTTC GGAGGCCCGC CGCCGAGCCC GGGACGCCGG AGACGACGCG GTCCAGGTGC TGCGGTCCGA AGGCTACTAC GCCTACGTCG TCGAGCCGGA TGTCAGCGAA TCCGATCCCC CGCGCGCCAT TCTCAAGATC ACCCCCGGTC CGGTCTTCGT GATCGCCGAC CCCCGGTTGG CTTGGAGCGG CGCTCCGCCG GACGAAGGCG TCGTCCAGCG CGCTCAGGAC GTCATCCGCC TGACCCCGGG GGAACCGGGG CGCTCGGTGG ACGTCCTGGC GGCCGAGGGA CGCGTCGTCT CCCAGGTCCA GCAGTTGGGC TACGCCGACG CTATGGCCGA ACCGCGCGAG GTGATCGTCG ATCACGCCGA CCACACGCTG CGCCCGACGT TCCGCATCGC GGCCGGTGAC CTCACCCGTG TCGATGGGAT CGAAGTGGTC ACCCAAGGGC GGTCGAATCC CGCCTGGGTG CAGCATCTGG CGCCCTGGAA GTCGGGCGAT ATTTACGAGC CGGACGACAT CGCCGAGTTG GAGCGCCGCC TGCGCGACAC CGGCGTCTAC GACACGGTCT CAGTGTCGCT CGCGCCCAAG GAGAAGGTTC GCCCAGACGG GCTGCGACCC GTGGTCGTGA CTCTCTCGGA TCGCAAAGCT CACACCTTGG AACTGGGCGC GGGCTATTCG AGCACCGACG GCCCCGGGGT TGACGCCAAG TGGATCCGCT ACAACCGCCT GCACCGGGCC GACACCACGA CCTTGACGGC GCGCCTGTCC AGGCTGGACA GCCGGCTCGA GGCCGAACTG GCCCTTCCGC ACTGGCGGCG CAGCCAGCAG ACCCTGAAGC TCAACACCGC CATTTTCCGT GACAACACCG ACGCCTATGT CGAGACCGGC GCCCATGTCG GCGCCGACCT GACCCGACGT CTGCGTCCGA CCGTCTACCG GACCTACGGC GTCTCGCTGG ACGTGTCGCA GATCGATTCT CCGATCACCA CCAACGGCGT GACCGTCGAT GAGCGGCAGA ACTTCGCGAC CTTCACGCTG CTCGGCGCCC AGGCCTGGGA TCGGTCCGAC AACGTGCTGG ATCCAACGAA GGGCTGGCGC CTGGAGGTGC GGGGCGAGCC GACCGCCATC ACCGGCGACC TGACCATGGC GTTCTTCAAG GTCCAGGCCC AGAGCACGGC CTATCTGCCG TTCGGCAAGG GCGCGCGGAC CGTGCTGGCG GGCCGCTTCA AGGCTGGACA GATCCTTGGC GGAACCATGC CCGAAGTCCC GGCCTCCAAT CGCTTCTACG CGGGCGGCGG CGGTTCGGTG CGAGGCTATT CCTATCAGGC GATCGGACCG CGGATCGGCG ACACCACGAC GCCGCAGGGC GGCCTGTCGC TGTTGGAGAC CTCGATCGAG ATCCGCCACA AGTTCACCGA GAAATGGGGG GGCGTGGCCT TTATCGACGC CGGCGGCGTG GGCGTCGACA AATGGCCGAA CGGCGATGAT TTCGGAGTGG GCGTGGGCGT CGGCGTCCGC TACGACCTGG GTTTCGGACC GATCCGAGCC GACATCGCGG TGCCGATCAG CCGCCGAGAG GGAGACCCGG CCTTCCAGAT CTACATCAGC ATCGGGCAGA GCTTTTGA
|
Protein sequence | MVRKRFALGV AAAALLAAGH GNAAEPAASI TGVEDRALRE AIQRAMTESK APPRSRSEAR RRARDAGDDA VQVLRSEGYY AYVVEPDVSE SDPPRAILKI TPGPVFVIAD PRLAWSGAPP DEGVVQRAQD VIRLTPGEPG RSVDVLAAEG RVVSQVQQLG YADAMAEPRE VIVDHADHTL RPTFRIAAGD LTRVDGIEVV TQGRSNPAWV QHLAPWKSGD IYEPDDIAEL ERRLRDTGVY DTVSVSLAPK EKVRPDGLRP VVVTLSDRKA HTLELGAGYS STDGPGVDAK WIRYNRLHRA DTTTLTARLS RLDSRLEAEL ALPHWRRSQQ TLKLNTAIFR DNTDAYVETG AHVGADLTRR LRPTVYRTYG VSLDVSQIDS PITTNGVTVD ERQNFATFTL LGAQAWDRSD NVLDPTKGWR LEVRGEPTAI TGDLTMAFFK VQAQSTAYLP FGKGARTVLA GRFKAGQILG GTMPEVPASN RFYAGGGGSV RGYSYQAIGP RIGDTTTPQG GLSLLETSIE IRHKFTEKWG GVAFIDAGGV GVDKWPNGDD FGVGVGVGVR YDLGFGPIRA DIAVPISRRE GDPAFQIYIS IGQSF
|
| |