Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2804 |
Symbol | |
ID | 4071807 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3318764 |
End bp | 3321781 |
Gene Length | 3018 bp |
Protein Length | 1005 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637984822 |
Product | surface antigen (D15) |
Protein accession | YP_591879 |
Protein GI | 94969831 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4775] Outer membrane protein/protective antigen OMA87 |
TIGRFAM ID | [TIGR03303] outer membrane protein assembly complex, YaeT protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGGAC GTGTCGCTGG TGCAGTGCTG TTCCTTGCTT GCTTGTGTGC GACGAGAATG TGGGCACAGG GATCGGCTGA CCCCGGCACC CAGATCATGC AGTCGTATGA GGGCCAGAAT GTTTCCGTGG TAGAGATCGC AGGCCAGCCT GACATTGATA CCAGCAAATA TGAGTCCGTG CTGAAACAGC ATCAGGGACA GCCGTTCTCG ATGGAAAAGG TCGCTGCAAC AGTCGAGGCT TTGAAAAAGA CCGGCGACTT TAAGGACGTG ATTCTCGACC TGAGGCCGGA GACCTCAGGT GTGCACGTGA TGTTCATTGC GCAGCCGGCG TATTACATCG CTCTCTATGA CTTCCCCGGT GCGCTAAAGA ACTTTCCTTA TTCCCGGCTT ATCCAGGTGG CGAACTACCA GTCACAGGAA CCGTATTCAA AATTAGATAT TGAGACAGCG CAGAAATCGC TGGAGAAATT CTTCCACCAG GTTGGGTTCT TTGAGGCCAC GGTACAGCCC GAAATCCGCT TGAATAAAGA GCATGGAATC GTCAACGTTG ATTTCACTAC GACGCTGAAA CGTCACGCCA AATTTGGAGA AGTTAAAATC GAGGGAGCCA CGCTACAAGA CACTGCCTTT CTCCAAGGCA AAGTTCGCGG ATTCATGGCG CGCACGCGCG GGGCACAGAT CAGACCGGGC AAGCCGTATT CAAGCCGAAA GATCCAGCTT GCAACGAACT ATTTGCAAGG AGCGCTGGCG AAACAACAAC ATCTTGGGGC GGATGTGAAG TTTGTTACGG CAGAATATGA TCCGGCAACC AACGTCGCGA ATGTCGTCTT CCACGTGAAG GTCGGGCCAA AAGTTGAGGT GGACATTGTC GGAGCGCATT TGTGGCCGTG GACGAGGAAG AAGCTCATCC CGATCTATAT TGAGGGTTCT ATTGATGAGG ACCTGATCGA AGAGGGTGAA CGGAATCTGC ACTCGTATTT CCAATCAAAG GGCTTTTACG ACGCGGTCGT AAAAACGGAT GTGAAGCAGA ACGCGGAACT CACGACGATC AGTTACACCA TCACCAAGAA TGAAAAGCAT AAAGTGGAAC GCCTCGATGT TGAGGGAAAC AATTCGATCT CCTCAAAAGA CCTGCTGAAC AACTCCCAAG TGGAGAAGGC GCACTTCTAC AGCCATGGAA AATTCAGCGA GGATCTGGTG CGGAAGTCCT CCGCCAGCCT GCGCGCTGTG TATCTGAACG CCGGTTACAG CAAGGCGAAG GTCACGCCAC GGGTGACGCG CGATCGCGGC AACATCGTGG TGACGTACGT GGTGGAAGAG GGACCACGGG ATTACGTTGC CGAGCTGCAC ATTGTTGGTA ACGATACGGT GCCGATGGAG CAGCTTTCCC CCAAGGGACT ACAGCTTGGC GTGAACCGTC CATACTCGCC GCTCTTTGAG CAGCAGGACC GCAACAATAT CGCCGCGCAT TACCTGACGA ACGGCTATCT CACGGCGGGT GTGACTTCGA AAGCCGTACC GGTGAGCAAG TCCGATCCGC ATCATTTGAT CGTGACCTAC AAAATTCATG AGGGTCCGAA GGTCACGACG GCACGGATCA TTACGGTAGG AAAACAGCAG ACAAAGCAGG AGATCGTGGA CCGTGCTTCT GTCGTGAAGG TCGGCGTGCC GCTCAGCGAA GCAGACCTGC TTTCTTCCGA GAGCCGCTTG TATGCGATGG GAATCTTCGA CTGGGCGCAG GTGGACCCGA AACGCGGAAT CACGAGCCAG AACCAGGAAG AGGTACTGAT CAAAGTCCAC GAGACCAAGC GGAACACGAT TACGTACGGC TTCGGATTTG AGGTGATCAA CCGCGGCGGC AGCGTGCCGG GAGGAACGGT GAGCGTTCCG GGCATTCCGC CGGTTGGCCT GCCGCAGGGG TTCCGGACCA GCGAATCTAC GTTCTGGGGT CCGCGGGGAA CGTTTTCCTA TACGCGTCGC AACGTGCGCG GCCTCGCTGA GAGTTACACG TTGGGGGCGT TTGCCGGACG CCTGGACCAG CGAGTGTTCG GCAACTACAC GATTCCCTAC CTCCTCGCGT CGAGTTGGAG CGGAGCGTTC CAAGTCTCGG GCGAACACGA TGCGACAAAC CCAGTCTTCA GCGCCCTCAA TGGTGCAGCG GGGTTCCAGG TCCAACGCTA CCTCGACGCC AAGCAAACGA AGCAACTCTT CTTTCGATAC AAGTTTCAGT ACACCGACTT GAGCAACATC TTCCCGGGCT TCGAAATCCT GGTTCCGGAG GAAGACCGCC GGGTGCGGCT CTCGACACTA TCCACTTCAT TTGTGCGCGA CACGCGCGAC AACATACTGG ACGCGCACAA GGGCACATAT GGCACGGTAG ACCTCGGAAT TACACCGCAG GCGCTGGGAT CGAGCGAAAC GTTCGCGCGC TTCCTTGGTC AATTCGCATT TTACAAACAG ATTCCGCACG GAATTATCTG GGCCAACAGC TTCCGCTTGG GGATAGAGAC GCCGTTTGGC GGCAGTCATG TTCCGACGAG CGAACTATTC TTCAGCGGCG GCGGCAGCAC ACTGCGCGGC TTCCCGCTGA ACGGTGCTGG TCCGCAGCAG TACACGACGG TATGCGGCGA CCCGAACGAC ACGTCCACCT GCGGCCCGAT TACGGTGCCG ACGGGCGGTA AGCAGCTGAT CATCGTGAAT TCGGAACTGC GGATTCCGCT CAACCAGCTC TACAAGGGGC TGGGAATCGT GCCCTTTTAT GACGGCGGAA ACGTCTATAA GCACGTAGGG TTCAGTAGTT TTTCTACCAA CTGCAATGCG GCCGCTACGA CTAGCACCGG CAGCAATGGA CAAACCGTTA CGCTGGTGGA ACCCTCGTGT TTCACCAGTT CGATTGGATT GGGAGTGCGC TACAACACGC CGATTGGACC GGTACGACTG GATGTTGGCC ACAACTTGAA CCACATAACT GGAATCAAAT CGACCCAGGT ATTCATCACC TTGGGACAGG CATTCTGA
|
Protein sequence | MRGRVAGAVL FLACLCATRM WAQGSADPGT QIMQSYEGQN VSVVEIAGQP DIDTSKYESV LKQHQGQPFS MEKVAATVEA LKKTGDFKDV ILDLRPETSG VHVMFIAQPA YYIALYDFPG ALKNFPYSRL IQVANYQSQE PYSKLDIETA QKSLEKFFHQ VGFFEATVQP EIRLNKEHGI VNVDFTTTLK RHAKFGEVKI EGATLQDTAF LQGKVRGFMA RTRGAQIRPG KPYSSRKIQL ATNYLQGALA KQQHLGADVK FVTAEYDPAT NVANVVFHVK VGPKVEVDIV GAHLWPWTRK KLIPIYIEGS IDEDLIEEGE RNLHSYFQSK GFYDAVVKTD VKQNAELTTI SYTITKNEKH KVERLDVEGN NSISSKDLLN NSQVEKAHFY SHGKFSEDLV RKSSASLRAV YLNAGYSKAK VTPRVTRDRG NIVVTYVVEE GPRDYVAELH IVGNDTVPME QLSPKGLQLG VNRPYSPLFE QQDRNNIAAH YLTNGYLTAG VTSKAVPVSK SDPHHLIVTY KIHEGPKVTT ARIITVGKQQ TKQEIVDRAS VVKVGVPLSE ADLLSSESRL YAMGIFDWAQ VDPKRGITSQ NQEEVLIKVH ETKRNTITYG FGFEVINRGG SVPGGTVSVP GIPPVGLPQG FRTSESTFWG PRGTFSYTRR NVRGLAESYT LGAFAGRLDQ RVFGNYTIPY LLASSWSGAF QVSGEHDATN PVFSALNGAA GFQVQRYLDA KQTKQLFFRY KFQYTDLSNI FPGFEILVPE EDRRVRLSTL STSFVRDTRD NILDAHKGTY GTVDLGITPQ ALGSSETFAR FLGQFAFYKQ IPHGIIWANS FRLGIETPFG GSHVPTSELF FSGGGSTLRG FPLNGAGPQQ YTTVCGDPND TSTCGPITVP TGGKQLIIVN SELRIPLNQL YKGLGIVPFY DGGNVYKHVG FSSFSTNCNA AATTSTGSNG QTVTLVEPSC FTSSIGLGVR YNTPIGPVRL DVGHNLNHIT GIKSTQVFIT LGQAF
|
| |