Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2476 |
Symbol | |
ID | 4072100 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2930163 |
End bp | 2933171 |
Gene Length | 3009 bp |
Protein Length | 1002 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984493 |
Product | surface antigen (D15) |
Protein accession | YP_591551 |
Protein GI | 94969503 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4775] Outer membrane protein/protective antigen OMA87 |
TIGRFAM ID | [TIGR03303] outer membrane protein assembly complex, YaeT protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.738372 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTGGTGC GATTGCGTTT CTGGTGCAGA CACGGAAGCT GGCGACGAGG TGCGTTTGCG TGGCTACTGG CCATCGCGCT GGCTTGCGCC TGTTCTGCCC GCCTTGTCGC CCAGGACATC CCTGCTTCCA CCAACGGGCC AGCCGCAAAA CGTATCGCTG AAATCCGGTT CCGGGGCGCG AGCCTAGCCA GCACCGACCC TCTTACCGAG TACCTGACCG TGAAGGTGGG CGATCCCTTT ACCCGCGCCG CGGCGAGCGC AAGTATTAAG GCGCTTTTCG CTACCGGCCT CTTCTCCGAC ATCTCCGCTG AAACCGATCC CGCTCCCAAC GGCGACGTCG TTCTCACCTT CGCCTTGCAA TACCGCTATT TCATTGGTGA TGTGAACCTC AATGGCCGCC CGAAAGGTGC GCCAAGCCTG CGCCAGCTCC TCAACGCCAC TAAGCTCGAA CTCGGACACG CCCTCACCGA CACCGGGATT AAGCAGGCCA TCACGCAAAT GACCGGCGTC ATGGAAGACA ACGGCTACTA CGAGTCGAGT TTCACTTACA CGCTGAAGAA GCATGAAGAC AGCCGGCAGG CGGAAGTATT TTTCCATCTC GCTCCCGGCC CGCTCGCGCG TGTCGGCAAA ATCGAGGTCC ACGGCGAGTC GGGATTCACA CAGGAGGAAG TCGAGTCCAT CACCAAGATC AAGCCCGGCG CCAAGGTCAA GGCGTCCGAC GCCACCCGCG CCCTCGAGCG CCTCCGCAAG AAGTATCAGA AGCGCGACCT CCTCGAAGCC CAGGTCACGC TCGCGCGACA GTCCTACAAC CACGACAGCG ACACCGTGGA TTTCATCTTC ACGGTGCAAC GCGGGCCGGT GGTGAAGATT GACGTTGAAG GCGCGAAACT CAGCCGCGGT AAGATCAAGC GCTATGTTCC TGTGTACGAA GAGAACGCAG CCGACGACGA TCTCCTGAAC GAAGGAACCC GCAACCTCCG CGACTACTAT CAGTCCGAGG GTTATTTCGA CGTCAAGGTC AACTACTCGC GAAACCGCAC CCTCGATAAC CAGAAACTCG ACATCGTTTA CAACGTTGAC GCTGGTGAAC GCCACAGCCT CCAGTCAGTG GACGTCCAAG GCAACAAGTA CTTCCCGAAA GATACGATCC GGGAGCGCCT TAGCGTTCAG ACCGCGACCA TGCTGCTGAC GCACGGCAAG TTCAGCCAGG CCATGCTGGC GCGCGATGTT GCCGCCATCA CCGCCCTCTA TAAGACCAAC GGTTTTCAAG ATGTATCCGT AAAAGCCGAC GTCGAGGACA ACTATCGCGG TAAAAGTGGT GACCTTCGCA TCGTCTTCCG TATCGACGAA GGCGAACAGT CGCGCGTCCA CACTCTCACC GTAATCGGCA ATCTCGCGAT TCCAACCGCC GAGTTCCAGC CGCAGCTCTC ACTCGACGAA GGCCAGCCTT ATTCCGAGTA CGCCGTCGCA GCCGATCGCG ACGCGATCAT CAGTTACTAC TTCAATCGCG GCTTTCCCAA CATGGACATG AAGATCACCA CGCTCCCTTA CAGCGGCGAT CCTCACTCCA TGGACCTGAC TTATGAAATT CACGAGGGCA CCCGCGTCTT CGTAGACCGC GTTTATGTTT ACGGCTTGCA CTACACGCGT CCCGGAGTTG TTGCCAAGCG CATGCACGTG CATGACGGCG ACCCCCTCAG CCAGCTGGAT ATGCTCGATA CCCAACGTCG CCTCTACGAC CTGGGAATCT TCAGCGAAGC CAACGTCGCC ATCCAGGACC CCGACGGTAC CGCCCAGCGC AGGAACGTAA TCTTCCAGCT CGACGAGGCG CGCCGCTGGA CATTCAACTA CGGCGTCGGC TTCGAGTTCG CCACCGGCAG CAGCCAGGGA TCGTCCAACA CCCCGAACGG CACCACCGGA TGGAGTCCAC GTTTCTCGTT CGAACTGACG CGTCTCAACG TCTTTGGACG CGACCATACC TTCGTCATCA AAGCTCGCTA CGGAAAACTC GAACAGCGCG GGCTCGTCAG CTACACCGCG CCGCGCCTAT TCGCAAAGGA AAACTGGCGC CTCTCTCTGA CTGGTTTCTA TGACAAATCC GCCGACGTAC TGACCTTCAC CTCGGAGCGC GCCGAAGGCT CCATCCAGGC CGAGCAGGTC ATCAGTAAGA CATGGACGAT GCTTTACCGA TACAGCTATC GTCGCGTCAA CGTAGATCCC ACCACGCTGC AAATCGATCC CGCACTCATT CCCCTGTATT CGCAACCGAC GCGCATCGGC ATGCCGGGAG TCACCGTCAT CTACGATCGG CGCGACGACC CAATCGACGC CCACAAAGGC ATGTACACCA CCGCCGATAT CGGTATTGCT TCCACCAGGC TCGGTTCTGA AGAGGACTTC AGTCGCATCC TCGTACAGAA CTCCAGCTAC TACCAGTTCG GTGCGAAACA CTGGGTCTTC GCGCGGTCCT TGCGTATTGG TCTCGAGTCG CCATACCAGA ATTCCACTCT CGTCCCGCTG CCGGAACGCT TCTACGCAGG TGGCGGCAAC TCTCTCCGCG GGTACTCCAT TAACCAGGCC GGCCCGCGCG ATCAGTTCAC CGGATATCCC ATCGGAGGCA ACGCGCTTTT CGTGAACAGC CTTGAGTTGC GCATGCCACC ACCAACCTTG CCCTTCGTAG ACGACAACCT CAGCTTTGTC TTCTTTCACG ATATGGGCAA CGTCTTCGAC ACCGTCTCCC ACATGTGGAC TGGCCTCGGG CGGTTGCATC AGCCAACCAT TGCGGCCTGC TCGCAGAAGC CTGCCGACGG CTCAAACCCG CCGCCCTGCG ACTACGGTTA TCTTGCGCAA GCGGTCGGAC TCGGGATTCG CTACCATACG CCGGTTGGGC CCGTCCGATT CGACATCGGC TACGCCATCA ATCCGACGCG CTACCCGATC CTGAACGACA ACTCGACATC GTCCACCACG CGGGTCAACG TCTTCTTTAG CATCGGTCAA ACCTTCTGA
|
Protein sequence | MLVRLRFWCR HGSWRRGAFA WLLAIALACA CSARLVAQDI PASTNGPAAK RIAEIRFRGA SLASTDPLTE YLTVKVGDPF TRAAASASIK ALFATGLFSD ISAETDPAPN GDVVLTFALQ YRYFIGDVNL NGRPKGAPSL RQLLNATKLE LGHALTDTGI KQAITQMTGV MEDNGYYESS FTYTLKKHED SRQAEVFFHL APGPLARVGK IEVHGESGFT QEEVESITKI KPGAKVKASD ATRALERLRK KYQKRDLLEA QVTLARQSYN HDSDTVDFIF TVQRGPVVKI DVEGAKLSRG KIKRYVPVYE ENAADDDLLN EGTRNLRDYY QSEGYFDVKV NYSRNRTLDN QKLDIVYNVD AGERHSLQSV DVQGNKYFPK DTIRERLSVQ TATMLLTHGK FSQAMLARDV AAITALYKTN GFQDVSVKAD VEDNYRGKSG DLRIVFRIDE GEQSRVHTLT VIGNLAIPTA EFQPQLSLDE GQPYSEYAVA ADRDAIISYY FNRGFPNMDM KITTLPYSGD PHSMDLTYEI HEGTRVFVDR VYVYGLHYTR PGVVAKRMHV HDGDPLSQLD MLDTQRRLYD LGIFSEANVA IQDPDGTAQR RNVIFQLDEA RRWTFNYGVG FEFATGSSQG SSNTPNGTTG WSPRFSFELT RLNVFGRDHT FVIKARYGKL EQRGLVSYTA PRLFAKENWR LSLTGFYDKS ADVLTFTSER AEGSIQAEQV ISKTWTMLYR YSYRRVNVDP TTLQIDPALI PLYSQPTRIG MPGVTVIYDR RDDPIDAHKG MYTTADIGIA STRLGSEEDF SRILVQNSSY YQFGAKHWVF ARSLRIGLES PYQNSTLVPL PERFYAGGGN SLRGYSINQA GPRDQFTGYP IGGNALFVNS LELRMPPPTL PFVDDNLSFV FFHDMGNVFD TVSHMWTGLG RLHQPTIAAC SQKPADGSNP PPCDYGYLAQ AVGLGIRYHT PVGPVRFDIG YAINPTRYPI LNDNSTSSTT RVNVFFSIGQ TF
|
| |