Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0248 |
Symbol | |
ID | 4073098 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 260801 |
End bp | 263581 |
Gene Length | 2781 bp |
Protein Length | 926 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637982249 |
Product | surface antigen (D15) |
Protein accession | YP_589327 |
Protein GI | 94967279 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG4775] Outer membrane protein/protective antigen OMA87 |
TIGRFAM ID | [TIGR03303] outer membrane protein assembly complex, YaeT protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCTAC TGTTTTGCGG GATCGCACAA GCGCAAGAGG GAGTCATCGT CGACATCCGG GTCCACGGGA ACCGTCGCAT TCCCGCCGAC ACAGTTAAGT CCCGTATGTT CACGCACGCC GGGGATGTAT ATGACCAGAG CTCCCTGGAA CGAGATTTCA ATGCTCTTTG GAACGCGGGG TATTTCGATG ATCTGCGGCT TGAACGGGAA CAGACGGACA AGGGCTGGAT CATTCACGTT TACGTCAAAG AGAAGCCGAC GATCCGTGAA ATCAAGTACG AAGGCCTGAA CTCCGTCACC CAGAGCGACG TCCTCGATAA ATTCAAAGAA CGCAAGGTTG GTCTTTCCCA GGAAAGTCAA TACGACCCGA CCCGTGTGAA GCGGGCGGAA GTGGTCTTGA AAGAACTGCT CGCGTCGCAC GGCCGCCAGT TTGCGACCAT CCGCACGGAA GTCCGGCCGA TTCCGCCAGC GGCAGTTTCG ATTACGTTCG TAGTGAAGGA AGGACCGAAG GTCAAGGTCG GCAAGATCAT CTTCGAGGGC AACGCACACG TGAAAGCGCG CGAATTGCGC GCGGCGATGA AGAACCTGAA GCCGATTGGC ATTCCGAAAT CGATCTTCCT GGAAAACCTG TTCGCCCGGA CCTTCGACTC GACCAAGCTC GAAGAAGATG CCGAGCGCGT CCGCTACGAC TACCAGACGC GCGGCTACTT CAAGGCGATC GTCGGCGATC CGAAGACCAA GATCCGCGAC GTGAGCGGCA TCAAGTGGTA CATGCCGTGG AAGAAAACGG ACGGCAAAGT GGTGGACATC ACCATGCCGA TCGAGGAAGG CGATCGCTAC AAGCTGAAGG AGATCACCTT CAGCGGCAAT AAGGCGATCA GCAACACCAG GGCGCTCCGC GAAATCTTCA AGATGAAGGA TGGCGACTGG TTCGATGCGG AACTGGTGCG CAAAGGCCTC GACGACTTGA AGAAGGCCTA CGGCGAGTTC GGCTACATCA ACGCCACTGC CGTGCCCGAT ACGCAATTCG ATGATGTCAA CAAGAGCATC ACGCTGAAGG TTGATCTCGA CGAAGGTAAA CAGTTCTCGG TGCGCCGTAT CGAGTTCGTC GGAAATACGA CGACACGCGA TAAGGTCATT CGCCGCGAGT TGGCGCTCGA AGAAGGCGGC ATCTACAACA GTCGGTTGTG GGAGATGAGC CTCCTGCGCC TGAACCAACT CCAGTATTTC GAGCCTTTGA AGGCGGAAAC CGACTCCGAA ACCAAGCAGA ACAACCAGGA CAACACCATC GACCTGACGT TGAAGGTGAG GGAGAAGGGC AAGAACTCCA TTGGTTTGAC GGGTGGCGTC AGCGGCCTGG CGGGATCGTT CATCGGCGTG AATTACACGA CGAACAACCT GCTCGGCAAA GGCGAGACGC TTCAGCTTGA GGCCAACGTC GGACAGTTTG AGCGCAACAT CCAGTTCGGC TTCACCGAGC CGTATGCGTT CGACCGTCCG CTGCAATTGG GCGCGGTGGT GTTCAGCAGC AAGTACGACT ACAACTACGC CAAGCAGCTG GCACTTTCCA CCGGCCAGCA GTTGAACCTG TCGCAGAGCG TGCAGGACAC CCTGCAGAAC TACTCGCAAT CCACGACGGG CTTCACGCTT TCGTCGAGCT ACCCGCTGCA CCGCTCGTTC AAGCGCGTAG GGCTCTCGTA CACGTTCAGC GATTCGTCGG TCCAGACCTT CTCGACGGCT TCGACGCAAT ACTTCCAATA CTTGGCATTC CGCAGCGTTA CCGGTCCGAA CGCGCTCGAA GGCATTCTCA CCAGCAAGGT GACGCCGAGC TTCACCTGGA ACCGCATTGA CAATCCGCAG CGTCCACACC GCGGTAGCAG CTTCTTCCTG GCGGCGGACA TTTCGGGCCT TGGCGGCAAC GTACAGATGA TTCGGCCGGT AACCGAGTAC AAGCGCTTCA TCCCGGTGAA CAAGGGACGC AACGTGTTCG GCTTCCGTGT ACAGGGATCG TTTGTGACGG GCTACGGCGG CAACGTGGCG CCACCGTTCG AACGCTTCTA CATGGGCGGT GAAAACGATC TGCGCGGCTT CGACATCCGC TCGGTATCGC CGACGGCGTT CCTTACCGAC TTCACCTCGA TTGCCTTGAC GAATCCAGAC GGCACGACAG TTCCAATCGA TCCGGCTCAT CCGAATAAGG GTGCATACAC GATTGCGATT CCGGTGCAGC GCATTATCTA TCCGGGTGGC GACACCAGCG TAGTCACCAA CCTCGAGTAT CGCGTACCGA TCGCCGGTCC GGTGACCATC GCAGCCTTCG TGGATACCGG CTGGGACATG GTGCTGCGCA ACGACCAGCT TCGCATTAGC GATCAGCAGT ACAGCACGTT GACCAACACG ATCTTTGGCT GCGTGTACAA CCCGCTGATC CCGCTAACCG TGGGCTGTAC CGGTGGTGGG GATGCGAGCC ACTTCCTGAC AGGGCTCAGC CAGAACATCA CGCCGATCGA CAAGACCAAC TACCAGACGC GTATGTCTAC GGGTTTGGAG TTGCAGGTCA TCATGCCGAT CGTGAATGCG CCGTTCCGAA TTTATTACGC GTATAACCCG TTCCGCCTGG ATACGACGAC AACGACACCG TCGCCGATCA CCCGTTCGAT GTTCCCGGAT GGCGCCGCGG GCGATTACAC GTACAAGCAG GCAATCTCGT TGTATAACCC AACTTACGTG CTGCGCGAGC CTTTGAAGAC GTTCCGCTTC ACGGTGGCTA CAACGTTCTA A
|
Protein sequence | MTLLFCGIAQ AQEGVIVDIR VHGNRRIPAD TVKSRMFTHA GDVYDQSSLE RDFNALWNAG YFDDLRLERE QTDKGWIIHV YVKEKPTIRE IKYEGLNSVT QSDVLDKFKE RKVGLSQESQ YDPTRVKRAE VVLKELLASH GRQFATIRTE VRPIPPAAVS ITFVVKEGPK VKVGKIIFEG NAHVKARELR AAMKNLKPIG IPKSIFLENL FARTFDSTKL EEDAERVRYD YQTRGYFKAI VGDPKTKIRD VSGIKWYMPW KKTDGKVVDI TMPIEEGDRY KLKEITFSGN KAISNTRALR EIFKMKDGDW FDAELVRKGL DDLKKAYGEF GYINATAVPD TQFDDVNKSI TLKVDLDEGK QFSVRRIEFV GNTTTRDKVI RRELALEEGG IYNSRLWEMS LLRLNQLQYF EPLKAETDSE TKQNNQDNTI DLTLKVREKG KNSIGLTGGV SGLAGSFIGV NYTTNNLLGK GETLQLEANV GQFERNIQFG FTEPYAFDRP LQLGAVVFSS KYDYNYAKQL ALSTGQQLNL SQSVQDTLQN YSQSTTGFTL SSSYPLHRSF KRVGLSYTFS DSSVQTFSTA STQYFQYLAF RSVTGPNALE GILTSKVTPS FTWNRIDNPQ RPHRGSSFFL AADISGLGGN VQMIRPVTEY KRFIPVNKGR NVFGFRVQGS FVTGYGGNVA PPFERFYMGG ENDLRGFDIR SVSPTAFLTD FTSIALTNPD GTTVPIDPAH PNKGAYTIAI PVQRIIYPGG DTSVVTNLEY RVPIAGPVTI AAFVDTGWDM VLRNDQLRIS DQQYSTLTNT IFGCVYNPLI PLTVGCTGGG DASHFLTGLS QNITPIDKTN YQTRMSTGLE LQVIMPIVNA PFRIYYAYNP FRLDTTTTTP SPITRSMFPD GAAGDYTYKQ AISLYNPTYV LREPLKTFRF TVATTF
|
| |