Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_0178 |
Symbol | |
ID | 4073065 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 189110 |
End bp | 190375 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637982178 |
Product | hypothetical protein |
Protein accession | YP_589257 |
Protein GI | 94967209 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.000132954 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.337224 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCCACCC ATCGTTTGGT TCGCTTGCTT GTGCTCGCGT TGATGCTGAT GAGCATCCCG GCACTTTCTT TCGGAGGCGT GTTCGTTTCC GTTTCGGTTG GACCGCCGCC GATTCCCGTC TATACACAAC CGTTATGCCC GGGCGCTGGC TATATGTGGA CGCCTGGCTA TTGGGCTTGG GGTGACGAAG GCTATTACTG GGTTCCGGGT ACGTGGGTGA TGGCCCCGAC TCCCGGTTTC CTGTGGACGC CTGGCTATTG GGGCTGGGGC GGCGGCGCCT ACCTATGGCA CGGCGGATAC TGGGGCCCGC ACGTCGGCTT CTACGGCGGT ATTAACTACG GTTTCGGCTA CGGCGGCGTC GGCTACGGCG GTGGCTACTG GCACGGCAAC AATTTCTACT ACAACCGCAG CGTGAACAAC GTGAATGTCA CGAATGTGAC GAACGTCTAC AACAAGACCG TCATCGTGAA CAACAACAAC CACGTGAGCT ACAACGGCGG GCACGGCGGC GTGACCCGGC AGCCCACTTC ACAAGAGCGC CAGTGGCAGA ACGAGAAACA TGTTGATGCC ACGAGCGCGC AGCAACAGCA CTTCCAGGAA GCGGGACGCA ACCCGCAGCT TCTGGCGAAG AACAACGGCG GCAAGCCGGC GATCGCTGCG ACCGCACGCC CAGCGGACTT TAAGTCTGCG GTACCGGCAA AGGCAGTGGG TGGCCCAATC AACAAGACGG CTTTGACGGC GACTCCGAAG AACATGCCTG CCCCGAAGAG CAATGCGGCT GCGACGGCAA ATGGTAACGT CAACGCCAAT GCGAAGGGCA ACGCAAACAT TCCGAAGCCT GGCAATGCGA GTGCGAGCAC GAACTCCAAG GTCGGCACGA ATGCGTCCGC GAACACCACG GCGCACAATG TTCCGAAGCC ACCGGCGGCG AGCAGCAATA CACGAGACGT GAACACGGCG CACACGAACA CGACAGCATC GCCGAGCACG AGCACGCATA ATGTTCCGAA GCCGCCAAGC ACGAATGCGA CGTCGACGAA CCGTAGCAAC GCGTCGGTGA ATTCGCCGAA GACGTACTCC TCGCAGCCGA ATACGGCTTC ACACCAGAGC CAGCCGGCGC CTCATTACAG CGCTCCAGCG ACTCATAATC CACCGCCGCA AACGCACGCG GCTCCTCAGG TGCAACATAA TGCGGCGCCA GCACAGCACA GTGCTCCTCC GCAGCACAGC GCACCGGCGA CTCACGACAA CAAACCAAAG CGCTAA
|
Protein sequence | MSTHRLVRLL VLALMLMSIP ALSFGGVFVS VSVGPPPIPV YTQPLCPGAG YMWTPGYWAW GDEGYYWVPG TWVMAPTPGF LWTPGYWGWG GGAYLWHGGY WGPHVGFYGG INYGFGYGGV GYGGGYWHGN NFYYNRSVNN VNVTNVTNVY NKTVIVNNNN HVSYNGGHGG VTRQPTSQER QWQNEKHVDA TSAQQQHFQE AGRNPQLLAK NNGGKPAIAA TARPADFKSA VPAKAVGGPI NKTALTATPK NMPAPKSNAA ATANGNVNAN AKGNANIPKP GNASASTNSK VGTNASANTT AHNVPKPPAA SSNTRDVNTA HTNTTASPST STHNVPKPPS TNATSTNRSN ASVNSPKTYS SQPNTASHQS QPAPHYSAPA THNPPPQTHA APQVQHNAAP AQHSAPPQHS APATHDNKPK R
|
| |