Gene Information |
Locus tag | Acid345_3824 |
Symbol | |
ID | 4071108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4520419 |
End bp | 4521639 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637985847 |
Product | hypothetical protein |
Protein accession | YP_592898 |
Protein GI | 94970850 |
COG category | |
COG ID | |
TIGRFAM ID | |
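The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 4520419
end_bp = 4521639

# Inclusive span on the replicon.
gene_length = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last one is the stop codon,
# which is assumed not to contribute a residue to the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1221 0 406
```

Under these assumptions the result agrees with the listed Gene Length (1221 bp) and Protein Length (406 aa).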
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.372888 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.874565 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | TTGAAGATCG GGGAACTCAT CCGGTACGAG CCGTTTGGAG AGCGCTTCGA GGAGACGACT GCGAGGTTTT TGAAGGCACG TTTTGGTGGT GATTGGAAAG TGCGCTGGAG TCCCGGGCGC GTGGGAACGG TGCCAGGCGC GCAGCAATGG CTGGTAAATT ACGAGATCAA CTCGGTGTTC CATCCCACTG CGAGAGCGAA TGTGTTTGAT GTAGTACGCC GCGAGTTCTC GTCTAGCCCG GTACGGTGGA AGCGGCCGTT GCAGCGGATG TATTTTGGGG CATCGGTTTC GAAAGTGTTC GCGCCGGGGA TGGCGCACGC GCGAGTTGAT ATTTCGCCGG CGGTCCCCGA TCCGCAGAAA TGGCTGATTG TGCCGGGGAC GCACAAGGTG AGGTACATCG ACACCGAAGA ACGGCGTGTG TACTGCCATT TGAAGCACGG CTCTCGGATG GATCGTTTTG CGAAGGAAAT CGAGGCACGA AGGTCGGCCT CGGGCGCTGG AGTGGCGGTG CCGGGGATCG TTGGCGAGCT TGGTGAAGAG TGCGTGATCG AGGAGATGGT CGTCGGGACG CCACTGAATC GGCTGTCCGA CGCTAAACTG CAGCAAGATT GTGTACTGCA GGCGAAGAGT TCTATGCAGC CTCTGTACGA CGCGACGGTA TGCCAGGAAC AGCAATCGGA ATACGCGAAA CGACTCTCGG GGGAGATCGC CGCAGCGGTT GCCGGTACGA GGATTGTAGC ATCGCTTCGC GATACAATCC TGAGTGCGGT TGAAAACATT CAGGATTGCC TGCAAGACCC CTCGGTCGTA CAAACAGTCC AGAGTCACGG CGATTTCCAG CCAGCGAACA TTCTCTGGGA CGGCCAGCGG GTTTGGATCA TTGACTGGGA ATACTCGGGG CGACGTCAGC GTGACTACGA TGCGCTGGTT TACGCGTTGC AGTCGCGTTT TGCACGGGGA ATCGCTGCCA GGACGCGAGT GTACTTAAAG GGAATTGCTA CACGGGAGCG GGCAGAGGCT GTCGCTCGAA TTCGTTGTTT CTTGCTCGAA GAGTTTGCGT TTCGCTGTGA AGAACTCACC GCCGCGACGC ATGAGGTAAT TGCGCCGACA TTTCTCGAAC TGCTCGAAGA AGCAGAGGAG ATGCTCGTCG TTTTAAGCGA AGAGGAAAAT GCGGGGAAAT CCAAAGTATC TGTTTTTCAA ACAAACGGCG CCGTTTCGTA G
Protein sequence | MKIGELIRYE PFGERFEETT ARFLKARFGG DWKVRWSPGR VGTVPGAQQW LVNYEINSVF HPTARANVFD VVRREFSSSP VRWKRPLQRM YFGASVSKVF APGMAHARVD ISPAVPDPQK WLIVPGTHKV RYIDTEERRV YCHLKHGSRM DRFAKEIEAR RSASGAGVAV PGIVGELGEE CVIEEMVVGT PLNRLSDAKL QQDCVLQAKS SMQPLYDATV CQEQQSEYAK RLSGEIAAAV AGTRIVASLR DTILSAVENI QDCLQDPSVV QTVQSHGDFQ PANILWDGQR VWIIDWEYSG RRQRDYDALV YALQSRFARG IAARTRVYLK GIATRERAEA VARIRCFLLE EFAFRCEELT AATHEVIAPT FLELLEEAEE MLVVLSEEEN AGKSKVSVFQ TNGAVS
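The gene sequence begins with TTG while the protein begins with M. Under translation table 11 (the bacterial code), TTG is one of the permitted alternative start codons and is read as methionine when it initiates translation; the remaining codon assignments are the same as in the standard code. The sketch below illustrates this on the first 30 nucleotides of the gene sequence. The `translate` helper is hypothetical and written for this example only, and it assumes the input starts exactly at the annotated start codon.

```python
# Standard codon assignments in NCBI order (bases T, C, A, G).
# Translation table 11 shares these; it differs only in which codons may start.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
assert len(AMINO_ACIDS) == 64

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS fragment, stopping at the first stop codon.

    Assumption: the fragment starts at the annotated start codon, so the
    first codon is reported as M whatever it encodes internally.
    """
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record.
first_30 = "TTGAAGATCG GGGAACTCAT CCGGTACGAG"
print(translate(first_30))  # prints: MKIGELIRYE
```

The output matches the first block of the listed protein sequence. The final codon of the gene sequence, TAG, is a stop codon in this table, which is consistent with the protein ending at the preceding residue.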