Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4269 |
Symbol | |
ID | 4071841 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5073537 |
End bp | 5075501 |
Gene Length | 1965 bp |
Protein Length | 654 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637986301 |
Product | hypothetical protein |
Protein accession | YP_593343 |
Protein GI | 94971295 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTCGGATT TCGAACCATG CTATTTTCGC GGTACCGGTG CACGAAAGAG GTCCATCGGC GTAGATGGAT TCGCATTCGA TGACGCCGAT GGGTCTGTGC GCCTCTTCAT CGCCGAGTTT GGCGGTGGTG AGGAGCCCGA GACCCTCACC CAAACCGACG CCAAGTCCCA TTTCGCTCGC CTTCAGGCTT TTTGTGAAGA GGCTGTATCT GGCAAGCTCC ACCGTGAAAT TGAAGAGAGC AACCCTGCTG CCGGACTCGC GCAATTACTA TTTAAAGAGA GAGGAGCTGT CACACGATTT CGCCTCTATC TGATTACCGA CGCCGAGATG AGTTCCCGTA TAAGGGATTG GCCTGAATCT GAAATCTCGA ACATCAAAGC TGAGTTTCAC ATATGGGACA TCGTCCGTTT TCAAAGAGCA TTCGAATCAC GCACCGGCAA GGATGAACTG GAGGTAGATT TGCAGGAGCT CGTTGAAGGT GGTGTTCCTT GCTTGGGCGC CAGCGTTGAT TCTGACGAAT ATCTTGCGTA TCTATGTGTT ATCCCAGGCG AGGCCCTCGC GAACATCTAC GACGAATACG GGAGTAGGTT ACTGGAAGGA AATGTGCGCT CATTCTTGAG CACGAAGGGC CGGGTCAACA AGGGAATTCG ACAGACAATC CTTACACGAC CTCACATGTT TTTCGCGTTC AACAATGGGA TTGCATGCAC CGCGTCTAAA GTGGATGTCG TCACCGGCCC ATCTGGGTTG CGGATCACTA AAGCGTCAGA CTTACAAATC GTGAATGGAG GTCAGACGAC CGCATCGCTG GCAGCGGCGA AACGAAACGA CAAAGCAGCG TTGGATCACG TTTTCGTCCA GATGAAGCTT TCAGTGGTGC CACCGGAGCG CTCGGGTCAG GTCATACCAG AGATTTCGCG ATGCGCTAAC AGTCAGAATC GAGTGAGTGA CGCGGACTTC TTCTCAAACC ATGAATTTCA TCGAAGAATT GAACAGATTT CTCGAAAATT ATGGGCGCCT GCTGTCGGAG GAGCACAGCA CGGAACTCAA TGGTTCTACG AACGCGCAAG AGGTCAATAC CTGAATGAAC AATCCGGCTT GTCATTGTCC GACCGCAAAC GCTTTGTTCT CCAGCATCCG CGCCATCAGG TGATCGCCAA GACGGACTTG GCGAAGTACG AAAATGCATG GCGGCAACTT CCGCATCTCG TCAGTCAGGG CGCGCAGAAG AACTTCCTCT CATTCAGTTC GTACGCTTCC GACGCCTGGG ACAAGAACGA GGTTCAGTTT AACGATGAGT ATTTCAAGAG GGTAGTTGCG AAGGCAATCT TGTTCCGTCG CACCGAACAA ATCGTGTCAA AACAACGTTG GTATCAGGGC GGATATCGCG CGAATATAGT CGCTTATTCT ATCTCTAAGT TGTCACGGAT GATCGAAGTC GAAGCACCGG GTCGCGCGCT AGATTTCCGA AGCATCTGGT TACGGCAAGC ATTAACGCCC GCGACCGAGA CTCAGATTGC AAAAATCGCG GAATCAGTTT TCGACGTGAT TGTAAATCCT GCTGGTGGAT TTCAAAACAT AACTGAATGG GGCAAGAAGG AGCTCTGCTG GAAGCGAGTT GCTGAACTCG AGATTCCGCT CGATTCGACG TTCTACAAAG AACTGGCCGA TCATGAATCT GATCTCCAGC AAAAAAAGGA CTCTGCAGTT GATCAGAAGA TCGAGATCGG GATTGAACAA CAGGCGTCGG TGTTACAGCT TGGCGCGTCG TATTGGAAGC AGATCCGGGA GTTCGGCGCG CGTGAAGGCT TGTTGAGCCC CGACGACATT TCCATTCTCG GGTTGGCGTG CCTAATTCCG AACAAGGTTC CGTCTGAAAA ACAGAGTTCA CGATTGTTAC AGATGAAAAC TCGCATGGAG TCGGAGGGTT TCCCCATTCG GACGATAGCG ACTGGAGCGT CCTAG
|
Protein sequence | MSDFEPCYFR GTGARKRSIG VDGFAFDDAD GSVRLFIAEF GGGEEPETLT QTDAKSHFAR LQAFCEEAVS GKLHREIEES NPAAGLAQLL FKERGAVTRF RLYLITDAEM SSRIRDWPES EISNIKAEFH IWDIVRFQRA FESRTGKDEL EVDLQELVEG GVPCLGASVD SDEYLAYLCV IPGEALANIY DEYGSRLLEG NVRSFLSTKG RVNKGIRQTI LTRPHMFFAF NNGIACTASK VDVVTGPSGL RITKASDLQI VNGGQTTASL AAAKRNDKAA LDHVFVQMKL SVVPPERSGQ VIPEISRCAN SQNRVSDADF FSNHEFHRRI EQISRKLWAP AVGGAQHGTQ WFYERARGQY LNEQSGLSLS DRKRFVLQHP RHQVIAKTDL AKYENAWRQL PHLVSQGAQK NFLSFSSYAS DAWDKNEVQF NDEYFKRVVA KAILFRRTEQ IVSKQRWYQG GYRANIVAYS ISKLSRMIEV EAPGRALDFR SIWLRQALTP ATETQIAKIA ESVFDVIVNP AGGFQNITEW GKKELCWKRV AELEIPLDST FYKELADHES DLQQKKDSAV DQKIEIGIEQ QASVLQLGAS YWKQIREFGA REGLLSPDDI SILGLACLIP NKVPSEKQSS RLLQMKTRME SEGFPIRTIA TGAS
|
| |