Gene Information |
Locus tag | Acid345_2674 |
Symbol | |
ID | 4071928 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3151725 |
End bp | 3155123 |
Gene Length | 3399 bp |
Protein Length | 1132 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637984691 |
Product | hypothetical protein |
Protein accession | YP_591749 |
Protein GI | 94969701 |
COG category | |
COG ID | |
TIGRFAM ID | |
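
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 3151725
end_bp = 3155123
gene_length_bp = 3399
protein_length_aa = 1132


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

# Compare the derived values with the ones listed in the record.
print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

Under these assumptions the derived gene length and protein length agree with the listed 3399 bp and 1132 aa.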
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTAA AGTTCTGGGC CCTTGCTTTC CCGCTTCTTC TCTTACTAAC TTCATTTTCT GTCGCGCAAA ACGTCGTCAC TGGCGACGTC TCTGGTACCG TCACTGATCC GTCCGGCGCC GTAGTGCCCA ACTCCAAAGT CGAGCTAAAG AGTTCTGAAA CCGGCTTCGA CAAAGTTGTC ACCACCAGCA ATGCCGGAGA TTTCCGCTTC TCTCTCCTGA AGCCCGGTCC TTACACCATC ACCATCAGCA ACTCGTCATT CACCACGGTC ACGCGACATC TCACCGTGAA TCTCGGACAA ATCACCAACG CCTCGACTGC CCTCAGTGTC GGCGCCAGCA CCACCACGGT CGAAGTCAGT GGTGAAGCTC CTCTCCTGCA GACGGAAAAC GGCAACATCG CGACGACCTT CGACAGCCGT GCCGTTGCGC AGCTTCCCAA TCCCGGCGGC GACATCACCT ACTATGCGCA GACAGCCCCC GGTATCGCGA TGAACACCAG CGGTGGCGGC TACGGAAACT TCTCCGCCTT CGGTCTCCCC GCCACGTCGA ATCTCTTCAC CGAAAACGGT AACGACGAAA ACGATCCCTT CCTCAACCTC AACAATTCCG GCTCTTCGAA CCTTCTTCTC GGAAAGAATG AGATCCAGGA AGTCGCCGTC GTCACCAACG GCTACACCGG CCAGTACGGA CGCATGGCCG GAGCCAACGT GAACTACACC ACCAAGAGCG GCACCAACCA GTACCACGGC TCAGCCACGT ACGATTGGAA TGGCGGCGTG ATGAACGCCA ACGAGTGGTT CAACAAGGGC CAGGGCAATC CGCGGCCATT CGCCAACAGC AATCAGTGGG GCGCAGATTT CGGCGGTCCC ATCGTGAAGA ACAAAACGTT CTTCTACGTC GATACCGAAG GCATCCGCTA CATTCTGCCG TCCACGCAGA ATATCTATAT GCCGACGCCG GCTTTCGCCA CCTCCGTGCT TGCAAACATC GGCGCGAACT CGCCGAGTCA GCTTCCCTTC TATCAGCAGA TGATGGGTCT CTATCAAAGC TCCGCGCCTT ACGGTCAGAT GAAGCCCTTC AACGCTTCAC CCGCATCTGC CACCGATAAC ACTGGCGGTT GCGGCGACAT CTCCGCCTCC ACCGGCTTCG GCGCCGGCAA TCCTTGCGTC GGATACTTCC TCGCCTCCGG CAAGAACCTC AACAAAGAGT GGCTGCTCAC CGCGCGTGTG GACCAGAACA TCGGAACGAA AGATGTTCTC TTCGGCCGCT TCAAGATGGA CCGCGGCACT CAGCCCACTT CCACCGACAT CATCAACAAC GATCTCTTCG GATCGCACAG TGTGCAGCCT TCCTACGAAG GTCAGTTGAA CGAGACCCAC ACCTTCAGCC CGAACATGGT CAACAGCCTC ATCCTCAGCG GACAGTGGTA CTCCGCCATC TTCGTCCGCA ACAGCGGCGA ACCCGCTGGA CTCGCGGCGT TACCGTACAG CTCCGTGCTC TTCGGCGCCA ACCCGCTGTC CACGCTCGGT GGCACCAGCA CGCCCGATTA CTTTTTCCCG CAAGGTCGTA ACGTCACGCA GTACCAGATA ACTGACGATC TCTCCTACAC CAAGGGCAGG CACGAACTCA AAGTCGGCGC GAACGTCCGT CGTAACGACA TCTCGAATTA CGACGTGCAG ACACTGACCA GCGGCTTCCT GAATTTCGGC AGCATGGCCG ACTTCTATAA CGGCGCCGTC AACATCAACA ATGGCGATTA CTTCTACCAG GCGTTCTCGA ATGCCAATCG TGTTCCACTC 
GCCATCTACA GCCTCGGCGT CTACGCGCAA GACACCTGGA AAGTGAAGCC CAACCTCACG CTCACCCTCG CAATCCGCGC CGATCGCAAT TCCAACGCGG TCTGCCAAAC CGATTGCTAC GCCAACCTCG CCGCGCCCTT CGACAGCCTC TCGCACGACT CGACCATCCC TTACAACCAG GCCATCAACA CCGGCTTGCA TCGTCCGTTC TACGACATCG AAGCCGCTGC CATTCAACCG CGTTTCGGCT TCTCCTATAA CCCGAGCTTC GCGCACAACA CCGTGCTGCG CGGTGGTATC GGACTCTTCT CCGATCTCTA TCCCGGCGTG CTCCTCGACA ACATCATTCA GAACCCGCCG AACTACAACA GCTTCTTAAC CTACGGCATG GGCTCCGGCG TCACCGCAGC TCCGGGTTTG GCCACCAGCG CCTCTTCGCT CGCGGCGCAA TCCAACTCGT CGTTCCTCAC CGGTTTCGCC AACGGCGCAA CCTTGGCCGA CATCACCGCG ACGAACCGGT TCTTCACACC GCCCAATGTC TTCTCACCAG CCGGCACCAT TCACAACCCG AAATACATTG AGTGGAACTT CCAGATCCAG CAGGAGATCG GTGCGTCGAA CGTGCTGTCG CTCAACTACG TCGGCAATCA CGGATACGAC CTGCTCATCA ACAACCCGGG GTTGAACATC AACAACGCCG GACTCGCGCT CGCAAATGTT CCAGATGTTT CACCCGACAT TCGCTTCGGC ACCGTCACCG GCCTCGCCAG CGACGGCATC TCCAACTACA ACGGACTCGT CACCAGCTTC TCGCACCGCT TCTCGCGCGG ATTCCAGGCG CAAGTGAACT ACACCTGGAG CCACGCTCTC GACGAGCTCA CCTCGCTGCC GGCCACTCCG TATAACTACG GCGAAGCCGC CAGCATCACC ACGCAGTTGG ATCCCAATTG TCTGCGCTGC CTCAACTACG CCAGCTCCGA CGCTGACGTT CGCCACAACC TCACCTTCAA TTATGTTTGG GATCTTCCTT TCAAATTCGG GAACAGATTC GTCAACCAGG TCCTCGGCGG ATGGCAGTTT GCCCAGACCC TCTTCCTTCG CAGCGGCACG CCCTACTCTG TCATTGACAC AGCGGCGGCC AACCAACTCT CGGGATCGTT CAGCGGTGGC ACGTTCCTCG CGAGTTGGGC CAACGATCCC AACGGCCCCG GCGGCGTGAG CTTCGGCGAT TGCAGCACGC CCGGCACCGG CGCAACTCCA AATCAGTGCC TCAGCGCTGG TTCATTCCTT GGTCCCGGTG CGGAATCCTC GTTCGGCAAC GTTAGCCGCA ACGCCTTCCG CGGCCCCGGC TACTTCGATA GCGACTTCAA CGCCATGAAG AACTTCAACC TCACCGAACG CGTGAAGCTG CGCGTCGGCG CGAACTTCTT CAACATCTTC AACCACCCCA ACTTCCAAAA CCCGGTCAAC GACATCGCCT CAGGATCGTT CGGCCAGATC CTGGCCACGG TCTCCCCCGC CACCAGCCCC TACGGATCCT TCCAGGGCGC CGGCGTCTCA GGCCGCTTGA TCCAGTTGGA AGCCCACATC CAGTTCTAA
|
Protein sequence | MKLKFWALAF PLLLLLTSFS VAQNVVTGDV SGTVTDPSGA VVPNSKVELK SSETGFDKVV TTSNAGDFRF SLLKPGPYTI TISNSSFTTV TRHLTVNLGQ ITNASTALSV GASTTTVEVS GEAPLLQTEN GNIATTFDSR AVAQLPNPGG DITYYAQTAP GIAMNTSGGG YGNFSAFGLP ATSNLFTENG NDENDPFLNL NNSGSSNLLL GKNEIQEVAV VTNGYTGQYG RMAGANVNYT TKSGTNQYHG SATYDWNGGV MNANEWFNKG QGNPRPFANS NQWGADFGGP IVKNKTFFYV DTEGIRYILP STQNIYMPTP AFATSVLANI GANSPSQLPF YQQMMGLYQS SAPYGQMKPF NASPASATDN TGGCGDISAS TGFGAGNPCV GYFLASGKNL NKEWLLTARV DQNIGTKDVL FGRFKMDRGT QPTSTDIINN DLFGSHSVQP SYEGQLNETH TFSPNMVNSL ILSGQWYSAI FVRNSGEPAG LAALPYSSVL FGANPLSTLG GTSTPDYFFP QGRNVTQYQI TDDLSYTKGR HELKVGANVR RNDISNYDVQ TLTSGFLNFG SMADFYNGAV NINNGDYFYQ AFSNANRVPL AIYSLGVYAQ DTWKVKPNLT LTLAIRADRN SNAVCQTDCY ANLAAPFDSL SHDSTIPYNQ AINTGLHRPF YDIEAAAIQP RFGFSYNPSF AHNTVLRGGI GLFSDLYPGV LLDNIIQNPP NYNSFLTYGM GSGVTAAPGL ATSASSLAAQ SNSSFLTGFA NGATLADITA TNRFFTPPNV FSPAGTIHNP KYIEWNFQIQ QEIGASNVLS LNYVGNHGYD LLINNPGLNI NNAGLALANV PDVSPDIRFG TVTGLASDGI SNYNGLVTSF SHRFSRGFQA QVNYTWSHAL DELTSLPATP YNYGEAASIT TQLDPNCLRC LNYASSDADV RHNLTFNYVW DLPFKFGNRF VNQVLGGWQF AQTLFLRSGT PYSVIDTAAA NQLSGSFSGG TFLASWANDP NGPGGVSFGD CSTPGTGATP NQCLSAGSFL GPGAESSFGN VSRNAFRGPG YFDSDFNAMK NFNLTERVKL RVGANFFNIF NHPNFQNPVN DIASGSFGQI LATVSPATSP YGSFQGAGVS GRLIQLEAHI QF
|