Gene Information |
Locus tag | Acid345_4711 |
Symbol | |
ID | 4070650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 5572049 |
End bp | 5575039 |
Gene Length | 2991 bp |
Protein Length | 996 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637986756 |
Product | hypothetical protein |
Protein accession | YP_593785 |
Protein GI | 94971737 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
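The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the terminating stop codon (the listed gene sequence does end in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the CDS includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 5572049, 5575039

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 2991, matching "Gene Length"
print(protein_length(length_bp))  # 996, matching "Protein Length"
```

The strand field does not affect this calculation: a gene on the minus strand spans the same number of bases between its start and end coordinates.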
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCAAAGAC TCTTTTCCTT CGCGGGCATA CTTTCTTTTC TACTTATCCT GGCAAATTGC GGTGGTGGAA GCGGCAACAC GCAACCTCCG GCGGGTGGCG GCGGTGGTGG CTCAACTCCG AGCTTCACAG TCTCCCTTTC CATTTCCACA GTGAACCTGA CACCGGGCGG AGCGACACAG GATGTCACTG TCTCGGTGAC GGGAAAGGGC GGCTACTCCG GATCGGTATC GGTTAGCGCT ACCGGTCTTT CTTCCGGGGT GACTGTTTCG CCGACATCGG TGTCTGTTCA GACGGGAAGT AGTGGGAAGC TGACCTTCTC GGCCTCAGAT TCAGCGACAG TCGGAGCGCA ATCGGCAAAC ATTGAGGCAG TCGCAGGATC CGTCAAAGTT TCAAGTCCCG TGCAGATCAA CGTGGCGAAG GCTGCACGTC CTGATCGATT CCACTCTGTA GGTGGAACTT TATGGCGCGG ATTTTACGAC GATACTCGAG GGTTACTCTT CGCAGCGAAT CCAGGGTTGA GTGAAGTCGA TGTGATATCC GGTTCTGATT TCACGATCAA AGCGAGAGTC TCGGTACCGC AGGCGTGGAG CGTTGACCAA ATGGCGGACG GCAAGACGCT CGTTATCGGC ACGGTTGCTC AAGAGTTCTT CACGCTGGAC GAAGATACTC TCAAGGCGAC GATTCATCTG CTGCCGACAT TGCCGCGTAT CTCCTATAGC TTGAATTGTC CATCCGTTGT GGCGATGGCC AATGGCATCG TCTTTCTGCT GACCCAGGAA ATGGGAATCG CGGGAGGTGG CGCAGATGGC GCCGCTCACT TAATAAAGTG GGATTCAAAG AAGAACACGT ATACGGAACT TGGCCCTCTG AACGGAGCTT CGAGTTGGTC CACGAAGAGC ATGGTTCGAA GTGCGGACCG CAAATGGGCG GCTTTCGCGG TGGACAAGTT CTATTTGTAT AGTTCGGACG ATGACAGTAT TACGTCAGTC GTGGATCTGG CGACGGTCAA CCCGCCCGCC GATTCGTTTG GTGTTCGGGG TTACGCGCTG AACGCTGACG GAAGCAAGAT TGCTGTCGCT TCCGCATCGC AAGTCACCTT CCTCGATCAC TCATTCAATG TACTCGCGAC AGTACCGTAT TCATCCGCTT TTCAGGATTC AGGCACCACC GTGCGATTCA CTGCGGACGG GAACCGCCTC ATCATGCAGA ATATTTTCCC GGTCTCGCTT GAGATGGTCG ATGCGAACTC CTATACGGCG CTCGGATACC AACCTGCTTT CGGAGACAGA GCAGATGTTT ACTCGACGAT AATCGCGATC GACGGCGTCG GACGGGCCTT TGTCGGATTC GATGGCGGAT ACGAAGTAAT CGATACAGCG CAGACACCGG TTCCAAATCC GACCGAAGCG GGCGCTACCT TGGACGGGCC GGAATGCCCC CTCCCGAACC CGCCCAATGC AGGCCTGAAC GCGAGCCTGA CATATTCCAC GTTTAACAGC AGTCTTTTTG CCGGCTACTC CTTCTACTTC GCTGGAGCGG CCGGAACCGT GTCCGCAGAC GGCACCCAAG TGACTGCGCC GGCGTCTTCC CAAGCAGGCC CGGTTGATGT GGAGTGCGTC GATTCGGCAG GGAGTTCCAG GACGCTGCCA TTTGCATTCT CCTACGGGGC AAAGGCTGCT GCGGTGAGCG CAAACCTCTT ATCTCCCGTG GGAGAACAAT CTTTGTATGC GTTCGGTTTT GGTTTTTTCT CCGATGCATC CTCCGTTCCG GCGGTGTCGG TAGGCGGGCT AGCTGCCGCG 
AACGTTGAGC AGGTTTCGCT TTCGAAGGGC AGCTTACAAG GAGTTCGTCT GCAACCACCA ACGCTCTCCG CTTCCACAAC CGCGGATGTC ACTGTGACAA GCGCGTACGG ATCGAGCACG GTCAAGGGCG CGGTTTCATA CATCCCATCC GCGCACGTGG TAGCGACGAG TGGAGTGTTG CAACTGCTCT TCGATTCGCA TCGCAACTTG CTCTATGCCC TAAAGGCCTC CGAGATCGAT GTCCTGGACC CGATCTCGCT GACGTGGAAC ACGCCGTTTG CGCTGCCGGC ATTCGCTGGT TCTGCGAACT ACGGATATAT AGCGCTCAGT CCGGATGGAA GCCGATTAGT GGCTGTCGCC AGCGCAGGCT ACGCCGCTGT CGTCAACCCT GACGACCCAT CGAAAACCTT CTCGGTCTCG ACGCCGAACC CTGGGTTCTC GTGGGGCAGG GTCGTGATAA CCAAGGAGAA CAAGGCCGTA TTCGGCGGAA GGCCGCCTGT TGAAATCGAT CTCGCAACAT CAACCGGGAA AGTTATTCCA ACTTATCTAG GGTGGCTGAT CGCATCACCG CCGGACGGAA GTGTGATTTA TGGCATTGAC ACCGGTGTCA CCACAGGGCA GGCGTACCGA ACTGACATCT CCACCTACAA AACGACGAGC ACGCCGCAGT TTGGCGCTCA ATTCTGGTCT GACTTGGCCG TCTCAGCGGA CGGATCACAT TTCGCTGGGA TCCTCGCGGA ATCAAATGGA GGCGACGTTA TAGGGTTCTT TGAGTCGGGA CTTCATCTCG TCAATTTCAA TGAAGGTCCG CTGCTCAGCC CTGCAGACGA TTCTCTCGTA TTGGGCTCAG TATTTGGCCC TAAAGGAAAC GTGCTGGTGG TCGCATTGGG AGACTCTATC GAATTCTGGG ATACGCAGAC CGGTACTCTG CGCGCCCGGC TTATGACGCC TGAAGAGTTG CAGACGGAAT CGGGTTCAGC CAGTTTCGCC GCACCGCAGG TCGCCCTAGA CTCGACGGGA CAGACTATCT TCGCAGTGTC CGCGAGTGGT ATCAGCGCAA TGACTCTTCC GGTCCCGGTC GATGACCTCC CGGTTGCGGC ATGGAACGGC CCGTTGCCGG CGCCTCAAGC TCCCGTGTCG GCGGTTTTGG GGCCTAGACA CGGGAGCTAC ACAGTTCGTC GCAGCCGGTA A
|
Protein sequence | MQRLFSFAGI LSFLLILANC GGGSGNTQPP AGGGGGGSTP SFTVSLSIST VNLTPGGATQ DVTVSVTGKG GYSGSVSVSA TGLSSGVTVS PTSVSVQTGS SGKLTFSASD SATVGAQSAN IEAVAGSVKV SSPVQINVAK AARPDRFHSV GGTLWRGFYD DTRGLLFAAN PGLSEVDVIS GSDFTIKARV SVPQAWSVDQ MADGKTLVIG TVAQEFFTLD EDTLKATIHL LPTLPRISYS LNCPSVVAMA NGIVFLLTQE MGIAGGGADG AAHLIKWDSK KNTYTELGPL NGASSWSTKS MVRSADRKWA AFAVDKFYLY SSDDDSITSV VDLATVNPPA DSFGVRGYAL NADGSKIAVA SASQVTFLDH SFNVLATVPY SSAFQDSGTT VRFTADGNRL IMQNIFPVSL EMVDANSYTA LGYQPAFGDR ADVYSTIIAI DGVGRAFVGF DGGYEVIDTA QTPVPNPTEA GATLDGPECP LPNPPNAGLN ASLTYSTFNS SLFAGYSFYF AGAAGTVSAD GTQVTAPASS QAGPVDVECV DSAGSSRTLP FAFSYGAKAA AVSANLLSPV GEQSLYAFGF GFFSDASSVP AVSVGGLAAA NVEQVSLSKG SLQGVRLQPP TLSASTTADV TVTSAYGSST VKGAVSYIPS AHVVATSGVL QLLFDSHRNL LYALKASEID VLDPISLTWN TPFALPAFAG SANYGYIALS PDGSRLVAVA SAGYAAVVNP DDPSKTFSVS TPNPGFSWGR VVITKENKAV FGGRPPVEID LATSTGKVIP TYLGWLIASP PDGSVIYGID TGVTTGQAYR TDISTYKTTS TPQFGAQFWS DLAVSADGSH FAGILAESNG GDVIGFFESG LHLVNFNEGP LLSPADDSLV LGSVFGPKGN VLVVALGDSI EFWDTQTGTL RARLMTPEEL QTESGSASFA APQVALDSTG QTIFAVSASG ISAMTLPVPV DDLPVAAWNG PLPAPQAPVS AVLGPRHGSY TVRRSR
|
| |
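The sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any computation. The following is a minimal sketch, assuming the whitespace carries no meaning and that GC content is simply the fraction of G and C bases. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example input is only the first two blocks of the gene sequence, so its result is not the 57% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumes an unambiguous DNA alphabet)."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two blocks of the gene sequence above, used only as a short demo.
first_blocks = "GTGCAAAGAC TCTTTTCCTT"
seq = clean_sequence(first_blocks)

print(seq)               # blocks joined into one contiguous string
print(gc_fraction(seq))  # GC fraction of these 20 bases only
```

To check the reported GC content, the same two functions could be applied to the complete gene sequence after pasting it in as one string.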