Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2949 |
Symbol | |
ID | 4070873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3493710 |
End bp | 3495617 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637984968 |
Product | hypothetical protein |
Protein accession | YP_592024 |
Protein GI | 94969976 |
COG category | [S] Function unknown |
COG ID | [COG4805] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGGTCG CCCGAATACT CCTCTGCGTC TTCTTTGTAT CCTGCTGCTC ATTCCTCTTC GCGCAGTCCC ACAAAACCAC ACCCGAGGCG CCTGTGACCG GAACTCCGAG CGAACGGCTG GCCAAGCTCT CCGAACAGTT CCTTCATGAA TCACTTCAAC TTTCACCGGT ATCAGCTTCT GGGGCGGGAT ACCACACGTA TGTCGATCCG ACGAGCGGAC GCACGGTTCG CCTTGATGCC GAGCTCGATG AGATGGGTAC CGAAGACCTC GCGGAACAGT TGAAGTTCTA TCGCCACTGG CGCGAGCGGT TGCGGAGCCA GGCGCCGTAC AAAAGCCTCG ACGCTCAGGG CCATGCCGAC TGGATTCTTC TGGACGACGG AATCTCGAAC AACTTGCTCG AATTGGAGAA AGTCCAGAAC TACAAGCACA ATCCGACCGG CTGGGTGGAG TTGATCGGCA ACGGGTTGTT CCTCAACATG TCGCAGGAAT ATGCGCCAAA AGATCAGCGC ATGGCAGATG CAGTGTCGCG CATCGCGCAG ATTGCGCGCT TCATTGGGCA AGCTAAGCAA CAACTCATGG ACTCAGATCC GATCTACATC AAAGTTGCGG TGGAAGAAAA CAGCGGCAAT CTCGGGATGA TTGATGACAT CGGCAAAGAG CTCCCGGCGA GTGGAGCGGT GCGTCAGAAG TACGATCGCT TCGCACCGGC GGCAAAGAAG GCGCTGACGG ATTTCTCGCA GTGGATGCAG ACCGACTTGG CGAACCGTCC GACGAACGGC CGCAACTGGC GGTTGGGGAA GGAGTGGTAT GCCGAGCGTT TCCGCCTGGT GATGGAGACG AACGTTACTC CGGACGTGCT ACTCACCGAT GCCGAAACGG ATATGACGAG CGTCCGGGCG GAGATGCTGG AAATTGCGGT ACCGATGCAC AAGGACATGT ATCCAGACCA CACCGATCAC GCCGACCTGA GCGGAGTGGA TCGCGAAAAC AAAATTATTG GCGAGGTCCT GGACCGCCTG GGGCAGGAAC ATCCGCAGCG CGATCAGTTG ATGGACTACA TTCAGGGCGA TCTCCAGAAC ATTATTGATT TCATTCGCGA ACACAAGATC GTCGCACTGA GTGCGCGAAA CAATCTGAAG GTGGTTGCAA CTCCGGACTT CATGCGCGGC GTTTATTCGG TGGCAGGTTT CCACGCGCCG CCGCCGCTTG ATCCCAATAC CCAGGCGCAG TACTGGGTCA CCCCGATCGA TCCTAAGACG GCGGATGAAA AGGCCGAGTC GAAGCTGCGC GAGTACAACA ACTACACGCT GCACTGGCTG ACCATTCACG AAGCGCTTCC GGGACATTAC ATCCAATTCG AGCACGCGAA TAACGTGGAG CCTCCGATGC GCAGGTTATT GCGCGCGTAT TACGGCAACG GCCCGTACGT GGAAGGCTGG GCCGAGTACA TTGCGGGCAT CATGCTCGAC GCTGGGTTTG CTGACAACGA TCCGCGTTTC CGGCTGATCA TGAAGAAGAT TCGTCTGCGC GTGTTGGCTA ACACAATCCT GGACATCCGC ATGCACACAA TGGATATGAG CGACGACGAA GCCATGTCGC TCATGACCAA GCAGGCCTTT CAGACTGACG CAGAAGCTCA AGGAAAACTT CAACGTGCAA AGCTAACTGC AACGCAGCTT CCGACCTACT ACGTAGGCAT CCGCGGCTGG AACGATCTGC GGGCGAAGTA CAAGAAGGCG AAGGGAACGG CATTTACGAA TCTGGAATTT CACAACCGGG CGTTGGATCT CGGTCCAGTG CCTCTGCCGC TGGCAGGTGA GATTCTTCTG GGGATTCCGG CCAACTTGAG TGTGGGGCAG AGCACCAGCG CTGCCCCGGC ACACAAAAGG GCGACGCGCA AAAAGTAG
|
Protein sequence | MKVARILLCV FFVSCCSFLF AQSHKTTPEA PVTGTPSERL AKLSEQFLHE SLQLSPVSAS GAGYHTYVDP TSGRTVRLDA ELDEMGTEDL AEQLKFYRHW RERLRSQAPY KSLDAQGHAD WILLDDGISN NLLELEKVQN YKHNPTGWVE LIGNGLFLNM SQEYAPKDQR MADAVSRIAQ IARFIGQAKQ QLMDSDPIYI KVAVEENSGN LGMIDDIGKE LPASGAVRQK YDRFAPAAKK ALTDFSQWMQ TDLANRPTNG RNWRLGKEWY AERFRLVMET NVTPDVLLTD AETDMTSVRA EMLEIAVPMH KDMYPDHTDH ADLSGVDREN KIIGEVLDRL GQEHPQRDQL MDYIQGDLQN IIDFIREHKI VALSARNNLK VVATPDFMRG VYSVAGFHAP PPLDPNTQAQ YWVTPIDPKT ADEKAESKLR EYNNYTLHWL TIHEALPGHY IQFEHANNVE PPMRRLLRAY YGNGPYVEGW AEYIAGIMLD AGFADNDPRF RLIMKKIRLR VLANTILDIR MHTMDMSDDE AMSLMTKQAF QTDAEAQGKL QRAKLTATQL PTYYVGIRGW NDLRAKYKKA KGTAFTNLEF HNRALDLGPV PLPLAGEILL GIPANLSVGQ STSAAPAHKR ATRKK
|
| |