Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2867 |
Symbol | |
ID | 4070386 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 3410763 |
End bp | 3412760 |
Gene Length | 1998 bp |
Protein Length | 665 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984885 |
Product | hypothetical protein |
Protein accession | YP_591942 |
Protein GI | 94969894 |
COG category | [S] Function unknown |
COG ID | [COG5267] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCTTTC AACGCGTAGT AACCGCAGCG GCCCTCATCT CGACGCTCAC CCTCGGCGCA ACGATTTTGT CCGCGTCCAA AAAGAAAACC ACAGCGCCTC AACTCGACGA CACCAAGCAG GTCGTGCACG CGCTCAACCG GTTAACCTTC GGTCCGCGAC CGGGCGATGT GGATCGGGTG AAAGCAATCG GTCTCAACAA GTGGATCGAT GAGCAACTGC ATCCTGACAA GATTGACGAC AGCGCGCTCC AGGCGCGGCT CTCGAACTTC CGCACGCTGA CGATGAACGC ACGCGAGATG GCCGAGAAGT TCCCGCCGAA CCAGGTGGTG AAGCAAGTGT CGGAAGGCAA GATGAGCGTG CCCCATAATC CCGACGAGAA GATCGTCTAT CTCGCGGCGC TCGATCGTTA CGACCTGAAG AAGGAAAACA AGGCAAACGG AACCAAGAAG AAGGCCGACG ATGCGGACGC CACCGACCAG GCCGATCTTC CCGATGATCA AATAAACGAT GCTGAGCGCC AACGCCGCCG CTCGGCGCGG ATCCACGGCG AGCAGATCGC CACCCAGATT GCGAGCGTCG CTCCCGACAA GCGCATTGAG GCTCTCCTGC GCCTTCCCGA TCAAGACCGC CGCGACCTGC TGCGCATCAA CGACGAAACG CGCCAGCAAC TCATCAGCGG CATGAACCCT GCGGATCGCG AAGCTGTGCT CGCCATGCGG AACCCGCAAG GCTTGGTGGA AGACGAACTC AAGGATTCAA AGCTGCTGCG CGCCATCTAC AGCGATCGCC AACTCGAAGA GGTAATGACC GACTTCTGGT TCAACCACTT CAACATCTTC CTCAACAAGG GACCGGATCG TTATTTCGTG ACGGAGTACG AGCGCGATGT GATTCGTCGC CATGCGCTCG GGAAATTCAA AGATCTTCTG AACGCCACCG CCCATAGCCC GGCGATGATG TTCTATCTCG ACAACGCCGA GAGCGTCGGC CCGAACTCTC CGGCTGCGCT GGGCATGCCC GACAGCCTTC GCCGCCCGAT GTACCGCGGC TATGGTCAAC CGCCGCAGCA GCATTCCGCC AAGAAAAAGC AGAACGGCTT GAACGAGAAC TACGCGCGCG AGTTGATGGA ACTGCACACG CTCGGCGTGA ATGGCGGCTA CTCGCAGAAA GATGTCACGG AAGTCGCGAA GGTCTTCACC GGCTGGACGA TTGAAGAACC GCGCAAAGGC GGCGGCTTCA AGTTCGCCGA GCGCCGTCAC GAGCCGGGCT CGAAGTATGT CCTCGGCCAG AAGATCGACC AGGGCGGCGA GCGTGAAGGC GAGCACGTTC TCGAGATGCT GGCTCGCGAT CCGCATACCG CGCACTTCGT CTGCAACAAG CTCGCAATGC GATTTGTCGC CGATGCGCCA CCGCAGGCAC TGGTGGACCG CATGGCCGAC ACCTTCCTGA AGAAAGATGG CGACATCCGC GAAGTGCTGC GAACGATGTT GCAATCGCAG GAATTCTGGG CGCCGGAGTC CTATCGCGCG AAAGTGAAAA CGCCGCTGGA ATTCGTAGTG AGTTCGGTGC GCGCAACCGG CGCTGAGGTC AGCGATGCCA AGCCGCTGGT GCAAACGCTC AACCAGATGG GCATGCCGCT CTATGGCATG CAGCCGCCGA CGGGCTACTC GATGAAGGCC GATACCTGGG TGAACTCCGC TGCCCTGCTG GCGCGCATGA ATTTCGCGCT CGGCCTCGGC ACCGGCAAGA TCAAGGGCTC GCAGGTTCCG CCGGAATTCC TGCACGGCAA CAACGCGCCC GATGCCATGG CCACTGAATC AACGCTCGAA CAGAACTTGC TGGCGGGCGA CATTTCGGAG CAAACGCGGA GCGTGATTCA CAAACAGCTC GACGATCCCA AAATCCAGGC GCAAGTCGCC GACGACAACA AACGCGCACA ACGCGAGGGA CTGCTGGCGG GCCTGATTCT CGGCTCGCCC GAGTTCCAGC GGAGGTAA
|
Protein sequence | MPFQRVVTAA ALISTLTLGA TILSASKKKT TAPQLDDTKQ VVHALNRLTF GPRPGDVDRV KAIGLNKWID EQLHPDKIDD SALQARLSNF RTLTMNAREM AEKFPPNQVV KQVSEGKMSV PHNPDEKIVY LAALDRYDLK KENKANGTKK KADDADATDQ ADLPDDQIND AERQRRRSAR IHGEQIATQI ASVAPDKRIE ALLRLPDQDR RDLLRINDET RQQLISGMNP ADREAVLAMR NPQGLVEDEL KDSKLLRAIY SDRQLEEVMT DFWFNHFNIF LNKGPDRYFV TEYERDVIRR HALGKFKDLL NATAHSPAMM FYLDNAESVG PNSPAALGMP DSLRRPMYRG YGQPPQQHSA KKKQNGLNEN YARELMELHT LGVNGGYSQK DVTEVAKVFT GWTIEEPRKG GGFKFAERRH EPGSKYVLGQ KIDQGGEREG EHVLEMLARD PHTAHFVCNK LAMRFVADAP PQALVDRMAD TFLKKDGDIR EVLRTMLQSQ EFWAPESYRA KVKTPLEFVV SSVRATGAEV SDAKPLVQTL NQMGMPLYGM QPPTGYSMKA DTWVNSAALL ARMNFALGLG TGKIKGSQVP PEFLHGNNAP DAMATESTLE QNLLAGDISE QTRSVIHKQL DDPKIQAQVA DDNKRAQREG LLAGLILGSP EFQRR
|
| |