Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_1668 |
Symbol | |
ID | 4069816 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 2017826 |
End bp | 2019271 |
Gene Length | 1446 bp |
Protein Length | 481 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637983676 |
Product | hypothetical protein |
Protein accession | YP_590743 |
Protein GI | 94968695 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCAAT ATCGATTCGC CCTGGTTGTC TTCGTGCTCT CAAGTTTTCT CTTCAATGCG CCGCACCTTT TTGGGTCTCG CCGTACCGCA TCCGACGTCA TTCATCTTTC GCCGGGTGAC GACATTCAGG CTGCGGTCAA CGCCAATCCG GCAGGAACGT CGTTCGTGCT GTCGGCCGGC GAGTACCGAT TGCAGAGCAT ACGCCCCAAG GACAACGACA GCTTCCAGGG AGAGGGACTT GTGGTCCTCA ATGGATCGCA GGTTCTTCCG GCGGAGCCCG CGACGACCTA TACCGGTCAG TCGCTCTGGA AGGTAAGAGC AACTCCGAAC CGTGTCCAAT ATGGACAATG CCAGTCCGTC GCGCCCTTAT GCGGCTATAC GCAGGACTTG TTCGTGCGCG GCGTGCTCAT GGCGCCCGCG CCCTATTTGA TCTTTCTCAC TCCCCGATCG TTTTATTTCG ACCGCAACAC CCACCAGATA TTTTTTGTGC GCGCGCAATC GTCCTCGGTG CATGCAGCCC TGTTCGTCGA ACTTGCGACC AGGAACTACG CCTTCTACGG AAGCGCCAAG GGAGTACGAA TTGCCAATCT TGTCGTCGAG AAGTATGCCA ACGAGGCGCA AAAAGGAGCG ATCGGCGGGG ACCGCAGCGG CAGCGGTTGG ACACTCGACA ATGTCGAAGT GCGATGGAAC CACGGTGCCG GCGTGGAACT CGGCCCGCGA TCAACCCTGC AGAACTCGAA GATCCATCAT AACGGCCAAC TCGGGGTAGC GATGAGCGGC CCGGACTGCC TGGTTCGGAA CAACGAGATC GCGTGGAACA ACTATGCGCA CTTCGACCCC AACTGGGAAG CCGGAGGCTC GAAGTTCTGG GCCACCTCGA ACCTGGAAGT GGACAGCAAT GACGTTCACG ACAATGAGGG CCCCGGGTTA TGGACGGACA CGGACAACAT CCACACGGTT TATGAGAACA ACCGCGTCAT CCACAATTCG GTCACTGGCA TCGTTCACGA GGTCAGTTAC GACGCCACGA TTCGTAACAA CCTTGTCAAA GAGAACGGTT ACGGCAAGAA CAACTGGATG TGGGGGGCGC AGATCACCAT TCAGAATTCC TCGAACGTCG AGGTCTATCT CAACCAGGTG GAAGTCGCGC CCGGCTACGG AAACGGGATT GCTGTCATCA ACCAGAACCG CGGTACCGGG GCCTACGGCC CGCGAGTAGC GTCGAACAAC ACGGTGCACT TCAACGTGGT GGTGTATCAC GGAAGCGCGG GAACCACGGG TTTCGCGGAC GATACCGGCA CTGCAGCCGC CACGCAAAAC GCATTCGACA ACAACATCTA CGTGACTCCG AACTGCGCGG GAGTTCATTG GCGCTGGCTC AGCGGCAAGT ATTGGACCGA TTTTCAAGGT CTCGGGCAGG AACTCAATGG CAGCTGTCAG AATTAG
|
Protein sequence | MRQYRFALVV FVLSSFLFNA PHLFGSRRTA SDVIHLSPGD DIQAAVNANP AGTSFVLSAG EYRLQSIRPK DNDSFQGEGL VVLNGSQVLP AEPATTYTGQ SLWKVRATPN RVQYGQCQSV APLCGYTQDL FVRGVLMAPA PYLIFLTPRS FYFDRNTHQI FFVRAQSSSV HAALFVELAT RNYAFYGSAK GVRIANLVVE KYANEAQKGA IGGDRSGSGW TLDNVEVRWN HGAGVELGPR STLQNSKIHH NGQLGVAMSG PDCLVRNNEI AWNNYAHFDP NWEAGGSKFW ATSNLEVDSN DVHDNEGPGL WTDTDNIHTV YENNRVIHNS VTGIVHEVSY DATIRNNLVK ENGYGKNNWM WGAQITIQNS SNVEVYLNQV EVAPGYGNGI AVINQNRGTG AYGPRVASNN TVHFNVVVYH GSAGTTGFAD DTGTAAATQN AFDNNIYVTP NCAGVHWRWL SGKYWTDFQG LGQELNGSCQ N
|
| |