Gene Information |
Locus tag | Acid345_0746 |
Symbol | |
ID | 4068622 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 919323 |
End bp | 920579 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637982752 |
Product | NHL repeat-containing protein |
Protein accession | YP_589825 |
Protein GI | 94967777 |
COG category | [S] Function unknown |
COG ID | [COG3391] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
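The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon; the record does not state either convention, but both agree with the listed values. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(919323, 920579)
print(length, protein_length_aa(length))  # prints: 1257 418
```

The strand field does not affect this calculation, since the length is the same on either strand.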
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACGGCA TGAGCGACAA GTCGAAATCG AAATTAGCAA CAACCGCGAC GTGCATCCTC GCGCTGGTGC TCGGCTGCGC TGTGCCTGTC TTCGGTGGAA AAGATAAGAA GAAGGATGCT GCCGCGCCTG TGCAGGAAGA GTCGGTTCTG AAGAAATTGG ATTACTCGAA GATCGTGTGG CCGAACCCGC CTGCGATCAC GCGGCTCAAG TATGTGGATT TCTTCGCGGG CGAGAAGATC CAGACGCAGA TCGTTCAGGA AAAGAAGAAG TCGGAGTGGA TGGCACGACT GGCGGGCGGC GATTCCGAGG GCAACGGCAA GAACGGCCCG AAGCAGCGGT TTGCGTTGGC TACTCCGTAC GGCATGGCAG TGGATTCGAA GGGCCTGCTG TATGTCGCGG ACGGAAAAGT TGGAGCGATC TTCATCTTCA ACACCGAGAC CCACGATGTC GACATGATCA AGAACGGAGT GCAGGCGCAC TTCGGGCTGA TCACGGGATT GACGATTGAC GACGGCGACC GGCTCTTTGT TTCGGACTCG CAGCTGCATC GGGTACTGGT TTTCGGGCCG GATCGCAAAC AGGAAGCGGT AATCAGTGAG GGGCTGGTAG ATCCGGGCGG GATGGCGGTT GATAACGAGA ACCGGTTCCT TTATGTCGCG GATCCGGCGC TCGACCAAGT ATTGGTGTAC GACGCCGACA AGTTCAACTT GATCCGCAAG ATGGGGACTT CGGGAAAGAA CCACGCACTG ACGGAGCCAG GACAGTTTGC GCGGCCGACG AACGTAGCGG TGGACAGCGA CAGCAACTTG TATGTGACCG ATACCTCGAA CCGGCGAGTA GAGATTTTCG ACGCCGACGG ACAGTTCATT ACGGCATGGG GCAAGGCGGG CGATGGTCCG GGAACGTTCG CACGGCCGAA GGGGATCGCG ATTGATTCCG ACGGGCACGT GTGGGTAGCG GATGCCGCAC AGGACCGCGT GCAGTGCTTC AGCAAAGATG GAAAAGTTTT GTTGTACCTG GGAGGACACG GATTGTTGCC GGGGATGTTC GGCAATGTTG CCGGACTGAC GATCGACAAG AAGAACCGTG TGTACACCTC AGATCAGAAT CCGGGCCGGG TGCAGATGTT TCAGTACATC AGCAACCCGG AGGCGCGTGC CGAGTGGGAA CGCCGGCAAG CGTTGGAAAA GGGTAAGACT GGCGCGACAG CGACGGCTTC GCAAGCGCCG GCAAACAGTA ATAACAAGCC GAAGTAG
|
Protein sequence | MYGMSDKSKS KLATTATCIL ALVLGCAVPV FGGKDKKKDA AAPVQEESVL KKLDYSKIVW PNPPAITRLK YVDFFAGEKI QTQIVQEKKK SEWMARLAGG DSEGNGKNGP KQRFALATPY GMAVDSKGLL YVADGKVGAI FIFNTETHDV DMIKNGVQAH FGLITGLTID DGDRLFVSDS QLHRVLVFGP DRKQEAVISE GLVDPGGMAV DNENRFLYVA DPALDQVLVY DADKFNLIRK MGTSGKNHAL TEPGQFARPT NVAVDSDSNL YVTDTSNRRV EIFDADGQFI TAWGKAGDGP GTFARPKGIA IDSDGHVWVA DAAQDRVQCF SKDGKVLLYL GGHGLLPGMF GNVAGLTIDK KNRVYTSDQN PGRVQMFQYI SNPEARAEWE RRQALEKGKT GATATASQAP ANSNNKPK
|
| |
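The sequences above are shown in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that spacing, compute GC content, and translate a coding sequence with the amino-acid assignments of translation table 11. The helper names are hypothetical. The sketch assumes the gene sequence is given in coding orientation (it begins with ATG and ends with TAG, even though the gene lies on the minus strand) and does not handle alternative start codons, which table 11 permits but this gene does not need.

```python
from itertools import product

# NCBI codon ordering (T, C, A, G); the amino-acid assignments are the
# same for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGTACGGCA TGAGCGACAA GTCGAAATCG"))  # prints: MYGMSDKSKS
```

Passing the full gene sequence to `translate` and `gc_percent` allows a comparison against the listed protein sequence and GC content fields.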