Gene Information |
Locus tag | Acid345_0741 |
Symbol | |
ID | 4069083 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 912193 |
End bp | 914130 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637982747 |
Product | hypothetical protein |
Protein accession | YP_589820 |
Protein GI | 94967772 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01905] doubled CXXCH domain |
|
|
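The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, that the gene is unspliced, and that the coding span includes the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(912193, 914130)
    print(length, protein_length(length))
```

Under these assumptions, the values reported for Acid345_0741 (1938 bp, 645 aa) agree with the Start bp and End bp fields.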
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0951377 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCGCG CTTGGCTGCT TGCTCTCTTG GTCGGTGTCT TGTGCGTGAA TTCCCCAGCG CAGTGGACTA CCGACGTACT CGGCTCGCAT GATCTCTCGC CAGGCGGCAC TTCGCCGATC AAGGGCCGGC TGAATTCCGG ATGCCAGTAT TGCCATGCGC CGCACTCAGG AATCACGATG GGCTCGGCGC CCCTGTGGGC GCAGACACTC TCGAAGCAGA CCTACACCAC GTATACGAGT ACGACGCTCA AGAACCTGAC CACCCAACCA CCTCTCGGGG GCGATAGCAA TCTTTGCCTG AGCTGCCATG ACGGCACCGT CGCGCCGGGG CAGACGGTTC CTTATGGGCG CCTGCGGATG ACAGGCAACA TGCTGCCGCA GGACAAGTTC GGCAGCAATC TCGGGGGCTC GCATCCATTT AGTTTTCGAG CTTTGACTAC CGATTCGCCG GACCTGGTCA CGACCCTGAT CTCGAGCCAC AAGACCGCGG ATCCGCAAAA CGCGGTGAAG TTGATTAACA ACAACGTGGA ATGCACGAGC TGCCACAATC CGCACGTGCA GGCGATTGAC ACGGTGGCGC AGCAGTTCCT GGTACGCGAC GGTTCGAACG GAGCACTGTG CCTGGCGTGC CATGAGCCGG GTGCACGCCA GGTAAGCAAC CAGAACAACC CTCTGTCGCC GTGGACGACG AGTATCCACG CGAACACGAA CAATAAGCTT TCGCAGGGCG CGGGACTCGG CAGCTATACG ACGGTCGGCG CAAATTCATG CATATCGTGC CACGTTCCAC ATAGCGCACT GGGCGGAGCA GAATTGTTGC GGCAGCCGGC ATCGCCAGTG CCGAACATGG ATTCGGCGAC ACAGAATTGC ATTACCTGCC ACAACGGGGG ATCGAACATC TCACCGGCAA TTCCGAACGT GTATGCCGAA TTTGCAAAGA CGGGCCACCC GTATCCGGCC GGCAACAACA CCCATAGCGC CGGGGAGGCA ACGGTCCTCG AAAACAATCG TCATGCGACT TGCGTTGACT GTCACAACGC GCACGGATCG CAGCAGGTGA CGAGCTTTGA TGCTCCGCCG AAGATACGCA TTTCGCAAAC CAGCACCAAG GGCTTAGGCG TGGACGGCAC CACGCAAATC GATCCGGCGG TGAATCAGTA CGAGAACTGC TTGCGTTGCC ATGGGCCGAG TTCCGGGAAG ACGACGTTGA CGATCTTCGG GTACGCGCCC GCGTGGGCGG CAGAGAATCC CGGCGATTCG CTGAACGTGA TTTATGAATT CAACTCGTCC TCGACGTCGC GACATCCGGT GATGCTCGAT CGCAGCAGCG GGTATCCCCA ACCAAGCTTG CGCGCGTTCA TGGTGCAACT CGACGGGAAG ACCCAGGGGC GCTCCATGGG GCAGCGCATC TTCTGCACGG ATTGTCATAA CAGCGATGAC AATCGTGAGG GTGGTGGAAC CGGGCCAAAT GGTCCGCATG GCTCGACGTT CAGCCACATC CTTGAGCGCC GCTACGAATA CAGCCAGGTG GCTTCCGGCG CCGGTGCGGG TACGACGATC ACAAACCTGA TTCCGAATCC GCCGCTCGAT CCTTCCGCGA ATGGACCGTA TTCGATGTGC GCGAAGTGCC ACGACCTTAC GAACATCGTT TCGGATGCGA GTTTCTTGCC CGACAAAAAC GGTAAGGGAG GCCACGCGAC CCACATCAAC GACGGGTTCT CCTGTTCCAT CTGCCATACT TCGCATGGAA TGGGCGGAAC GGCGGCAGGC ATCTCCGGCG AGCGCATGGT GAACTTCGAC 
CTGAAGGTCG TCGCGCCGAA CAATGGCACG CTGGCGTACT CGCACAGCGC AAATACCTGC ACCCTGACCT GCCACGGCTA CGCGCACTAC TCTAACGGCT CTGTGACCCC GGCTCTCGCT AAACCGGGGG TGAAGTAA
|
Protein sequence | MKRAWLLALL VGVLCVNSPA QWTTDVLGSH DLSPGGTSPI KGRLNSGCQY CHAPHSGITM GSAPLWAQTL SKQTYTTYTS TTLKNLTTQP PLGGDSNLCL SCHDGTVAPG QTVPYGRLRM TGNMLPQDKF GSNLGGSHPF SFRALTTDSP DLVTTLISSH KTADPQNAVK LINNNVECTS CHNPHVQAID TVAQQFLVRD GSNGALCLAC HEPGARQVSN QNNPLSPWTT SIHANTNNKL SQGAGLGSYT TVGANSCISC HVPHSALGGA ELLRQPASPV PNMDSATQNC ITCHNGGSNI SPAIPNVYAE FAKTGHPYPA GNNTHSAGEA TVLENNRHAT CVDCHNAHGS QQVTSFDAPP KIRISQTSTK GLGVDGTTQI DPAVNQYENC LRCHGPSSGK TTLTIFGYAP AWAAENPGDS LNVIYEFNSS STSRHPVMLD RSSGYPQPSL RAFMVQLDGK TQGRSMGQRI FCTDCHNSDD NREGGGTGPN GPHGSTFSHI LERRYEYSQV ASGAGAGTTI TNLIPNPPLD PSANGPYSMC AKCHDLTNIV SDASFLPDKN GKGGHATHIN DGFSCSICHT SHGMGGTAAG ISGERMVNFD LKVVAPNNGT LAYSHSANTC TLTCHGYAHY SNGSVTPALA KPGVK
|
| |
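The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch of how one might strip the block spacing and compute a GC percentage comparable to the GC content field. The helper names are hypothetical, and the database's own rounding and method for that field are not stated in this record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (whitespace ignored)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Short made-up fragment for illustration; paste the full gene sequence to check a record.
    print(round(gc_percent("ATGC GCGC")))
```

Applying `len(clean_sequence(...))` to the full gene sequence also gives a quick check against the Gene Length field.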