Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3271 |
Symbol | |
ID | 4072683 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3873671 |
End bp | 3875755 |
Gene Length | 2085 bp |
Protein Length | 694 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 637985292 |
Product | BNR repeat-containing glycosyl hydrolase |
Protein accession | YP_592346 |
Protein GI | 94970298 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00240388 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGTTAG CCACCGCGTC CATTCCAAGG GGAAGGGCGG TATCATTAAC CCAAATGTTC ATGCGCAAGC TGTTTTCGCT GCTGTTTGTC CTGTTGTTGG CTGTGACCCT CGCATCGGCT TCATCGTGGA AAGAACTTGG TCCAGATGGC GGCGACGCCC GCAGTCTCGC GTTCGACCCT CACAATCCCG ACCGCATTCT GCTGGGTACA AGTTCCGGCC AGCTCTACAT GTCCAATGAT CGCGGACACT CCTGGAATCG CTTCGCGCAG ATCGGCCTCG GCGACGACTA TGTCCTCGAC AACACCGCGT TCGATCCCAG CGATCCGAAC ACGATTTACA TCGGTGTGTG GAGTGTGGAG CACCAGCGTG ACGCCGGCGA TCTCTATGTC ACTCGCGACG GTGGCAAGTC ATGGAAGACC ATCGAGGGCA TGCGCGGTAA ATCGATTCGC GCTCTTTCCA TCGCGCCGAG CAATTCGAAG ACTCTGGTCA TCGGCGCACT CGATGGCGTC TATCGCAGCG ACGACAGTGG TGAAACCTGG CGGCGTATCT CCCCGGAAAA TCACGCGGAG ATCAAGAACA TTGAATCCAT CGCCATCGAT CCCAAAAATC CCGACGTCGT TTACGCTGGC ACCTGGCACC TGCCGTGGAA GACCGATGAC GGAGGCAAGT CATGGCACCA CATCAAAGAG GGCGTCATTG ACGACTCCGA CGTCTTCTCC ATCATCGTCG ACTTCTCGAA CCCTTCTACG GTATTCGCCA GCGCGTGCTC CGGTATCTAC AAGAGCGAAA GCGCAGGCAA TCTCTTCCAT AAAGTTACTG GCATTCCTGC CACGGCGCGC CGTACGCGCG TGCTGATGCA GGACCCGAAG AACCCGCAGA TTGTTTACGC CGGCACCACG GAAGGGCTTT ACAAGACCCT CGATGGCGGT AAGACGTTCA AGCGCATGAC CGGCCCGGAA GTCATCGTTA ACGATGTCTC CGTCGATCCG CGCGACACCA GCCGCGTGCT GCTGGCAACC GATCGCAGCG GTGTTCTCGC CAGCGAAAAC GGCGGCGCCA CGTTCACGCA GTCAAACCGC GGCTACTCGC ATCGCCAGGT CTCATCGCTG CTGGTGGATT CGAAAGATCC CAACACGATC TATGTCGGAC TGCTGAACGA TCGCGACTTC GGTGGCATGT ATGTCTCGCG CGACGCCGGA TCTACCTGGT CACAGGCGAG CAAGGGCCTG AAAGACCGCG ATGTCTTCAC TCTTCGCCAG GCAGGTGATG GCGACATCTT CGCCGGCACC AATCACGGCG TCATGAAATT CTCGAGCAAG ACGCTGTTGT GGGAGCCTGC CAGTGTCGTG GTGAAGGAAA AGACCACGCC TGGTCCTAAG ATTCCCGCGA AGGTCGTTAA AGGCAAAAAG ATTCCGGCAC ACGAAGGCAC GCCGAAGGTC ACCATCGAGA AATCGGAACT GACCTCGCAG GTGTCGCAAC TCGTGTTCAC TCCGGCGCTC TGGTACGCCG CCGCAAGCAG TGGCGTTTAC AGCAGCAAGG ATGATGGCAA AACCTGGCAA CACGCCGATA TCGAAGGTGA CGTACGTTTC CTTGCGATCG GCGCGTTCGG CGACAAGGCC TTCGCAGCAT CCGCACTTGA CGGCTACGTC ACAACCGATC ACGGCGGACA CTGGACGCAA GTGAGCGTGC CGAAGTTCAT CACCGGTATC TACGATGCTG CGGTTGGGTT CGATCAGTCG CTTTGGCTGG CGACGCAGCA GGGTGCATTG CGTAGCGGTG ACGACGGAAA GACATGGGAG CACGTTACGG CGGGACTCCC GTGGAAGCAC GTCCTCACCG TCAGCCTCGA TACCGCAAAT AACCGGATGC TCGCCACCTC TCGCGATGGT CGCGGCGTTT ACTCCAGCTC CGACAACGGA CAGACTTGGA AGTATTCAGA TGACGCTGGT CTGCTGGTTC GTAGCGCCGT CGGTTATCGT GGCGGCTACC TGGCAGCGAC GGCGTACAAC GGCGTGGCGA TTTCCTCGGC ACCGGGCCAC AGTGCGACGG CGCCTTCAGC GAGCGGTTCC GGTAACTCGA ATTAA
|
Protein sequence | MRLATASIPR GRAVSLTQMF MRKLFSLLFV LLLAVTLASA SSWKELGPDG GDARSLAFDP HNPDRILLGT SSGQLYMSND RGHSWNRFAQ IGLGDDYVLD NTAFDPSDPN TIYIGVWSVE HQRDAGDLYV TRDGGKSWKT IEGMRGKSIR ALSIAPSNSK TLVIGALDGV YRSDDSGETW RRISPENHAE IKNIESIAID PKNPDVVYAG TWHLPWKTDD GGKSWHHIKE GVIDDSDVFS IIVDFSNPST VFASACSGIY KSESAGNLFH KVTGIPATAR RTRVLMQDPK NPQIVYAGTT EGLYKTLDGG KTFKRMTGPE VIVNDVSVDP RDTSRVLLAT DRSGVLASEN GGATFTQSNR GYSHRQVSSL LVDSKDPNTI YVGLLNDRDF GGMYVSRDAG STWSQASKGL KDRDVFTLRQ AGDGDIFAGT NHGVMKFSSK TLLWEPASVV VKEKTTPGPK IPAKVVKGKK IPAHEGTPKV TIEKSELTSQ VSQLVFTPAL WYAAASSGVY SSKDDGKTWQ HADIEGDVRF LAIGAFGDKA FAASALDGYV TTDHGGHWTQ VSVPKFITGI YDAAVGFDQS LWLATQQGAL RSGDDGKTWE HVTAGLPWKH VLTVSLDTAN NRMLATSRDG RGVYSSSDNG QTWKYSDDAG LLVRSAVGYR GGYLAATAYN GVAISSAPGH SATAPSASGS GNSN
|
| |