Gene Information |
Locus tag | Acid345_2202 |
Symbol | |
ID | 4069092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2623653 |
End bp | 2624708 |
Gene Length | 1056 bp |
Protein Length | 351 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637984218 |
Product | glycosyl hydrolase |
Protein accession | YP_591277 |
Protein GI | 94969229 |
COG category | [R] General function prediction only |
COG ID | [COG4447] Uncharacterized protein related to plant photosystem II stability/assembly factor |
TIGRFAM ID | |
|
|
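The coordinate and length fields in this table can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length_bp = gene_length(2623653, 2624708)
print(length_bp, protein_length(length_bp))  # prints: 1056 351
```

Under these assumptions the result agrees with the Gene Length (1056 bp) and Protein Length (351 aa) fields.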
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.120875 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.984431 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGACCTC GGAAAACAAC CAGCGCTTGC CTGCTGCTCC TCGCTTCGGT CGCTTCAGCC CAATGGCAGA TGCAGGAGTC GCATAGCAAC GCTGGTCTGC GCGGCATCCA CGCCGTGAAT TCGACGATCG CGTGGGCTAG CGGCACCAAC GGTACGGTCT TGCGCACCAC AGACGGCGGC ACGCACTGGC AGAAATGCGC CGTGCCACCC GACGCCGACA AGCTCGATTT CCGCGCGGTA TGGGCATGGG ATGACCAGAA CGCTTACGCA ATGTCAGCCG GGCCGGGTGA GTTATCGCGT GTCTACGCCA CGTCGGACGG CTGCACGCAT TGGTTTGAAA TCGGACGTAA CAAGGATCCC AAAGGCTTCT GGGATGCCAT GGTCTTCGCC GCTGACGGAA AGCTGCGGAA CGGGGTGGAC CGCACTGGCG TTCTGATGGG CGATCCAGTG GAAGGGAAAT TCTATGTCGC CAAACAGAAG TTCGGTCGCG GCTTTCGCAT GGCGGACGAC TTCTCCTGCT CGCCGAATCC CGACGAATCG GCGTTCGCGG CGAGCAATTC TTCGGTGGCC GTACTTCCCT TGCAAATCAT GGTTGGAACC GGTGGCAAGA GCGGCCCGAG AGTGCTGATT TCCGTGCCGA TGATGAGCAA AGATACGTGC ACCGCGTATC CGGTTCCGCT TGCGAGCGGC GCAGATTCTA CCGGAATCTT TTCGCTGATA TTCCGCACGG CACAGATTGG AATCGCAGTC GGTGGCGATT ACCAGAAGCC TGATGCGACC GCCGGTACCG CAGCCTGGAG CAACGACGGC GGCCATCATT GGACGGCCGC AACCACGCCA CCACACGGCT ATCGCTCCGC CGTCGCATGG GACGAGGCTA AAGGCGTCTG GATCGCCGCC GGCACCAACG GTTCCGACCA ATCGCGCGAC GATGGTAAGA CCTGGGAACC GCTGGATAAA GGGAACTGGA ACGCGTTGTC GCTTCCCTTC GCAGTCGGAC CTAACGGACG GATTGGCAAG TTCGCAGATG TGAAGCCAAC TACTTCGTCG CGATGA
|
Protein sequence | MRPRKTTSAC LLLLASVASA QWQMQESHSN AGLRGIHAVN STIAWASGTN GTVLRTTDGG THWQKCAVPP DADKLDFRAV WAWDDQNAYA MSAGPGELSR VYATSDGCTH WFEIGRNKDP KGFWDAMVFA ADGKLRNGVD RTGVLMGDPV EGKFYVAKQK FGRGFRMADD FSCSPNPDES AFAASNSSVA VLPLQIMVGT GGKSGPRVLI SVPMMSKDTC TAYPVPLASG ADSTGIFSLI FRTAQIGIAV GGDYQKPDAT AGTAAWSNDG GHHWTAATTP PHGYRSAVAW DEAKGVWIAA GTNGSDQSRD DGKTWEPLDK GNWNALSLPF AVGPNGRIGK FADVKPTTSS R
|
| |
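The gene sequence, protein sequence, and GC content fields can be related with a short sketch. It assumes the sequences are stored in space-separated blocks of ten characters, as shown in this record, and it relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code. Alternative start codons, which table 11 permits, are not handled; this gene starts with ATG. All names below are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between blocks and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First two blocks of the gene sequence in this record.
print(translate("ATGAGACCTC GGAAAACAAC"))  # prints: MRPRKT
```

The translated prefix matches the first six residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete Protein sequence and GC content fields.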