Gene Information |
Locus tag | Acid345_3262 |
Symbol | |
ID | 4072674 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3864371 |
End bp | 3865471 |
Gene Length | 1101 bp |
Protein Length | 366 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 637985283 |
Product | hypothetical protein |
Protein accession | YP_592337 |
Protein GI | 94970289 |
COG category | [S] Function unknown |
COG ID | [COG4641] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
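
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the stop codon. The record does not state either point, but both agree with the reported 1101 bp and 366 aa. The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming the span ends with a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
gene_len = gene_length_bp(3864371, 3865471)
prot_len = protein_length_aa(gene_len)
print(gene_len, prot_len)  # 1101 366
```

The strand field does not affect this calculation: a gene on the minus strand spans the same number of base pairs.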
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.403362 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGATCT TCGTCTTCGG CTCCAGCCTG GTTTCGTCGT ACTGGAACGG TGCGGCGACC TACTACCGCG GCATTTACAA GAACCTCGCG GCGTTGGGTT GCGAAATCAT TTTTGCTGAG CCCGACATCT ACAAACGCCA GCAGAATCGC GACCCGGGCG ACTACAGCTA TGCGAAATCT CTGGTCTACC GCACGCCCGA CGACATTGAT TGGCTCCTGC GCCTCGCCAG CGACTCCGAC CTGGTCATCA AGCACAGTGG AATCGGCGCC GAAGATGCGC TGCTCGAGCG CCGCGTCCTC GACTGCCGCT CCCCACGCAC CCGCGTTGCC TTCTGGGACG TGGACGCCCC GGCCACGCTG TCTAGTGTCG AAGCCGAGCC GTTGAATCCC TTCCGCGCCT GCATTCCCGA GTACGATTTC GTCTTCACCT ACGGTGGCGG CCCGCCTATC GTTCAACGAT ATCTCCAACT CGGTGCGAAG AATTGCCACC CCGTTTATAA CGCACTCGAG CCCGAATCGC ACCACCCGGC CGACCCGAGT TCAGATTTCG CTTGCGATCT TGTTTTCGTC GGCAACCGGT TGCCAGATCG CGAACGGCGT GTCGAGCAGT TCTTCCTGCG CACCGCCGAA CTCGCGCCGG AATGCAAGTT CATCCTGGGC GGAGAGGGTT GGGGCAACAA GGAGCTACCG CCGAACGTCC GCTGGATCGG CCACGTCCGC ACTGGCGACC ACAATCGCGT GAATTGCTCC GCGCGCATGG TCCTCAACAT CAACCGCGAA TCCATGGCTG ACGTCGGGTT CTCCCCTCCG ACGCGTGTAT TCGAAGCAGC CGGCGCCGGC GCATGTCTCA TTACCGATCA CTGGAAGGGT ATCGAAACGT TCTTCGAGCC AGGCAACGAG ATCCTGGTCG CATCCAGCGC CGACGAGATC GTCCACCTTC TCCGAACGAC GCCACCGAAG AAGGCCCGCG CGATTGGACA AGCGATGCGT ACCCACGCCC TGCGTGACCA CCTATACGCC CAGCGTGCGG AACTCGCGAT GTCGATTTTC AGAAGTTCGG CTCCGCAGAG GGAACGGCAC GAACTCCGAC AAGCGGTGTA G
|
Protein sequence | MKIFVFGSSL VSSYWNGAAT YYRGIYKNLA ALGCEIIFAE PDIYKRQQNR DPGDYSYAKS LVYRTPDDID WLLRLASDSD LVIKHSGIGA EDALLERRVL DCRSPRTRVA FWDVDAPATL SSVEAEPLNP FRACIPEYDF VFTYGGGPPI VQRYLQLGAK NCHPVYNALE PESHHPADPS SDFACDLVFV GNRLPDRERR VEQFFLRTAE LAPECKFILG GEGWGNKELP PNVRWIGHVR TGDHNRVNCS ARMVLNINRE SMADVGFSPP TRVFEAAGAG ACLITDHWKG IETFFEPGNE ILVASSADEI VHLLRTTPPK KARAIGQAMR THALRDHLYA QRAELAMSIF RSSAPQRERH ELRQAV
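
The relationship between the gene sequence and the protein sequence can be illustrated with a short sketch. It assumes the gene sequence is shown in coding orientation (it begins with GTG and ends with TAG) and that the space-separated 10-character blocks are display formatting only. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code but accepts alternative start codons such as GTG, which are read as methionine at the first position. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (first, second, third base each cycling through TCAG).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO))
# Start codons of translation table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a coding-strand sequence up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # any valid start codon initiates with Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three display blocks of the gene sequence above.
prefix = "GTGAAGATCT TCGTCTTCGG CTCCAGCCTG"
print(translate(prefix))  # MKIFVFGSSL
```

The result matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` should give values comparable to the GC content and Protein Length fields, although the database may round or compute GC content differently.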