Gene Information |
Locus tag | Acid345_2836 |
Symbol | |
ID | 4071839 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 3373576 |
End bp | 3374847 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637984854 |
Product | hypothetical protein |
Protein accession | YP_591911 |
Protein GI | 94969863 |
COG category | [S] Function unknown |
COG ID | [COG1262] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR03440] conserved hypothetical protein TIGR03440 |
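
The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes one stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp rows of the record.
length = gene_length_bp(3373576, 3374847)
print(length, protein_length_aa(length))
```

With the values from the record this gives 1272 bp and 423 aa, which matches the Gene Length and Protein Length rows.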
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCGAAC GAATGTCGGC GATCCAGACA GGGCCAGACA GCAAGGCCCT GGCAAAACAG TTCTCGTCCG TGCGCTCTTT CAGCGAGCGA CTGGTCGCCC ATCTCGCGCC AGAGGACCTG ATGGTCCAGT CCATGCCGGA CGCAAGCCCG GCAAAGTGGC ACCTCGCCCA CACCACGTGG TTTTTCGAGA CTTTCCTGCT TGCTGAGTTC CAGCCCAGCT ACAAAGCCTA CGACCCGGCT TTTCGGGCGG TCTTTAATTC CTATTACAAA GGCGTCGGAA AGCATCCGGT GCGCGGGATG CGCGGCACAT TTTCGCGTCC CACGCTCGAT CGCGTGCTCG CGTATCGGGT CCACGTGAAC GCCGCAATGG AGCGGCTGAT CGATTCGGAT CTGCCGGAGA GCGCGAGAAC TCTAATCGTC CTCGGCCTCA ATCACGAACA GCAGCACCAG GAACTGATCG TCACCGACAT CAAGCACGCC TTCTGGACCC AGCCCTTGCA GCCCGCGTTC GTTGAATCGT CAGACGAAGA AAGTCGTTCT GCTCCTCCGC TCACCTGGTC GGCATTCGAC GGTGGGGAAG TCGAGATCGG GCATACCGGC TCCGGCTTCT CCTTCGACAA CGAAGAGCCT CGGCATCGCG TTCTTTTGCA GCCATACAAG TTGGCGAATC GGCTGGTCAC CAATCGCGAG TACTTAGCAT TTATGCAGGA TGGTGGTTAC CACCGGCCTG AGTTGTGGCT CTCCGATGGT TGGGACACCG TCAATGCGCA GGGATGGGAA GCGCCGTTTT ACTGGGATCG CGACGGACAG CAGTGGCGTG TCTTCACCGC CGCTGGAACG AAGCCGGTAA ATCTCGATGA GCCGGTTTGC CACGTCAGCT TCTACGAGGC CGATGCTTAC GCACACTGGG CCAATGCGCG GCTGCCGCTC GAAGCCGAGT GGGAACACGC TGCGGCATCT CAGCCGATAC GCGGCAACTT CGCTGAGTCC GGACGATTTC ATCCAACCGT TGCGCCATCC GCGGATGCGC CACAGTTCTA CGGCGACGTT TGGGAGTGGA CCGCCAGCCC ATATGTTGGC TATCCGGGAT TTCAGCCGGC AGCGGGCCTG GTCGGCGAGT ACAACGGCAA GTTCATGTGC AATCAGTTCG TGTTGCGTGG AGGCTCGTGC GCCACGCCGC AGTCTCACAT TCGGGCCAGC TATCGCAATT TCTTCCCGCC ACAGGCTCGG TGGCAATTCA TGGGAATCAG GTTAGCGGCC AATGCACGTT AG
|
Protein sequence | MSERMSAIQT GPDSKALAKQ FSSVRSFSER LVAHLAPEDL MVQSMPDASP AKWHLAHTTW FFETFLLAEF QPSYKAYDPA FRAVFNSYYK GVGKHPVRGM RGTFSRPTLD RVLAYRVHVN AAMERLIDSD LPESARTLIV LGLNHEQQHQ ELIVTDIKHA FWTQPLQPAF VESSDEESRS APPLTWSAFD GGEVEIGHTG SGFSFDNEEP RHRVLLQPYK LANRLVTNRE YLAFMQDGGY HRPELWLSDG WDTVNAQGWE APFYWDRDGQ QWRVFTAAGT KPVNLDEPVC HVSFYEADAY AHWANARLPL EAEWEHAAAS QPIRGNFAES GRFHPTVAPS ADAPQFYGDV WEWTASPYVG YPGFQPAAGL VGEYNGKFMC NQFVLRGGSC ATPQSHIRAS YRNFFPPQAR WQFMGIRLAA NAR
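
The sequences above are printed in 10-character groups, so the spaces have to be stripped before any analysis. The following is a minimal sketch of that cleanup, together with a GC-content calculation and a translation step. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in permitted start codons, not in codon meanings). All function names are hypothetical, and only a short prefix of the gene sequence is used in the example.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(grouped: str) -> str:
    """Remove the spacing between 10-character groups and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two groups of the gene sequence from the record.
prefix = clean_sequence("ATGAGCGAAC GAATGTCGGC")
print(translate(prefix))
```

The prefix translates to MSERMS, which agrees with the first six residues of the protein sequence in the record. Running `gc_percent` on the full cleaned gene sequence would give a value to compare against the GC content row.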