Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_3561 |
Symbol | |
ID | 4069293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | - |
Start bp | 4209584 |
End bp | 4210783 |
Gene Length | 1200 bp |
Protein Length | 399 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637985584 |
Product | hypothetical protein |
Protein accession | YP_592636 |
Protein GI | 94970588 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR03436] VWFA-related Acidobacterial domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.239873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTTGC CCTATAAACA CCTTAAAAAA TGGCTTAGCT CGATCGCACA GGAGGCATGC GCCGCTTCTG CTACGGCAGC GGTGATAGCC GGGTTAGTCC TCCCGCCGCA GGTAGCGCTT GCCCAGTCGC AATCGCAAAG CGGTCAACAG CAGCAGCCAG AAGTTCCGGA AGCCGGTGGC CCTGCGGGCG ATACCGGCCC CATGGCGATT CCGAAAAAGA CGGAAAAGCC GGTGCCGCCA CCGCCGCCGC GGCCGAAGAC GCCAAGCTCC GACTCACCGA GCTATTCCAT CAGCGTGGAC GTGCCGCTGG TGAACATTGA CGTCTCGGCG AACACGAAAG ACGGTGGGTT CATCCCCGGG CTGAAGAAAG AGAATTTCCG CATCTACGAA GATGGCGTGG AACAGAAGAT CACCAACTTC GCGCAGACCG AGGCACCGAT CACGGCAGTA TTGCTGGTGG AATTCGCGAA CGGCAGCTAT GCGTTCATGA ACGACGCGCT GGTGGCGTCG TATAACTTCG CGGCCAACCT GAAGAAGGAC GACTGGGTGG CGGTAACCGA GTTCGATATG AGGACGCACA TCCTGGTGGA CTTTACCCAG GACAAGCGTC AGATTTACGG CGCCCTCAAT ACCCTGCGGA TTCCGGGCTT CAGCGAAGTG AACGTTTTCG ACGCACTCTA CGACACGCTG GACCGGTTGG ACCGCGTGGA AGGCCGCAAG GAAATCGTCC TGGTAAGTTC GGGACGCGAC ACGTTCAGCC GCATCAACCT GGACCAGATT TTGAAGAAAG TGAAAGCGAC TCCAAATGTG ACGATCTTCT CGATCAGCAC CGGCGCAGCC TTCTTGATCT GGGCCGAGGC GCACGGGATG GGATCGATGC GAGAACTCGA TTACCTGCAG GCTGACAACG AAATGAACGA ATACGCCAAG CTTACCGGCG GCCAGCACTA CAAACCGCGC TTCGAAGGCG AGTTCCCCGA CATCTTTAAG GACATCGCAG CACGGGTGCG AAACCAGTAC ACCATCTCGT ACCACCCGGT GAACCATAAG CAGGACGGTT CCTACCGCAA AGTGAAAGTG GAACTGGTGG CCGGCGACGG CAGCGGTAAG CCGCTCATCG TGAAAAACGA GAAGGGCAAA GAGTTGAAGA CAGTGCTGGC GTATCGCGAA GGGTATTCGG CGAAACACCA GGTGGAATAG
|
Protein sequence | MTLPYKHLKK WLSSIAQEAC AASATAAVIA GLVLPPQVAL AQSQSQSGQQ QQPEVPEAGG PAGDTGPMAI PKKTEKPVPP PPPRPKTPSS DSPSYSISVD VPLVNIDVSA NTKDGGFIPG LKKENFRIYE DGVEQKITNF AQTEAPITAV LLVEFANGSY AFMNDALVAS YNFAANLKKD DWVAVTEFDM RTHILVDFTQ DKRQIYGALN TLRIPGFSEV NVFDALYDTL DRLDRVEGRK EIVLVSSGRD TFSRINLDQI LKKVKATPNV TIFSISTGAA FLIWAEAHGM GSMRELDYLQ ADNEMNEYAK LTGGQHYKPR FEGEFPDIFK DIAARVRNQY TISYHPVNHK QDGSYRKVKV ELVAGDGSGK PLIVKNEKGK ELKTVLAYRE GYSAKHQVE
|
| |