Gene Information |
Locus tag | Acid345_1092 |
Symbol | |
ID | 4069552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 1369356 |
End bp | 1370369 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637983101 |
Product | hypothetical protein |
Protein accession | YP_590169 |
Protein GI | 94968121 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
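The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is what the listed Gene Length implies) and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(1369356, 1370369)
print(length, protein_length_aa(length))  # 1014 337
```

Both values agree with the Gene Length and Protein Length rows.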
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.109622 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 1 |
Fosmid unclonability p-value | 0.000250734 |
Fosmid hitchhiking | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGTCCATAC TCTTCGGAGG CGGATCAGAA TTTCGTCGGA TTGAATATAA GAGCGAGGCT GAATTTGAGC GCTGCATAGT CGATATTCAG CACCGATTGT TCGGTCCCTC CCGGTTCTAC TTGGACATCA AGCGAAAAAT CGGGGTCAAA GGCGGAGTGC AGAACATTCC CGATGGCTAC CTTCTCGACC TCTCAGGTCC GACGCCTCGA TTGTATGTGG TAGAAAACGA GCTAAAAGCA CACGACCCGC TTCGCCACGT TGCAGTTCAG ATCCTCCAGT TCTCCATCTC GTTCGAAAGT GAGCCGCTAG CCGTCAAAAG AATTCTGCTT TCCGCGCTGA ACGAACAACC GTCGATTCGA GAAGCCTGTG AGAAGTACGC TCCCGCGCGG GGATACAGGA ACCTCGATCA TCTAATCGAG TACATGGTCG CGGAATGCCC TTTTGCGGCT CTCGTGATTA TTGACGAGAT GCCGGAGTCG CTCCAAAGCG TGCTTTCGCA AAGATTCCGG TTCGGTGTCG AAGTCTTGGA AGTCGCTTGT TATGAGGACA AGAATGGCGG ACGACTTTTT CTGTTCGAAC CTTTCCTGGC TGATGTAGTC GGCGATACTA CTGCAGATCA AAATGCAGAA GGGATGTCTG CGATCGATAC TTCTGAAATC GATACTCTCG TTGTTCCGGC GCGTGAGGAT GGATTTGAAG AAGTATTTCT ACGCGAAAAC CGCTGGTATG CAGTGCGGAT TCACGACACG ATGCGACCAC AGATCAAGTA TCTGGCGGCC TACCAAGTGC ATCCCGTTTC GGCGATCACC TTCATCGCGC CGGTACAGTC GATTGAGCCT TGGAAGGAGT CAGGCAAGTA TGTGCTAAAT TTCGCTGAGC CTGCACGGCC GGTCGGGCCG CTTGCTTTAG TTAAAGGTGG CCAGGTTCGT CCGCTTCAGG GGCCACGATA TGCCACACAC AAAGCGATTG TGGCAGCGAA GACACTCGAT GACGTATGGA AGAAGCAGCA GTAA
|
Protein sequence | MSILFGGGSE FRRIEYKSEA EFERCIVDIQ HRLFGPSRFY LDIKRKIGVK GGVQNIPDGY LLDLSGPTPR LYVVENELKA HDPLRHVAVQ ILQFSISFES EPLAVKRILL SALNEQPSIR EACEKYAPAR GYRNLDHLIE YMVAECPFAA LVIIDEMPES LQSVLSQRFR FGVEVLEVAC YEDKNGGRLF LFEPFLADVV GDTTADQNAE GMSAIDTSEI DTLVVPARED GFEEVFLREN RWYAVRIHDT MRPQIKYLAA YQVHPVSAIT FIAPVQSIEP WKESGKYVLN FAEPARPVGP LALVKGGQVR PLQGPRYATH KAIVAAKTLD DVWKKQQ
|
| |
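The sequences above are displayed in space-separated blocks of ten characters. The following is a small Python sketch, using hypothetical helper names, of how such a string might be joined back together and sanity-checked against the record (GC fraction, start codon, stop codon). The stop codons are those of translation table 11. The short demo string is made up for illustration and is not the gene above.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def join_blocks(display_seq: str) -> str:
    """Remove the whitespace between display blocks and upper-case the result."""
    return "".join(display_seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Length is a multiple of 3, starts with ATG, ends with a stop codon.

    Simplification: table 11 also permits alternative start codons;
    only ATG is checked here.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


demo = join_blocks("ATGGCCGGAT AA")  # made-up 12 bp example, not the gene above
print(len(demo), gc_fraction(demo), looks_like_cds(demo))  # 12 0.5 True
```

Applied to the Gene sequence field, the same helpers would be expected to report a length matching Gene Length and a GC fraction close to the listed GC content, which is shown rounded to a whole percent.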