Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_4639 |
Symbol | |
ID | 4070796 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 5498499 |
End bp | 5500109 |
Gene Length | 1611 bp |
Protein Length | 536 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 637986679 |
Product | hypothetical protein |
Protein accession | YP_593713 |
Protein GI | 94971665 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACTCT CCGCGACACT GCTCACGATG ATTGCATTGC TCTCATTGCC GCTGCTGCTC CGTGGCGGGA AACATGAAGA GCTGAAGCCC GCGTTCCAAA CGTCGGACCG CTGCATGGCC TGCCACAACG GATTGACCGA CACGCAGGGC AAAGATATTT CGATCGGCCT GAGTTGGCGT GCGAGCGTGA TGGGCAATTC GTCGCGCGAT CCGTACTGGC AAGCGAGCGT CCGTCGCGAG ACGATCGATC ACCCAGCGGT GAGCGCCGAG GTGCAGGACG AATGCTCGAT CTGCCACATG CCCATCGTCC GCTACGAAGC GGCGATGCAG GGCAAAAAGG CAGAGCCCTT CAAGTTCTTC CCGCTTGCGC AGAATGGAAC GAAGGAGTCG CGCGATGGCG TTTCGTGCGC GGTGTGCCAC CAGATTTCGT CGGAACGACT CGGTACGAAG GAGAGTTTTA CCGGACAATT CAAAGTCGAC GCGCCGAGCC AGAAAGACGT GCGCCCTGAG TTCGGGCCTT TCGCTGTCGA TCCTGGACAT CAGCGAATCA TGCAGTCCTC TACCGGCGGG TTCACGCCGA TGGCAGCATC GCATATCCGC GATTCGGCAC TGTGCGCGAC GTGTCACACG CTCTACACCA CGGCACGCGG CGAGGGCGGC AAAGCGATTG GTACTCTGCC GGAGCAGGTT CCGTTTCTCG AATGGCAGCA CAGTTCGTAT GTGAACGAAG CCACGTGCCA GAGCTGCCAC ATGCCCGAAG TGAAGGGCGC GGTGGGAATC ACTGCAGTTT TGCCGGTGAT GCGCGAGGGC ATGCATCAGC ACACCTTTAC GGGCGGGAAT TTCCTGCTGC CGCGCGCACT CGATAAGTAC CGCAACGAGC TCGACACCCG GGCGCTGCCG CAGGAGCTCC AAGCGGATTC CGAACGTACG ATCCAGTTCC TGCAAACGCA GGCGGCGAAG GTCGCGGTCA AGAACCTCGA TGTAAATGGC AGCACGATGC GGGTGGATGT CGAGGTCGAA AACAAGACCG GGCACAAGTT GCCAACGGCG TATCCCTCGC GGCGAGCGTG GCTGTGGGTG AAAGTGACCG ACCGCGACGG CCACACCGTC TTTGAGTCGG GCAAGCTGAA CTCGGATGGA TCGATCGCCG GCAATGACAA CGATGCCGAT CCTACAAAGT ACGAGCCCTA CTATCGGGAG ATCACGAGCG CCGACCAGGT GCAGATCTTC GAAGACATTC TCGGCGACGA GCGCGGGCAG GTAACCACTG GTCTGCTCAA GGGCGTGCGT TATCTCAAGG ACAGCCGCCT GCTACCGCAG GGATTCGACA AAGCGACGAC TTCGAAGGAC ATTGCGACAT ACGGCAATGC CACGGACGAT CCGAATTTCG CCGCCGGCGG CGCGCTGGTG CGCTACTCGG TGCCACTCGG AAACGCCGCG GGGCCGTTCC ATGTGCAGGC TGAGCTCTGG TATCAGCCAA TCGGTTTCCG CTGGGCGCAC AATCTTGAAC CATACCTGCA GGCAGACGAG CCAAAGAGGT TCGTGAATAT TTACAACGCG ATGTCGAATG TTTCTGCGCT GAAGTTGGCA GGCGCAGAAG CTACTCGCTA A
|
Protein sequence | MKLSATLLTM IALLSLPLLL RGGKHEELKP AFQTSDRCMA CHNGLTDTQG KDISIGLSWR ASVMGNSSRD PYWQASVRRE TIDHPAVSAE VQDECSICHM PIVRYEAAMQ GKKAEPFKFF PLAQNGTKES RDGVSCAVCH QISSERLGTK ESFTGQFKVD APSQKDVRPE FGPFAVDPGH QRIMQSSTGG FTPMAASHIR DSALCATCHT LYTTARGEGG KAIGTLPEQV PFLEWQHSSY VNEATCQSCH MPEVKGAVGI TAVLPVMREG MHQHTFTGGN FLLPRALDKY RNELDTRALP QELQADSERT IQFLQTQAAK VAVKNLDVNG STMRVDVEVE NKTGHKLPTA YPSRRAWLWV KVTDRDGHTV FESGKLNSDG SIAGNDNDAD PTKYEPYYRE ITSADQVQIF EDILGDERGQ VTTGLLKGVR YLKDSRLLPQ GFDKATTSKD IATYGNATDD PNFAAGGALV RYSVPLGNAA GPFHVQAELW YQPIGFRWAH NLEPYLQADE PKRFVNIYNA MSNVSALKLA GAEATR
|
| |