Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Acid345_2165 |
Symbol | |
ID | 4073107 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Candidatus Koribacter versatilis Ellin345 |
Kingdom | Bacteria |
Replicon accession | NC_008009 |
Strand | + |
Start bp | 2583841 |
End bp | 2584950 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637984181 |
Product | metallophosphoesterase |
Protein accession | YP_591240 |
Protein GI | 94969192 |
COG category | [R] General function prediction only |
COG ID | [COG1408] Predicted phosphohydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.440112 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCTCTTGT TCTTGATCGT TTTCCTGTTG CTGTGGGGAG GCATGCACGT ATACGTCGTG CAGCGCCTCC TTTCCATTCC CACGCTTGCG GCGCATCTCC CGCTCAAAAT CTTTCTTCCC ATCGTCACTT TCCTGGCGCT GAGCTACGTC GCCTCCAGAA TCATGGAGCA CTATGAGCTG GGTAGGTTCT CGCACGTTCT CGAGTACATC GGGGCGACCT GGGTCGGCAT CGTTTTTCTT CTGTTTGCGA TGTTCGTTTT CGCAGATGCG CTCTCGGGTT TCGGTCTCTT TTTCCGTGAG CAACAGGTCC TGATTCGCAC CGTCGCGCTT TGCACAGCCT GCGTTCTCAT TGCCATCTCC TACGTTCAGG CGTGGCGCAC TCCTGTCGTC ACCGAGCACG AGGTCGCGAT GCCGGGACTG CCGGCGTCGG CCGATGGCAC CGTGGTTGTG GTTGGTACAG ATCTCCATCT TGGCTCCATG CTGAATCACC GCTGGGCGAG CGCGCGTGCG GAACAGTTCA AGGCGCTCAA GCCCGATCTG ATCCTGCTGA TCGGCGACAT TTTCGAAGGC GAGAAAGAAA CACACGCGGG ATGGCTTCCG GTGCTTCAGA AATTTCGCGC TCCGCAGGGC GTCTACGCTG TTACCGGAAA CCACGAGTTC TATGCCGGAC CGGACGCAAT CATCGAACTG ATGGGCCGAG CCGGGTTTCG TGTCTTACGC GACGAGAACG TGGAGCTATT GCCGGGGTTG GTGATCGCTG GGGTGGATGA TCCTGCATTC CGCAAGCGCG GCAATCGCGA TCAATCGGTG GCGCTCGATC AGGCCTTCGC CGATCATCCC GGAGGCGCCA CGATTTTTCT CTCCCACACG CCGGTACTTG CCGAGAAAGC CGCGCAACTT GGCGCGGGGC TCATGCTCTC TGGCCATACT CACAAGGGTC AAATCTGGCC CTTTCAATAC ATCGTACGGC TGGCGTTCCG CCTAGTCTCC GGACGCTACG ACATAAGCGG TATGACCGCG ATTGTCTGCC GCGGTACTGG CACTTGGGGG CCGCGCATGC GCCTGTGGCA GCCCAGCGAG ATTTTACGTA TTACGCTGAA ATCTATGTAA
|
Protein sequence | MLLFLIVFLL LWGGMHVYVV QRLLSIPTLA AHLPLKIFLP IVTFLALSYV ASRIMEHYEL GRFSHVLEYI GATWVGIVFL LFAMFVFADA LSGFGLFFRE QQVLIRTVAL CTACVLIAIS YVQAWRTPVV TEHEVAMPGL PASADGTVVV VGTDLHLGSM LNHRWASARA EQFKALKPDL ILLIGDIFEG EKETHAGWLP VLQKFRAPQG VYAVTGNHEF YAGPDAIIEL MGRAGFRVLR DENVELLPGL VIAGVDDPAF RKRGNRDQSV ALDQAFADHP GGATIFLSHT PVLAEKAAQL GAGLMLSGHT HKGQIWPFQY IVRLAFRLVS GRYDISGMTA IVCRGTGTWG PRMRLWQPSE ILRITLKSM
|
| |