Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0777 |
Symbol | |
ID | 8413642 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 856576 |
End bp | 858129 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645022359 |
Product | RNA binding metal dependent phosphohydrolase |
Protein accession | YP_003179797 |
Protein GI | 257784580 |
COG category | [R] General function prediction only |
COG ID | [COG1418] Predicted HD superfamily hydrolase |
TIGRFAM ID | [TIGR00277] uncharacterized domain HDIG [TIGR03319] conserved hypothetical protein YmdA/YtgF |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000796077 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.068248 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAATTAG CAGGAGTTGG CATTGTTTGC CTTCTTGTGG GAGTAGGCCT TGCATACGGC GTGCTCTCTA ACATCAGTAA TTCAAAGATT AAAACTGCAG AGCAGCAGTT AAAAGATGCT CAGACTAATG CAGATCGTAT TGCTTCTGAA GCAACTCGCC AGGCAGAAAC AGTTAAAAAG GAAGCTGTTC TTGAGGCTAA AGAAGAGGTT CTTCAGCTTA AGCAGGCAGC AGAGGCAGAC GAGAAGAAGC GCAAGAGCGA GCTTCGCAGC ATGGAGAATC GTATTCTTCA GCGTGAAGAA TCTCTTGATC ACCGTACCGA CGCATTAGAA AAGCGTGAGC ATCAACTTTC TAGTCTTCAA GGTCAGCTTG ATCGTCGTAA GAACGATTTA GACTCTCTTG TGTCTCAGCA ATCTCAAGAG CTTGAGCGCA TTGCAGCTCT TACAAAAGAC GAAGCTCATG ATGAGCTTCT AGCTCGTGTT CGTTCAGAAA GCGTTCGTGA TGAGGCAATG ATTCTTCGTG AGTCTGAGCA GCGCGTTCGT GCACAGGCAG ATAAAACTGC TCGTGAGATT ATTTCTACTG CCATTCAGCG TGTTGCTGCA GATCAGGCTT CTGAGATTAC CGTTACTTCT GTTCATATTC CATCCGATGA TTTAAAGGGT AGGATTATTG GACGCGAAGG CCGCAACATT CGTACTTTTG AGCAAGTATC TGGTGTTTCT CTTGTTATTG ATGACACTCC AGAGACTGTT GTTCTTTCCA GTTTTGACCC CGTCCGTCGT GAGACTGCTC GCGTTGCTCT TGAGAATCTT ATTGCAGATG GTCGTATTCA TCCTGCACGT ATTGAGGAGC TCTATAAGAA AGCTGAAGCC CTCGTTAACG AGCGTGTTCT TGAGGCTGGT GAGCAGGCCG CATTTGATTG TGGTATTCAT GATCTACATC CAGAGATTGT TAAGACGCTT GGTAAGCTTC GTTACCGCAC TTCTTATGGT CAGAATGTTC TTGCCCACTC AGTTCAGGTT GCAGTACTTT GTGGCATTAT GGCTGAAGAG CTTGGTCTTG AACCTGCTCC AGCAAAGCGG GCCGGTCTTC TCCATGACCT TGGTAAGGCA ATCGATCACG AAGTCGAAGG TCCACACGCT GTAATTGGTG CAGATCTTGC TCGTCGTTAT GGTGAGCGTC CAGAGATTGT TCATGCAATT GAAGCTCACC ACGCAGATAT TGAGCCAAAC ACCGTTCTTG ACATGCTTGT CATGGCCGCA GATGCAATTT CTGCTGCTCG TCCTGGTGCT CGTCGTGAGT CCGCTGAAAA CTACATTAAG CGTCTAGAGA AGCTTGAGGC AATCTCTAAT GCTCATGAGG GCGTTGAGCG TACTTACGCA ATGCAGGCTG GCCGTGAGCT TCATGTAATG GTTGAGCCTC AAATGATTAG TGATTCCGAG GCAACTGTTC TTGCTCATGA TATTGCTAAG CAAATTGAGG ATGAGATGGA ATATCCAGGA CAGGTTCGCG TTGTTGTTAT TCGTGAGTCT CGTGCCGTAG ATGTAGCTAA ATAA
|
Protein sequence | MELAGVGIVC LLVGVGLAYG VLSNISNSKI KTAEQQLKDA QTNADRIASE ATRQAETVKK EAVLEAKEEV LQLKQAAEAD EKKRKSELRS MENRILQREE SLDHRTDALE KREHQLSSLQ GQLDRRKNDL DSLVSQQSQE LERIAALTKD EAHDELLARV RSESVRDEAM ILRESEQRVR AQADKTAREI ISTAIQRVAA DQASEITVTS VHIPSDDLKG RIIGREGRNI RTFEQVSGVS LVIDDTPETV VLSSFDPVRR ETARVALENL IADGRIHPAR IEELYKKAEA LVNERVLEAG EQAAFDCGIH DLHPEIVKTL GKLRYRTSYG QNVLAHSVQV AVLCGIMAEE LGLEPAPAKR AGLLHDLGKA IDHEVEGPHA VIGADLARRY GERPEIVHAI EAHHADIEPN TVLDMLVMAA DAISAARPGA RRESAENYIK RLEKLEAISN AHEGVERTYA MQAGRELHVM VEPQMISDSE ATVLAHDIAK QIEDEMEYPG QVRVVVIRES RAVDVAK
|
| |