Gene Apar_0777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0777 
Symbol 
ID8413642 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp856576 
End bp858129 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content47% 
IMG OID645022359 
ProductRNA binding metal dependent phosphohydrolase 
Protein accessionYP_003179797 
Protein GI257784580 
COG category[R] General function prediction only 
COG ID[COG1418] Predicted HD superfamily hydrolase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR03319] conserved hypothetical protein YmdA/YtgF 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000796077 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.068248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAATTAG CAGGAGTTGG CATTGTTTGC CTTCTTGTGG GAGTAGGCCT TGCATACGGC 
GTGCTCTCTA ACATCAGTAA TTCAAAGATT AAAACTGCAG AGCAGCAGTT AAAAGATGCT
CAGACTAATG CAGATCGTAT TGCTTCTGAA GCAACTCGCC AGGCAGAAAC AGTTAAAAAG
GAAGCTGTTC TTGAGGCTAA AGAAGAGGTT CTTCAGCTTA AGCAGGCAGC AGAGGCAGAC
GAGAAGAAGC GCAAGAGCGA GCTTCGCAGC ATGGAGAATC GTATTCTTCA GCGTGAAGAA
TCTCTTGATC ACCGTACCGA CGCATTAGAA AAGCGTGAGC ATCAACTTTC TAGTCTTCAA
GGTCAGCTTG ATCGTCGTAA GAACGATTTA GACTCTCTTG TGTCTCAGCA ATCTCAAGAG
CTTGAGCGCA TTGCAGCTCT TACAAAAGAC GAAGCTCATG ATGAGCTTCT AGCTCGTGTT
CGTTCAGAAA GCGTTCGTGA TGAGGCAATG ATTCTTCGTG AGTCTGAGCA GCGCGTTCGT
GCACAGGCAG ATAAAACTGC TCGTGAGATT ATTTCTACTG CCATTCAGCG TGTTGCTGCA
GATCAGGCTT CTGAGATTAC CGTTACTTCT GTTCATATTC CATCCGATGA TTTAAAGGGT
AGGATTATTG GACGCGAAGG CCGCAACATT CGTACTTTTG AGCAAGTATC TGGTGTTTCT
CTTGTTATTG ATGACACTCC AGAGACTGTT GTTCTTTCCA GTTTTGACCC CGTCCGTCGT
GAGACTGCTC GCGTTGCTCT TGAGAATCTT ATTGCAGATG GTCGTATTCA TCCTGCACGT
ATTGAGGAGC TCTATAAGAA AGCTGAAGCC CTCGTTAACG AGCGTGTTCT TGAGGCTGGT
GAGCAGGCCG CATTTGATTG TGGTATTCAT GATCTACATC CAGAGATTGT TAAGACGCTT
GGTAAGCTTC GTTACCGCAC TTCTTATGGT CAGAATGTTC TTGCCCACTC AGTTCAGGTT
GCAGTACTTT GTGGCATTAT GGCTGAAGAG CTTGGTCTTG AACCTGCTCC AGCAAAGCGG
GCCGGTCTTC TCCATGACCT TGGTAAGGCA ATCGATCACG AAGTCGAAGG TCCACACGCT
GTAATTGGTG CAGATCTTGC TCGTCGTTAT GGTGAGCGTC CAGAGATTGT TCATGCAATT
GAAGCTCACC ACGCAGATAT TGAGCCAAAC ACCGTTCTTG ACATGCTTGT CATGGCCGCA
GATGCAATTT CTGCTGCTCG TCCTGGTGCT CGTCGTGAGT CCGCTGAAAA CTACATTAAG
CGTCTAGAGA AGCTTGAGGC AATCTCTAAT GCTCATGAGG GCGTTGAGCG TACTTACGCA
ATGCAGGCTG GCCGTGAGCT TCATGTAATG GTTGAGCCTC AAATGATTAG TGATTCCGAG
GCAACTGTTC TTGCTCATGA TATTGCTAAG CAAATTGAGG ATGAGATGGA ATATCCAGGA
CAGGTTCGCG TTGTTGTTAT TCGTGAGTCT CGTGCCGTAG ATGTAGCTAA ATAA
 
Protein sequence
MELAGVGIVC LLVGVGLAYG VLSNISNSKI KTAEQQLKDA QTNADRIASE ATRQAETVKK 
EAVLEAKEEV LQLKQAAEAD EKKRKSELRS MENRILQREE SLDHRTDALE KREHQLSSLQ
GQLDRRKNDL DSLVSQQSQE LERIAALTKD EAHDELLARV RSESVRDEAM ILRESEQRVR
AQADKTAREI ISTAIQRVAA DQASEITVTS VHIPSDDLKG RIIGREGRNI RTFEQVSGVS
LVIDDTPETV VLSSFDPVRR ETARVALENL IADGRIHPAR IEELYKKAEA LVNERVLEAG
EQAAFDCGIH DLHPEIVKTL GKLRYRTSYG QNVLAHSVQV AVLCGIMAEE LGLEPAPAKR
AGLLHDLGKA IDHEVEGPHA VIGADLARRY GERPEIVHAI EAHHADIEPN TVLDMLVMAA
DAISAARPGA RRESAENYIK RLEKLEAISN AHEGVERTYA MQAGRELHVM VEPQMISDSE
ATVLAHDIAK QIEDEMEYPG QVRVVVIRES RAVDVAK