Gene Apar_0074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0074 
Symbol 
ID8412917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp83810 
End bp85162 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content52% 
IMG OID645021641 
ProductDEAD/DEAH box helicase domain protein 
Protein accessionYP_003179101 
Protein GI257783884 
COG category[J] Translation, ribosomal structure and biogenesis
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG0513] Superfamily II DNA and RNA helicases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATAA CTTTTGCCGA GCTCGGGCTA AACGAGCAAA TTTTAGCGGG CGTTACAACG 
CTTGGGTTTA GCGTGCCCAC TCCCGTTCAA ACTGCAGCAA TTCCTGCTGT CCTGGCGGGT
AAGGATGTTG TTGCATCTGC TCAAACAGGA ACGGGCAAAA CTGCAGCGTT TATGTTGCCT
ACGCTGCAGC GTATTGCTGT CGAGAAACAC GACAAAGCCG AGAAACCTGA CGGCAAACGC
AATGCAGCCG CCGAGCGCAA CGCAGTCGCC GAGCGCAACG CCAAACGCGG CACCGGCAAA
CGCAACGCGT ATCCTCGTGC GCTCATCGTT ACACCGACGA GAGAACTTGC AGCCCAAATT
GACAACGTTG CCAAAAGCGT TTGCGCATCC ACTGGTCAGC AAGCCGTCAT TGTCACAGGT
GGCGCTCACT ACAAACACCA GATAGCCGCG CTGCAAAAAG GTTGCGACGT GCTGGTGGCA
ACACCCGGCC GTTTGATTGA TCTTCTCGAC AAGAAGCATA CAAGCCTAGA GGACATCCAG
GTGCTGGTAC TCGATGAAGC AGACCGTATG CTTGACATGG GCTTTTGGCC AAGCGTGCAC
CGCATTATGG AACAGCTTCC CAAGGCACAT CAAACGTTGC TCTTCTCGGC AACGCTCCCC
GCGTCAATTA CGTCAACCAT AGATGCGCTG CTCAAAGACC CAGAGCGTAT CGAGATTGCA
AGAACCGGAC AAACTGCAGC AACAATCGAG CAACATTTGT GCTCAGTTAC CCAGGGACAA
AAACCGCAGC TCTTGAAGGC ACTTATCGAC TCGTTTGATC CTGCGCCAGA GCGCGTTTTG
GTCTTTTGCA GGACAAAGTC GCGCGTTGAT AGCATTTATA AAAACCTCAA AGCTGCAGGT
CTGAAAGTTG ATGTTATGCA TGCGGACCGT CCGCAAAAAG CTCGCGCAAA AGCTTTAGAT
CGATTCCGCA GCGCCTCTAT TCAAATTCTT GTTGCAACCG ACGTCATGAG CCGCGGCATT
GATATCCAGG GCATTGACGT CGTCATTAAC TTTGACGTAC CTCTTGACCC CGAGGATTAC
GTTCACCGCA TTGGCCGAAC GGGCCGTGCC GGAGCCACAG GTCAGGCCTA TACGTTCATG
GGACCAGACG AGGTTACGCC GCTTAGAGAG ATTGAGTACT TCACAAAAGC GTTAGTTCCT
GCATGGGATC TACCTGGCTT TGGGTATGAA ACAGGACGTA TTATTTTGCA GGCGTCTCGT
TCTACTTCCA AAACTACTCG TTCCATGTTT TCTGGCTCAA GAGCACGCGG AAGAAACTTT
GGTTTTAGCG GAAGATATGG ACGCCACACA TAA
 
Protein sequence
MEITFAELGL NEQILAGVTT LGFSVPTPVQ TAAIPAVLAG KDVVASAQTG TGKTAAFMLP 
TLQRIAVEKH DKAEKPDGKR NAAAERNAVA ERNAKRGTGK RNAYPRALIV TPTRELAAQI
DNVAKSVCAS TGQQAVIVTG GAHYKHQIAA LQKGCDVLVA TPGRLIDLLD KKHTSLEDIQ
VLVLDEADRM LDMGFWPSVH RIMEQLPKAH QTLLFSATLP ASITSTIDAL LKDPERIEIA
RTGQTAATIE QHLCSVTQGQ KPQLLKALID SFDPAPERVL VFCRTKSRVD SIYKNLKAAG
LKVDVMHADR PQKARAKALD RFRSASIQIL VATDVMSRGI DIQGIDVVIN FDVPLDPEDY
VHRIGRTGRA GATGQAYTFM GPDEVTPLRE IEYFTKALVP AWDLPGFGYE TGRIILQASR
STSKTTRSMF SGSRARGRNF GFSGRYGRHT