Gene Apar_0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0785 
Symbol 
ID8413650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp863540 
End bp865045 
Gene Length1506 bp 
Protein Length501 aa 
Translation table11 
GC content47% 
IMG OID645022367 
Producttranscriptional regulator, XRE family 
Protein accessionYP_003179805 
Protein GI257784588 
COG category[S] Function unknown 
COG ID[COG1426] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0234796 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.197603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACGCC CTCGTTTTAG TGAGATGCTC GTAGAACGTC GACGTCAACT CGGACTTTCT 
ATCGCTCAAG CATCAAAGAT TCTTCGTCTT AAGGAGCAGG CACTCATTGC TTTTGAGGAA
GGTGATTTCA AGAACATGCC TCAGAGTGGC TATGCTCAAG GCATGCTTTC TTCGTATGCT
CGTTATTTGG GATTAAATCC GCGTGAGATT GTAGATTTGT TTCAAGAAGA GAGCTACGAG
TTTGAAAATG GCACTTATTC TCATGAGTTG CGTCGTCGCA CCCGTGATAC CCAGTCTGGC
CGTGGAATTG CAGGATATGA CGTTATAAAT GAGTCTGAAA GTAGGCCAAA GGCATACGTA
CAGTATCACG GTCTTCTCCC TACATCGGGA GGACCAGCTG GCGATATGGG TGCATTTGCA
ACTACGTCCG GTGTTCGCTC TAGACAAACG GGTGTACCTT TGGGCGCTCA GGCATCTGAA
GAGGATGCCG GCGGCTACTC GTATTCTCGC TATGCCACCG GTCATGCCTA CAATGCAGGT
ATTGAAGAAG CCTCTCGTCA GAAGACTATG GCAAGTCGTA CGCGCGGTGG CGTTCAACGT
AGGCGCGTTG GTATTGCTGG CGTAAGTATG CCAGCTTATA AAGACGATGT AACTAGAAGA
ACTGTTGCTC CAAGTGATTA CACTGATGAC CTTCGCTATG ACAACGTAAC CGCTCCTTAT
GAGCGCGCTT CAACTATCTC TGGTCGTCGT GGTTCTAGAA ATATTGCACA GGTTGACAGG
CCTAACGTTC GCCGTCGTCA GTCTTCTAGC ACAGGTAATC AGCAGCGTCG TGCTCCTCAA
AGATCAGGCG TTATGGGTGC ACTTCAGAAC TACTTCTCTG ATACAGGTCG TACTGTTGCA
ACTGTGTTTG CTCTTGTTGT AATTATCATC ACCGCTGTTT TACTGTATAG CGTTCGCTCT
TGTATGACTG CTCGAACTAC GCCTACTACT ACAAAGACGG TAAGCGTTAA TAATTCAACA
AATGATTCTT CTAAGAATTC CACTACCACA AATAGCAATA CGGATACTTC TAAGAAGGAT
TCTACAGATG CTTCTAAGAG CACCACTACA ACCGAGTCTA AAAAAGAAGA GACTCCTAAA
CAGACTACCG TTGTAGTTAC TGTTGCAGAT GGTCAGACCA GCTGGGTTGA GATTACTGTT
GACGGCAAGA GCGTCGAGGC TGATGCAATT ACTGGTCCAT GGTCTCAGAC GTATACCGTT
ACGGATTCCA TGAGCGTTCT TGCAGGCACT CCTGGTGCTG TTGCTGTTAC CGTTAACGGT
ACTGCTAAAC CATTTGATGC AAATGCTTCT GGTATTGGTA CCTTGACCAT TCAGGGTACT
AAGTCTGCTG ATGCTAATGG AACTAACGCA TCTGGAGCTC AAACAAACAG CACGACATCA
AGTACTACCA GTGGAACTTC TGGCACAACT AATAATCAGA ATTCCAACAG TAATACAAGC
AACTAA
 
Protein sequence
MPRPRFSEML VERRRQLGLS IAQASKILRL KEQALIAFEE GDFKNMPQSG YAQGMLSSYA 
RYLGLNPREI VDLFQEESYE FENGTYSHEL RRRTRDTQSG RGIAGYDVIN ESESRPKAYV
QYHGLLPTSG GPAGDMGAFA TTSGVRSRQT GVPLGAQASE EDAGGYSYSR YATGHAYNAG
IEEASRQKTM ASRTRGGVQR RRVGIAGVSM PAYKDDVTRR TVAPSDYTDD LRYDNVTAPY
ERASTISGRR GSRNIAQVDR PNVRRRQSSS TGNQQRRAPQ RSGVMGALQN YFSDTGRTVA
TVFALVVIII TAVLLYSVRS CMTARTTPTT TKTVSVNNST NDSSKNSTTT NSNTDTSKKD
STDASKSTTT TESKKEETPK QTTVVVTVAD GQTSWVEITV DGKSVEADAI TGPWSQTYTV
TDSMSVLAGT PGAVAVTVNG TAKPFDANAS GIGTLTIQGT KSADANGTNA SGAQTNSTTS
STTSGTSGTT NNQNSNSNTS N