Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0785 |
Symbol | |
ID | 8413650 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | - |
Start bp | 863540 |
End bp | 865045 |
Gene Length | 1506 bp |
Protein Length | 501 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 645022367 |
Product | transcriptional regulator, XRE family |
Protein accession | YP_003179805 |
Protein GI | 257784588 |
COG category | [S] Function unknown |
COG ID | [COG1426] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0234796 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.197603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACGCC CTCGTTTTAG TGAGATGCTC GTAGAACGTC GACGTCAACT CGGACTTTCT ATCGCTCAAG CATCAAAGAT TCTTCGTCTT AAGGAGCAGG CACTCATTGC TTTTGAGGAA GGTGATTTCA AGAACATGCC TCAGAGTGGC TATGCTCAAG GCATGCTTTC TTCGTATGCT CGTTATTTGG GATTAAATCC GCGTGAGATT GTAGATTTGT TTCAAGAAGA GAGCTACGAG TTTGAAAATG GCACTTATTC TCATGAGTTG CGTCGTCGCA CCCGTGATAC CCAGTCTGGC CGTGGAATTG CAGGATATGA CGTTATAAAT GAGTCTGAAA GTAGGCCAAA GGCATACGTA CAGTATCACG GTCTTCTCCC TACATCGGGA GGACCAGCTG GCGATATGGG TGCATTTGCA ACTACGTCCG GTGTTCGCTC TAGACAAACG GGTGTACCTT TGGGCGCTCA GGCATCTGAA GAGGATGCCG GCGGCTACTC GTATTCTCGC TATGCCACCG GTCATGCCTA CAATGCAGGT ATTGAAGAAG CCTCTCGTCA GAAGACTATG GCAAGTCGTA CGCGCGGTGG CGTTCAACGT AGGCGCGTTG GTATTGCTGG CGTAAGTATG CCAGCTTATA AAGACGATGT AACTAGAAGA ACTGTTGCTC CAAGTGATTA CACTGATGAC CTTCGCTATG ACAACGTAAC CGCTCCTTAT GAGCGCGCTT CAACTATCTC TGGTCGTCGT GGTTCTAGAA ATATTGCACA GGTTGACAGG CCTAACGTTC GCCGTCGTCA GTCTTCTAGC ACAGGTAATC AGCAGCGTCG TGCTCCTCAA AGATCAGGCG TTATGGGTGC ACTTCAGAAC TACTTCTCTG ATACAGGTCG TACTGTTGCA ACTGTGTTTG CTCTTGTTGT AATTATCATC ACCGCTGTTT TACTGTATAG CGTTCGCTCT TGTATGACTG CTCGAACTAC GCCTACTACT ACAAAGACGG TAAGCGTTAA TAATTCAACA AATGATTCTT CTAAGAATTC CACTACCACA AATAGCAATA CGGATACTTC TAAGAAGGAT TCTACAGATG CTTCTAAGAG CACCACTACA ACCGAGTCTA AAAAAGAAGA GACTCCTAAA CAGACTACCG TTGTAGTTAC TGTTGCAGAT GGTCAGACCA GCTGGGTTGA GATTACTGTT GACGGCAAGA GCGTCGAGGC TGATGCAATT ACTGGTCCAT GGTCTCAGAC GTATACCGTT ACGGATTCCA TGAGCGTTCT TGCAGGCACT CCTGGTGCTG TTGCTGTTAC CGTTAACGGT ACTGCTAAAC CATTTGATGC AAATGCTTCT GGTATTGGTA CCTTGACCAT TCAGGGTACT AAGTCTGCTG ATGCTAATGG AACTAACGCA TCTGGAGCTC AAACAAACAG CACGACATCA AGTACTACCA GTGGAACTTC TGGCACAACT AATAATCAGA ATTCCAACAG TAATACAAGC AACTAA
|
Protein sequence | MPRPRFSEML VERRRQLGLS IAQASKILRL KEQALIAFEE GDFKNMPQSG YAQGMLSSYA RYLGLNPREI VDLFQEESYE FENGTYSHEL RRRTRDTQSG RGIAGYDVIN ESESRPKAYV QYHGLLPTSG GPAGDMGAFA TTSGVRSRQT GVPLGAQASE EDAGGYSYSR YATGHAYNAG IEEASRQKTM ASRTRGGVQR RRVGIAGVSM PAYKDDVTRR TVAPSDYTDD LRYDNVTAPY ERASTISGRR GSRNIAQVDR PNVRRRQSSS TGNQQRRAPQ RSGVMGALQN YFSDTGRTVA TVFALVVIII TAVLLYSVRS CMTARTTPTT TKTVSVNNST NDSSKNSTTT NSNTDTSKKD STDASKSTTT TESKKEETPK QTTVVVTVAD GQTSWVEITV DGKSVEADAI TGPWSQTYTV TDSMSVLAGT PGAVAVTVNG TAKPFDANAS GIGTLTIQGT KSADANGTNA SGAQTNSTTS STTSGTSGTT NNQNSNSNTS N
|
| |