Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_0546 |
Symbol | |
ID | 8413400 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 632060 |
End bp | 633271 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 645022119 |
Product | integrase family protein |
Protein accession | YP_003179568 |
Protein GI | 257784351 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG4974] Site-specific recombinase XerD |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.849888 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000145945 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAATATTG CAGTCAAAAG ACATGGAACT TTCTGGCAAG CTCGTGTACG TTTTCGTGGA GCTGATGGAA CCATCCAAGA AAAAAGTAAA TCTCTCGGAA TTCCCTGTGC GGCTGGAAGG GGCAAAAAAG TTGCTCGAGC AGCTGCTGAG AAATGGGTTC AAGATGCAGG GTTTGTTGAG GTTGTCGAAC AGAATCAAGC AACAAGGCTT GATTGTTCGG CATATACGTA TTGTCTTAAT TACTTTAAGA GCCTTGTTGC CACACAACAA ATTGAACGTC GTACTTATAC GTCTTATAAG AATAACGTCC GATATATTGA TTTGTTCTTT GGAGAAAAGC GACTACAAGA GATTACTGTC ACGGATGTCG AGTTATATGT GTCTTGGCTT TATGATTCTG GCTATGCAGC TAATACGGTT AAGAAAGCAT TTAACTCCTT CCGTGCTTGT ACGCGCCATG CTGTAGCAAT TAGAGATCTG CAATACGATC CATGTGCGGC AATTAAAGCG CCAAAAGGTT ATCTTGCACC GCCAAATCCG CTAAACGAAC CTTCACGCAA GAAGCTTCAA GTCATGCTTT CTGCTCTTGA GCTTTCTCCG ATGGTACTTG CGACATACCT GGCATATTAC ACAGGTATGA GACGTGAGGA GTGTTGTGGA CTTCAATGGA AGGACGTGAA GTTTAAAGCT GAAGATGTCA CAGCACATCT CTGTCGAGCC ATCTCATACG ATGGCGGCAA AACCTACATT AAGGGTTTAA AAAATGGTAA AGATAGAACG GTACCTATTC CAGCTCCGCT TGTAGACATT CTTAAGCAGT GGCGTTCTAA ATACATTGAA GATTGTATGT TGATGGGAAT TGCGTTTAGT GAAGAGATGT ACGTTCTAGG TGACTTCTCT GGCGAATATC TTAGACCAGA GCGAGCGACT GCATGGTGGA AGAGTCACTC TGAGGAGTGG GGTCTTCTTG GAACGCAGGG GAGAAGACCA GTCTTTCATG ATCTGCGACA CACATATGCA ACGATTGCAG TTAGAACTAT GGACATTAAG AGCGCACAAG ACATTCTTGG ACACAGCGAT ATTAATATGA CAATGCGTTA TGCAGATACA GACTTGGAGC AAATTCAAAA GGCAGGGAAA ATTATTGGAG AGGCTCTTAA CGATGCTCAC AAAGACGGTG CAGAAGTACT ACAACTTAGG CGAGCGATAT AA
|
Protein sequence | MNIAVKRHGT FWQARVRFRG ADGTIQEKSK SLGIPCAAGR GKKVARAAAE KWVQDAGFVE VVEQNQATRL DCSAYTYCLN YFKSLVATQQ IERRTYTSYK NNVRYIDLFF GEKRLQEITV TDVELYVSWL YDSGYAANTV KKAFNSFRAC TRHAVAIRDL QYDPCAAIKA PKGYLAPPNP LNEPSRKKLQ VMLSALELSP MVLATYLAYY TGMRREECCG LQWKDVKFKA EDVTAHLCRA ISYDGGKTYI KGLKNGKDRT VPIPAPLVDI LKQWRSKYIE DCMLMGIAFS EEMYVLGDFS GEYLRPERAT AWWKSHSEEW GLLGTQGRRP VFHDLRHTYA TIAVRTMDIK SAQDILGHSD INMTMRYADT DLEQIQKAGK IIGEALNDAH KDGAEVLQLR RAI
|
| |