Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Apar_1014 |
Symbol | |
ID | 8413886 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Atopobium parvulum DSM 20469 |
Kingdom | Bacteria |
Replicon accession | NC_013203 |
Strand | + |
Start bp | 1147246 |
End bp | 1149528 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 645022603 |
Product | THUMP domain protein |
Protein accession | YP_003180034 |
Protein GI | 257784817 |
COG category | [R] General function prediction only |
COG ID | [COG1092] Predicted SAM-dependent methyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.520443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAAG AACTTGAGTT CTTTGCAACA TGCCCAAAGG GCTTTGAGAA GCTTCTTTCA CAGGAGCTAG AAAGCCTGCA CATCAAAAAG GTGCGCCCTC TTGGCGGACA AGTTGCGTTT TATGGCTTGC TTGCCGATGC TTACCGTGTC TGTCTTTGGT CTCGTCTCGC GTCTCGCGTC ATTTTAGTGC TTGATCGCAT TGAAGCCGCA ACTTCAGACA CTCTCTACGA AGAACTCTCC CACCTTGCTT GGGAAGACCA TATTGGCCCA CGTGCCACTA TTGCAGTTAA TGCTCACGGC ACAAATGACC AACTCAGAAA CACAAACTTT ATTGCTGTCA GAACAAAAGA CGCCATTGTA GACAGACTTG CAGCTAAGCG CGGAAGCCGT CCTATGGTCA ACACGCTTGC GCCTGACGTA ACTATTGTCG TGCGCATATC ACGTAACCGT GCCACGGTTG GCATCGATCT TGCTGGAGAA GCGCTCTTTA AGCGCAGTCT AACTGGCGGT CGAGGGCCGG CACGTGAGTT TGCGCCGCTC AGGCTCGACT ACGCTGCTGC CCTTCTGTAT CTACAGGCGC AAACTGCTAC GAGTGCATCG GGTTTCTCGC CAGACGCGCT GCTGCCGGCG CTGCTGTTCC CAGGAGCAGG CGCGCTAGCG CAAGAGGCTG CGGGCATGGC GCTAGACGTG GCTCCGGGCA TCTTGCGCGC TCGCTGGGGC ATGACGGGCT GGGCCGGTCA CAACGACGAT GTCTGGCAAG ATTTGCTTGC CGAGGCCGAC GAACGTGCCG AGAAAGGCCA AGAGCGCCAA ATAACACTCT ATGCAGCGGA TTCTCGACCA AAGGCCAAGG AGGCTGTGCT TTACACGTTG CGTGCAGGCA GTCTCAAAGC AGACGTACAG TTCTTGGCAG CATCTGAACT CCTCAAACAC GCTGAGCACT TCACAGGAGT TGTTGCAGAC CTCTCGTGGA CCAAAGAGGA GCCTACACTC CAGGGCTCTG CTTACGCCAC GCTTGGACTT TTTGCGGGAC AGGCAAGCAC ACTGCTCACC AGCGATACAA ATACCGATAC GGTACTCAGG GCTACTCCTA CGCAAACGCT TTCCGTCTAC GTAGGTAACT CCATTGCCAC CATACGCTCC TACCCTGCTG CAAATGCAGA GGGAGCCGAT AGCAGCGCTA CTAATGCACC AGTAAGCAGC TCTAAGCCTA CATCAGTTCC CGCAGGTCCT ACCGTTATGG TCAACAATCA GCCGGTAAGC GTGCTGGTTC CTGCCTCAGA CCAGTTTGCT GCACGTCTTA CTAAAGTAGC TAAGCAACGC GCTAAATGGG CTCGTAAAAA CGATGTCTCA TGCTACCGTG TATACGATGC TGATCTGCCC GATTATGCCG TTAGTATTGA CATTTACAAG GGTGCTACAA AACCAACTAC CTGGCTACAA ATCTCTGAGT ACGCTGCTTC TAAAGAAATT GATCCAGACC TTGCAAAGCG TCGCCTTTTA GATGTCCTGG CTCTTGCTCC CCGCATTCTA GGTGTACCCA GCTCAAACGT GAACCTAAGA ACAAGAACAC GAGCAAAGGG CGGCTCCCAG TACTCTAACG AGGGCAGTGC AACAGACAAT TCAAGAAAAG AAATGCTGCT TATCGACGAG GGTGGTCTGC TCTTTGAGGT CAACTTTGCT TCCAGGCTGG ACTGCGGAAT CTTCCTAGAT CACCGTGATA CGCGCGCGGA GATTCGTGAG CTCATGAAAA GAGCTGGTAC TGCCAAGAGT TTCCTCAACT TGTTTGCCTA TACGGGCACC GCTACCTGTT ATGCAGCAGA CGGTGACGCG CTCCACACCA CCACTGTTGA CCTCTCCAAA CCTTCGCTTG AGTGGGCTAA GCGCAACATG AAACGTAACG GTTTTGGTGG CGAAGACCAT GAATTTGTCC AAGCAGACGT CTTATCTTGG ATTACCGAAA TGCGTCACAC CAAAAACCGC TGGAACGTTA TCTTCTGCGA CGTTCCAACC TTCTCCAACT CATCACGCAT GAAGCAAAGT TCATTTGATG TCCAAAGAGA CCATGCTGAG CTCATTATTG GTATTTCTCG CCTTCTGACT CATGGTGGCG TAGCTATTTT CTCATGCAAC CTACGTACTT TTAAACCAGA TGTTGAGAAA ATCGAGCGAG CTGGCGTAGT CATTGAAGAT ATAACTAGCA AAACTATTCC GGAGGACTTC TCGAGAAATC AAAAAATTCA TCACGCATAT AAAATCTCGA GAAAACCGCG GGAAAACGGC TAA
|
Protein sequence | MSQELEFFAT CPKGFEKLLS QELESLHIKK VRPLGGQVAF YGLLADAYRV CLWSRLASRV ILVLDRIEAA TSDTLYEELS HLAWEDHIGP RATIAVNAHG TNDQLRNTNF IAVRTKDAIV DRLAAKRGSR PMVNTLAPDV TIVVRISRNR ATVGIDLAGE ALFKRSLTGG RGPAREFAPL RLDYAAALLY LQAQTATSAS GFSPDALLPA LLFPGAGALA QEAAGMALDV APGILRARWG MTGWAGHNDD VWQDLLAEAD ERAEKGQERQ ITLYAADSRP KAKEAVLYTL RAGSLKADVQ FLAASELLKH AEHFTGVVAD LSWTKEEPTL QGSAYATLGL FAGQASTLLT SDTNTDTVLR ATPTQTLSVY VGNSIATIRS YPAANAEGAD SSATNAPVSS SKPTSVPAGP TVMVNNQPVS VLVPASDQFA ARLTKVAKQR AKWARKNDVS CYRVYDADLP DYAVSIDIYK GATKPTTWLQ ISEYAASKEI DPDLAKRRLL DVLALAPRIL GVPSSNVNLR TRTRAKGGSQ YSNEGSATDN SRKEMLLIDE GGLLFEVNFA SRLDCGIFLD HRDTRAEIRE LMKRAGTAKS FLNLFAYTGT ATCYAADGDA LHTTTVDLSK PSLEWAKRNM KRNGFGGEDH EFVQADVLSW ITEMRHTKNR WNVIFCDVPT FSNSSRMKQS SFDVQRDHAE LIIGISRLLT HGGVAIFSCN LRTFKPDVEK IERAGVVIED ITSKTIPEDF SRNQKIHHAY KISRKPRENG
|
| |