Gene Apar_1014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1014 
Symbol 
ID8413886 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1147246 
End bp1149528 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content52% 
IMG OID645022603 
ProductTHUMP domain protein 
Protein accessionYP_003180034 
Protein GI257784817 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.520443 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAAG AACTTGAGTT CTTTGCAACA TGCCCAAAGG GCTTTGAGAA GCTTCTTTCA 
CAGGAGCTAG AAAGCCTGCA CATCAAAAAG GTGCGCCCTC TTGGCGGACA AGTTGCGTTT
TATGGCTTGC TTGCCGATGC TTACCGTGTC TGTCTTTGGT CTCGTCTCGC GTCTCGCGTC
ATTTTAGTGC TTGATCGCAT TGAAGCCGCA ACTTCAGACA CTCTCTACGA AGAACTCTCC
CACCTTGCTT GGGAAGACCA TATTGGCCCA CGTGCCACTA TTGCAGTTAA TGCTCACGGC
ACAAATGACC AACTCAGAAA CACAAACTTT ATTGCTGTCA GAACAAAAGA CGCCATTGTA
GACAGACTTG CAGCTAAGCG CGGAAGCCGT CCTATGGTCA ACACGCTTGC GCCTGACGTA
ACTATTGTCG TGCGCATATC ACGTAACCGT GCCACGGTTG GCATCGATCT TGCTGGAGAA
GCGCTCTTTA AGCGCAGTCT AACTGGCGGT CGAGGGCCGG CACGTGAGTT TGCGCCGCTC
AGGCTCGACT ACGCTGCTGC CCTTCTGTAT CTACAGGCGC AAACTGCTAC GAGTGCATCG
GGTTTCTCGC CAGACGCGCT GCTGCCGGCG CTGCTGTTCC CAGGAGCAGG CGCGCTAGCG
CAAGAGGCTG CGGGCATGGC GCTAGACGTG GCTCCGGGCA TCTTGCGCGC TCGCTGGGGC
ATGACGGGCT GGGCCGGTCA CAACGACGAT GTCTGGCAAG ATTTGCTTGC CGAGGCCGAC
GAACGTGCCG AGAAAGGCCA AGAGCGCCAA ATAACACTCT ATGCAGCGGA TTCTCGACCA
AAGGCCAAGG AGGCTGTGCT TTACACGTTG CGTGCAGGCA GTCTCAAAGC AGACGTACAG
TTCTTGGCAG CATCTGAACT CCTCAAACAC GCTGAGCACT TCACAGGAGT TGTTGCAGAC
CTCTCGTGGA CCAAAGAGGA GCCTACACTC CAGGGCTCTG CTTACGCCAC GCTTGGACTT
TTTGCGGGAC AGGCAAGCAC ACTGCTCACC AGCGATACAA ATACCGATAC GGTACTCAGG
GCTACTCCTA CGCAAACGCT TTCCGTCTAC GTAGGTAACT CCATTGCCAC CATACGCTCC
TACCCTGCTG CAAATGCAGA GGGAGCCGAT AGCAGCGCTA CTAATGCACC AGTAAGCAGC
TCTAAGCCTA CATCAGTTCC CGCAGGTCCT ACCGTTATGG TCAACAATCA GCCGGTAAGC
GTGCTGGTTC CTGCCTCAGA CCAGTTTGCT GCACGTCTTA CTAAAGTAGC TAAGCAACGC
GCTAAATGGG CTCGTAAAAA CGATGTCTCA TGCTACCGTG TATACGATGC TGATCTGCCC
GATTATGCCG TTAGTATTGA CATTTACAAG GGTGCTACAA AACCAACTAC CTGGCTACAA
ATCTCTGAGT ACGCTGCTTC TAAAGAAATT GATCCAGACC TTGCAAAGCG TCGCCTTTTA
GATGTCCTGG CTCTTGCTCC CCGCATTCTA GGTGTACCCA GCTCAAACGT GAACCTAAGA
ACAAGAACAC GAGCAAAGGG CGGCTCCCAG TACTCTAACG AGGGCAGTGC AACAGACAAT
TCAAGAAAAG AAATGCTGCT TATCGACGAG GGTGGTCTGC TCTTTGAGGT CAACTTTGCT
TCCAGGCTGG ACTGCGGAAT CTTCCTAGAT CACCGTGATA CGCGCGCGGA GATTCGTGAG
CTCATGAAAA GAGCTGGTAC TGCCAAGAGT TTCCTCAACT TGTTTGCCTA TACGGGCACC
GCTACCTGTT ATGCAGCAGA CGGTGACGCG CTCCACACCA CCACTGTTGA CCTCTCCAAA
CCTTCGCTTG AGTGGGCTAA GCGCAACATG AAACGTAACG GTTTTGGTGG CGAAGACCAT
GAATTTGTCC AAGCAGACGT CTTATCTTGG ATTACCGAAA TGCGTCACAC CAAAAACCGC
TGGAACGTTA TCTTCTGCGA CGTTCCAACC TTCTCCAACT CATCACGCAT GAAGCAAAGT
TCATTTGATG TCCAAAGAGA CCATGCTGAG CTCATTATTG GTATTTCTCG CCTTCTGACT
CATGGTGGCG TAGCTATTTT CTCATGCAAC CTACGTACTT TTAAACCAGA TGTTGAGAAA
ATCGAGCGAG CTGGCGTAGT CATTGAAGAT ATAACTAGCA AAACTATTCC GGAGGACTTC
TCGAGAAATC AAAAAATTCA TCACGCATAT AAAATCTCGA GAAAACCGCG GGAAAACGGC
TAA
 
Protein sequence
MSQELEFFAT CPKGFEKLLS QELESLHIKK VRPLGGQVAF YGLLADAYRV CLWSRLASRV 
ILVLDRIEAA TSDTLYEELS HLAWEDHIGP RATIAVNAHG TNDQLRNTNF IAVRTKDAIV
DRLAAKRGSR PMVNTLAPDV TIVVRISRNR ATVGIDLAGE ALFKRSLTGG RGPAREFAPL
RLDYAAALLY LQAQTATSAS GFSPDALLPA LLFPGAGALA QEAAGMALDV APGILRARWG
MTGWAGHNDD VWQDLLAEAD ERAEKGQERQ ITLYAADSRP KAKEAVLYTL RAGSLKADVQ
FLAASELLKH AEHFTGVVAD LSWTKEEPTL QGSAYATLGL FAGQASTLLT SDTNTDTVLR
ATPTQTLSVY VGNSIATIRS YPAANAEGAD SSATNAPVSS SKPTSVPAGP TVMVNNQPVS
VLVPASDQFA ARLTKVAKQR AKWARKNDVS CYRVYDADLP DYAVSIDIYK GATKPTTWLQ
ISEYAASKEI DPDLAKRRLL DVLALAPRIL GVPSSNVNLR TRTRAKGGSQ YSNEGSATDN
SRKEMLLIDE GGLLFEVNFA SRLDCGIFLD HRDTRAEIRE LMKRAGTAKS FLNLFAYTGT
ATCYAADGDA LHTTTVDLSK PSLEWAKRNM KRNGFGGEDH EFVQADVLSW ITEMRHTKNR
WNVIFCDVPT FSNSSRMKQS SFDVQRDHAE LIIGISRLLT HGGVAIFSCN LRTFKPDVEK
IERAGVVIED ITSKTIPEDF SRNQKIHHAY KISRKPRENG