Gene Apre_1421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1421 
Symbol 
ID8398231 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1535276 
End bp1536403 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content41% 
IMG OID644995786 
Productputative RNA methylase 
Protein accessionYP_003153165 
Protein GI257066909 
COG category[L] Replication, recombination and repair 
COG ID[COG0116] Predicted N6-adenine-specific DNA methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAAAAA TTATAATTAC AACAAGTTTT GGCTTGGAGG CTCTAGTCAA GAGGGAGCTT 
ATTGATTTAG GCTTCGAGGA TTTTTCTGTA AGCGACGGGA TGATTACTCT TTCAGGCGAA
CTTTCTGACA TAGGAAAACT AAATATTAAT CTTAGATGTG CAGATAGGGT CTATCTTGTC
CTAGATGAGT TTAAGGCGAC TTCTTTTGAT GAGCTTTTTG AAAATATTAA AAGGATCAAC
TGGACTGATT ATCTGCGAAA GGAAAGTAAC TTTATAGTAA ACGCCAGGAC CTATAAGTCC
AAGCTTTTCG CCCTAAGGTC AATCCAATCT ATCACAGAAA AGGCAATTAT CGATTCTTTG
AGGAAAAAAT TTAAGATTTC GACCTTCCCT AAATCAGGGG AGAGGGTAGG AATTGAAGTC
ATGGTCAATA GAAATATTGC TACAGTTACA ATCGATACAT CAGGTGATGG CCTTCACAAG
AGAGGCTATA GGGAGGATTC TGTCAAGGCT CCCCTTAGGG AAAATCTTGC GGCAGCCCTA
GTAGATCTTT CTTTCTATAA TCCTGATAGG TTCCTCCTTG ATCCCTTCTG CGGGTCGGGG
ACAATCCTAA TCGAGGCAGC GAGGAAGGCT CGTAACATAG CACCAGGTAT TGATAGGGAC
TTCGACTTTA GGCACTTCGT CTTTATGGAC AAATCTATCT ACGAGAATGA AAAGAAAGAA
GCTTTGGGAA GGATAGATTA TTCTACTAAG CTTCATATCC TAGGCTCTGA TATATCTGGT
AGGGCCATAA GCCTTGCCAA GAACAACGCC CTAAACGCTG GTGTTGAGGA AGACATAGCC
TTTGTCAAAA GAGATATAGG TTCTGTTGCC GTATCAAGGG ACGACTACGG TGTATTGATA
GCAAATCCTC CCTACGGTCT GAGATTATCA GATATGGATT TGGGAGAAAT TTATAAGAAG
ATAAATAATA AGTTTATGAA GCTTGACACC TGGTCCTTGT ATTTTGTAAC AGCTGATGAG
AAATTCGATA GAAACTTTAA AAGGAAGCTT TCCAAGAAGA GAAAGCTCTA CAACGGCGGT
GAGAAGGTAG ATTACTACCA GTATTTTGGC CCAAGGCCAA AGAATTAG
 
Protein sequence
MEKIIITTSF GLEALVKREL IDLGFEDFSV SDGMITLSGE LSDIGKLNIN LRCADRVYLV 
LDEFKATSFD ELFENIKRIN WTDYLRKESN FIVNARTYKS KLFALRSIQS ITEKAIIDSL
RKKFKISTFP KSGERVGIEV MVNRNIATVT IDTSGDGLHK RGYREDSVKA PLRENLAAAL
VDLSFYNPDR FLLDPFCGSG TILIEAARKA RNIAPGIDRD FDFRHFVFMD KSIYENEKKE
ALGRIDYSTK LHILGSDISG RAISLAKNNA LNAGVEEDIA FVKRDIGSVA VSRDDYGVLI
ANPPYGLRLS DMDLGEIYKK INNKFMKLDT WSLYFVTADE KFDRNFKRKL SKKRKLYNGG
EKVDYYQYFG PRPKN