Gene Apre_1837 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1837 
Symbol 
ID8368744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013164 
Strand
Start bp100440 
End bp101996 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content27% 
IMG OID644984760 
ProductRadical SAM domain protein 
Protein accessionYP_003142411 
Protein GI256821212 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.654863 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTTAA TAACCCTAGA AAAAGAAAAG TTAGAAAATA AGTACAAAGA GTTATGTAAG 
GAAGAATATC CAGGGAGATT AGGCAAAACA TTTAAAACAA AATATAATTT TTACTATTAT
GATAGTGGTA CTGGTAAAGT AGCACAAATA AATAAAAATG TGTATAAAGT CTTAACTAAA
TTTTTAGAGA GCGAAAATTT CTTGGATTTT ATAAAACTTG ATATGTCTGA ACAAGAATTT
TGTGAGGCTA TAAGTGAAAT AAAAGATGCG ATTAATAAAG AAAGTATACT CTCAGCAACT
AAATTTAATT GTTTGACTGG AAAAACTTAT GAACAAATTG ATGAGATAAT AGATAATAAG
ATACAAAATG TTACCTTAGA GGTTACAGAG AAATGTAATT TAAGATGTAA ATATTGCATC
TATAATGAAT CTCATCCTGA ATACAGAGCT TTTGGTCATA AGAATATGGA CTGGGAAGTA
GCAAAAAAAG CTGTTGATTT TTTAAAAGCT CATTCACAAA ATTCTGATGA ACGTCATATT
GGATTTTATG GTGGAGAACC ATTAATAAAC TATGATCTTA TAAAGAAAAC AACAGATTAT
GCGAATAAGT TATTTGATAA AATGACTTAT TCTATGACAA CAAATGCTAC TTTAATGAAT
GAAGAAATTG CTGATTATAT TATGAAGAAT AAATTCAATA TTATAGTAAG TTTAGATGGA
TATAAAGAGC TTCATAACAA AAATAGATTG TTTGTTTCTG GGGAAGGAAG CTTTGAAAAT
ACTATTAGGG GATTAAAAAT TCTATTAAAA TCAGCGGAAA AATATAATAA TAAAGAAAGT
ATTATCTTAA ATATGGTAAT CGAAGGACCT GATTATGAAG ATCAGTATGA TAAAATACAA
TTTTATTTAA ATGAATGCGA TTGGTTGCCT AAAAATATAA ATATATTAAC ATCTTCTATA
GATTATGGAC CACATGAAAG TATATATACA AGACCACAAT CTTATGAAGA AAGAATGGTT
CTAAAAGATT ATTACGATCC AATCTTATCG TGGGATAAAA AAAATAAAAT AAGGAACAAG
GATAATACAA ATGTCCTATT TACAGATGCT GATGTAGACA AAGCTATGAT GATTATACAC
AAAAGATTAT TATCTGAAAA ACCTGTTAAA AAATATGGGA TGAATGGTTG TTGTGTCCCT
GGAGAAAGAC GAATATATGT AACAGTTGAC GGAAACTTCA AAATTTGTGA AAAGGTAGGG
GATATTCCAG AGATAGGAAA TGTAGACAAA GGATTTGATA AAAAAAGAAT TAAAGAATTA
TATTTTGATG ATTTTATTAA AGAAGCTAAC AAATATTGCA AAGACTGCTG GGCAATTAAT
TTATGCACTC TATGCTATGT AAATTGTTAT GATAAAAATG GGATGCACTT TGATTATAGA
CACAATTCTT GTAGAAGTGA AAGAAATTAT TTGTTAGGTA GCTTGATAAA ATATCATGAG
ATATTGGAAG AAAATCCAGA TGTACTTGAG GAATTTAACG AAATTGAGTT TCAATAA
 
Protein sequence
MDLITLEKEK LENKYKELCK EEYPGRLGKT FKTKYNFYYY DSGTGKVAQI NKNVYKVLTK 
FLESENFLDF IKLDMSEQEF CEAISEIKDA INKESILSAT KFNCLTGKTY EQIDEIIDNK
IQNVTLEVTE KCNLRCKYCI YNESHPEYRA FGHKNMDWEV AKKAVDFLKA HSQNSDERHI
GFYGGEPLIN YDLIKKTTDY ANKLFDKMTY SMTTNATLMN EEIADYIMKN KFNIIVSLDG
YKELHNKNRL FVSGEGSFEN TIRGLKILLK SAEKYNNKES IILNMVIEGP DYEDQYDKIQ
FYLNECDWLP KNINILTSSI DYGPHESIYT RPQSYEERMV LKDYYDPILS WDKKNKIRNK
DNTNVLFTDA DVDKAMMIIH KRLLSEKPVK KYGMNGCCVP GERRIYVTVD GNFKICEKVG
DIPEIGNVDK GFDKKRIKEL YFDDFIKEAN KYCKDCWAIN LCTLCYVNCY DKNGMHFDYR
HNSCRSERNY LLGSLIKYHE ILEENPDVLE EFNEIEFQ