Gene Apre_1754 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1754 
Symbol 
ID8368669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013164 
Strand
Start bp12796 
End bp14139 
Gene Length1344 bp 
Protein Length447 aa 
Translation table11 
GC content26% 
IMG OID644984685 
ProductRadical SAM domain protein 
Protein accessionYP_003142336 
Protein GI256821137 
COG category[R] General function prediction only 
COG ID[COG0641] Arylsulfatase regulator (Fe-S oxidoreductase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0305986 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTACAT CAGGTTTTAT AAAACTCAAA CATAATAACA ACAAATACAT TTTTGATTAT 
GATAATGTGT CAATAATACG AATTGATGAC AAAGTAGAAA AATTTTTAAA TTTAATGCAG
AAGTATGAAT GGGAAGAATT AGAAAATAAA TCATCTATGT ATATGACTAA ACAGGAATAT
AGAAAATTAA TAGAATCTAT GAAATCTATG GGTTTTTTAA GAGAAGCAGA ATTTGAAGGT
AAACAACACT ATGATTCTAG CAATAAAATT TCATCGATAA CATTGATGCT TATTCAAGGC
TGTAACTTAG CATGCAAGTA TTGTTTTGGT GATGAAGGTA GATATAATCA CACTGGATTT
ATGGATTCAG ACACAGCTAA ACAATCAATA GATTTTTTAA TTGAAAATAC AAATAGTGAT
AAACTTAATA TTATATTTTT TGGTGGAGAA CCGTTATTAA GATTTGATCT GATTAAAGAA
ATAGTAAATT ATTGTAAAAT CAAAGAATCA ACCAATAAAT TAAAATTTTA TTTTAGTATG
ACAACTAATG GTACATTGTT AAATAAAGAA GTAAATGAAT TCATCATTGA AAATAAAATT
AATACAATGA TTAGTATAGA TGGAGACTTA AACGATAATA GTGATAGGGT TTATCAAAAT
GGCAATCAAG CATACAATGA CATTATAGAA AATACTTTAT ACTTAAGAAA TAAAGGCTTA
CTATCGGCTA GAGCTACCAT TACTCCTAGA AATTTAGATA TGGTTAGAGT TTTTGAGCAT
TTAAATACCT TGAACTTTAA GAATATACCA ATATCTGCCG CTGACAATTC TTTAGGAACA
CTTGAATATA AGAGGTATAT TGATGAAAAT ATAAATCTTA TCAATCAATT TAAAGATTAT
ATTAAAAATG GAGAAATAGA TAAAGCTAAG AAAGTAAAAA TTCTATTCAG GGCATTAAAG
CAAATACATT TTAGCAAAAA ACAAAATTAC CCTTGTGGTG CTGCTTTTAA CTCAGTGGCT
ATTGATATTG ATGGAAATAT ATATCCTTGT CATAGGTTTG TATCGTATGA TCATTATAAT
ATAGGTAATG TATATTCAAA TTGTATGCAG ACTTCTAATT TCATAAAAAA AATCTTTAAT
GATAATAGCA AGCTTACTGA ATGTAGTAGC TGCTTTGCTA AACATTTCTG TAGGTGTGGC
TGCCCTTACG AAAATTATGA AAACACAGGT ATTTTAAATA GACCTTCCAG TAGACAGTGC
TATTTAAATA AAGTTATATT TATGAAATTA TTATACTTAT ATATAGATTT AAGTGATTCT
GAGATAAAGG CGTTATTTGA GTAG
 
Protein sequence
MITSGFIKLK HNNNKYIFDY DNVSIIRIDD KVEKFLNLMQ KYEWEELENK SSMYMTKQEY 
RKLIESMKSM GFLREAEFEG KQHYDSSNKI SSITLMLIQG CNLACKYCFG DEGRYNHTGF
MDSDTAKQSI DFLIENTNSD KLNIIFFGGE PLLRFDLIKE IVNYCKIKES TNKLKFYFSM
TTNGTLLNKE VNEFIIENKI NTMISIDGDL NDNSDRVYQN GNQAYNDIIE NTLYLRNKGL
LSARATITPR NLDMVRVFEH LNTLNFKNIP ISAADNSLGT LEYKRYIDEN INLINQFKDY
IKNGEIDKAK KVKILFRALK QIHFSKKQNY PCGAAFNSVA IDIDGNIYPC HRFVSYDHYN
IGNVYSNCMQ TSNFIKKIFN DNSKLTECSS CFAKHFCRCG CPYENYENTG ILNRPSSRQC
YLNKVIFMKL LYLYIDLSDS EIKALFE