Gene Apre_1427 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1427 
Symbol 
ID8398237 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1544065 
End bp1545384 
Gene Length1320 bp 
Protein Length439 aa 
Translation table11 
GC content42% 
IMG OID644995792 
Productputative chlorohydrolase/aminohydrolase 
Protein accessionYP_003153171 
Protein GI257066915 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID[TIGR03314] putative selenium metabolism protein SsnA 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0458985 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATAATTA AGGCAAAGAC CATTATTACC AACGATAAGG CTAACAACTT CTACGAGGAT 
GCGGCAATCC TTATAGAGGA AAATATCATA AAGGAAATAG GAGATTTTGA TAAGATAAAG
GGAGAAAATC CAGACGAAGA GGTCCTAGAT TTTTCTGATA AGCTTGTTAT GCCAGGCCTA
ATATGTGCTC ACTCCCACGC CTATTCTGCC TATGCGAGAG GGATGAGCGT ATCAAAGCCA
ACTGATAACT TCTTTAACGT CTTAGAAAAC CTCTGGTGGG CCCTAGATAA GGAACTTACC
CTAGAAGATG TGAAGCTAAA TGCCCTAACT ACCTTTATGG AGTCAGTTCA AAACGGGGTG
ACAACTATTA TAGACCACCA CTCAGGACCA AATGCTATAG AAGGATCTCT ATCTACTATG
GCAGATGCTG CGACAGATCT TGGAATTAGA GCAAGCCTCT GCTATGAAGT ATCCGACAGG
GACGGGATGG ACAAAAGAGA CCTAGGAATT GCTGAAAACA TAAACTGGAT TAAGGAAGCA
GAAAAAAGAG ACGATATGCT AAGTGCTCTC TTTGGCCTCC ACGCTTCCTT CACCCTATCA
GATGAGACCC TAGCAAAATG CCAAAAGGCC ATGGAAGAGG TCAATTCTGG CTACCATGTT
CACATAGCCG AAGGCATAGA AGATGAGTGG GAAACAGTAA AGATGAGTGG GAAGAGAATC
GTAGATAGGC TCGATGCCTA CGGAATCTTT AACGACAAGA CTCTTTCTAT TCACAACGTC
CACATCAACG AAAGAGAGAT GGATATCCTA AAAGAGAAAA ATACCATGGC AGTATTCAAT
CCGGAAAGCA ATATGAATAA TGCCGTGGGA GCTCCTCCAA CTGTAAGGAT GCTTGAGAAG
GAAATACTTT TAGGTCTTGG GACAGATGCC TACACCAACG ACATGTTTGA GTCAATGAAG
GTGGCCAAGG TTTTCATCAC TCACGAAAAC CATGACCCAA CCAAGGGCTT TGCCGAAGCT
ATCAAGATGC AATTTGAGAA CAATCCAAAG ATTATGGAAA GATACCTCAA AAGGCCAGTA
GGAAGAATCG AAGAAGGAGC CTACGCTGAT ATCATAGCCC TAGACTATGA TCCGATAACT
CCTTTAGAAA AAGAAAACTG GCCAGGCCAC GTCCTTTTCG GCCTAAGTGG AAAATGTGTA
ACAGATTCTA TAATAAACGG CAAAGTAGTA ATGGCTGATA GGAAAATTAA GACAGTCGAT
CAAAAAGCAA TCCACGAAAA ATCAAGACAA AGAGCAAAAG CAATCTGGCC TAAACTTTAG
 
Protein sequence
MIIKAKTIIT NDKANNFYED AAILIEENII KEIGDFDKIK GENPDEEVLD FSDKLVMPGL 
ICAHSHAYSA YARGMSVSKP TDNFFNVLEN LWWALDKELT LEDVKLNALT TFMESVQNGV
TTIIDHHSGP NAIEGSLSTM ADAATDLGIR ASLCYEVSDR DGMDKRDLGI AENINWIKEA
EKRDDMLSAL FGLHASFTLS DETLAKCQKA MEEVNSGYHV HIAEGIEDEW ETVKMSGKRI
VDRLDAYGIF NDKTLSIHNV HINEREMDIL KEKNTMAVFN PESNMNNAVG APPTVRMLEK
EILLGLGTDA YTNDMFESMK VAKVFITHEN HDPTKGFAEA IKMQFENNPK IMERYLKRPV
GRIEEGAYAD IIALDYDPIT PLEKENWPGH VLFGLSGKCV TDSIINGKVV MADRKIKTVD
QKAIHEKSRQ RAKAIWPKL