Gene Apre_1112 details


Gene Information

Locus tag: Apre_1112
Symbol:
ID: 8397899
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 1194681
End bp: 1195940
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 40%
IMG OID: 644995459
Product: Amidohydrolase 3
Protein accession: YP_003152860
Protein GI: 257066604
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
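
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. The function names below are illustrative and not part of the source record. The sketch assumes 1-based inclusive coordinates and a coding sequence that ends in a single stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(1194681, 1195940)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

With the coordinates listed in this record, the sketch reproduces the stated 1260 bp gene length and 419 aa protein length.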


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00316686
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAA AATTTATCAA TGCAAAGATT TATGGTTATG AAGATGCGAG AGAAATTCTC 
GTAGAAGATG GTTGTTTTAA GGAATTTGGC AATAAGCTAG AGGCTTGTGA TGAAGTAATC
GATCTAGATG GAAGGCTTGT AATCCCACCT TATGTAGATA GTCACCTTCA TCTTGATTAT
TATATGATTG GCAAGACCGA TGAGGTAAAG AATGAATCGG GAACTCTTTT TGAGGCGATT
GACCTATGGA ATGACTTCAA GAAGGGCTCA AGCAAGGAAG AGATGAAGGA AAGAATCTAT
GGGGCTGTAG AAGAATGTCT ATCCCACGGA ACTCAATATA TCAGAGCCCA AACCGATTGT
ACAGATCCTA ATCTTACAGG AATTAAAGCA GCCCTTGAGG TTCGTGATGA ATTGAAGGAT
AAGGTCACAA TCCAAGTCGT AGCCTTCCCA CAAAATGGTA TGTATTCATA TGAGGAAGAA
GGAAAGACAG GTAGAGATCT TGTAGAAGAA GCCCTAAAGC TTGGTTGTGA AGTAGTCGGA
GGCATCCCTC ACAACGAATG GTCAAGGGAT TTAGGAGAAA AATCCATCAA AGAAATCGTA
AGGCTTGCCG TAAAATACGA TAGGCTAATA GATGTACACT GTGACGAGAC AGATGACGTG
ATGGCAAGAT TTGTCGAAGT ACTCAATGCG GAGGCTATGA TAAATAAAAT AGGGGAAAAG
ACTACAGCAA GCCATACCTG CTCTTTTGGG TCTGCGGATG ATTCCTATGC CTTTAGGATG
ATGGGCTTAT TTAGAAAATC TAAGCTTAAC TTCATAGCCC TTCCTACAGA AAACGCATTT
TTGCAAGGTA GACAAGACTC TTATCCAAAA CGTAGGGGAC TTACCAGAGT TTTGGAATTT
GTAGATAATG GAATCAATGT TTGCTTTGCC CAAGACTCCA TAGTAGACTT ATGGTATCCT
GCTGGCAACG GTAATCTCAT TAATATCCTA GACAATGGAA TTCACCTAAG CCAACTTATG
AGAGAAAAGG ACTTCGAAAA AGACTTCGAT CTTGTTACCT ACAATGGGGC AAGGACCATG
CACATAGAAG ACGATTACGG TTTTGATCCA GGAAAGCCTG CCAACTTTAT AGTTTTGGAT
GCAGAAAATG AATTTGAAGC TATAAGAAAC AGGGCCGAGT GTTTGGCATC AGTACGTGAG
GGAGAATTCC TATTCAAAAA GGCCAAAAGA GAATATGATG TGAAACTAAA TATAAGATAA
 
Protein sequence
MKKKFINAKI YGYEDAREIL VEDGCFKEFG NKLEACDEVI DLDGRLVIPP YVDSHLHLDY 
YMIGKTDEVK NESGTLFEAI DLWNDFKKGS SKEEMKERIY GAVEECLSHG TQYIRAQTDC
TDPNLTGIKA ALEVRDELKD KVTIQVVAFP QNGMYSYEEE GKTGRDLVEE ALKLGCEVVG
GIPHNEWSRD LGEKSIKEIV RLAVKYDRLI DVHCDETDDV MARFVEVLNA EAMINKIGEK
TTASHTCSFG SADDSYAFRM MGLFRKSKLN FIALPTENAF LQGRQDSYPK RRGLTRVLEF
VDNGINVCFA QDSIVDLWYP AGNGNLINIL DNGIHLSQLM REKDFEKDFD LVTYNGARTM
HIEDDYGFDP GKPANFIVLD AENEFEAIRN RAECLASVRE GEFLFKKAKR EYDVKLNIR
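
The relation between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It assumes that the amino acid assignments of translation table 11 match those of the standard genetic code, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG). The names `clean`, `gc_fraction`, and `translate` are illustrative and not taken from the source page.

```python
from itertools import product

# Codons in NCBI order (T, C, A, G) paired with amino acids; '*' marks stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGAAGAAAA AATTTATCAA TGCAAAGATT"
print(translate(first_block))  # MKKKFINAKI
```

Translating the first 30 bases gives the first ten residues of the protein sequence. Applying `gc_fraction` to the full gene sequence should give a value close to the 40% reported in the record.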