Gene Apre_0330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0330 
Symbol 
ID8397104 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp373719 
End bp375059 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content43% 
IMG OID644994690 
Productprotein of unknown function UPF0054 
Protein accessionYP_003152102 
Protein GI257065846 
COG category[R] General function prediction only 
COG ID[COG0319] Predicted metal-dependent hydrolase 
TIGRFAM ID[TIGR00043] metalloprotein, YbeY/UPF0054 family 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATCTTT TAATTGAAAA CAATACAGAA GAAAAAATAG AAATGGATGC TGACCTAGAA 
AGAACTGTCA AGGAAGTCCT AAAGACTGAG GGATTGGGCG AAGACTACGA AGTTTCCATC
ACCTTTGTCG ATAGAGAGGA GATTCATAGG CTAAATAGGG AATTTAGAGA TGTAGATAGG
CCAACAGATG TCCTATCCTT CCCCCTAGAT GATACTATAG ACTTACCAGA CGCTGATAAG
ATGCTGGGAG ATATAGTAAT ATGTCTGGAT GTGGCCAAGG ACCAGGCCAT GGAGTTCGGC
CATTCTCTAA GGCGCGAAAT CATGTATCTG ACCTGCCACT CCACCCTCCA TCTTTTAGGA
TACGACCATA TGGAAGAAGA CGAGAAAAGG GAGATGAGAC GTCGTGAGAA GGAAGTCATG
AGAAATCTTA GAGTCTTCAA GAACGACGAT CACGAGAACT TTGACCATAT GGACAAAAAC
GACTACGAGA AGGAAAGAGA AGAAGCTGCT AAGAATGATG AAAAGTTTCT AGAATTTAGG
GACGAGCTAA AGGAAGAAAT CAAGGAAGAG CTTAGGCTTG AGCAAAGATA CGCCAATGCC
GAATATGGGA AAATTTATAC CAAGGAGAAC CTCAGCCACG GGCAGAAGTT TCTCAAAGGT
TTTGATTATG CCTATGAGGG CCTAGTCTTT GCCATAAATC ACGAAAAAAA CATGAAATTT
CATATCCTAG CTTGTGCCAT AGTCTTTGTG GCAAGTTTGT TTTTCAATAT TTCTAGGATT
GAGATGATGT TTCTCATCTT TGCCATATCA TCAGTCATAG CCCTAGAGCT TGTAAATACA
GCTCTTGAGA ACGCTGTCGA TATAGCAGCA GACGGCAGGT GGCTATCTCT TGCCAAGTCC
GCCAAGGACG TATCAGCCGC GGCAACCTTC ATAGCAGCCC TCAATGCACT TTTTGTGGGC
TATATGATAT TTTTTGATAA GTTCCTCAAC TTCTACGATT CAGTAATCTT GAGGATTGCG
AGGAGACCGA GCCATTTGGC GGTAATTTCG ATTTCTCTTA TCATAATCAT AACCATATTC
CTTAAGGGAG TTTTCTATGA GGGACATGGT ACTGCCTTTA GGGGAGGCTT TGTTAGCGGA
CATACTTCCG TATCCTTTGG CCTTGCGACC ATTGGTATCC TCCTTATGGA CAATCCCATG
GTGATGATTC TTATGGCCTT TATGGCTCTT ATAGTTGCAG AATCCAGATA CGAGGCAGAC
ATCCATTCGA CAAAAGAGAT AATTAGGGGA GCCATACTTG GCATAAGCGT GGCCCTTGCA
ATATTTGGGA TATTTTCATG A
 
Protein sequence
MHLLIENNTE EKIEMDADLE RTVKEVLKTE GLGEDYEVSI TFVDREEIHR LNREFRDVDR 
PTDVLSFPLD DTIDLPDADK MLGDIVICLD VAKDQAMEFG HSLRREIMYL TCHSTLHLLG
YDHMEEDEKR EMRRREKEVM RNLRVFKNDD HENFDHMDKN DYEKEREEAA KNDEKFLEFR
DELKEEIKEE LRLEQRYANA EYGKIYTKEN LSHGQKFLKG FDYAYEGLVF AINHEKNMKF
HILACAIVFV ASLFFNISRI EMMFLIFAIS SVIALELVNT ALENAVDIAA DGRWLSLAKS
AKDVSAAATF IAALNALFVG YMIFFDKFLN FYDSVILRIA RRPSHLAVIS ISLIIIITIF
LKGVFYEGHG TAFRGGFVSG HTSVSFGLAT IGILLMDNPM VMILMAFMAL IVAESRYEAD
IHSTKEIIRG AILGISVALA IFGIFS