Gene Apre_0226 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0226 
Symbol 
ID8397000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp258534 
End bp259631 
Gene Length1098 bp 
Protein Length365 aa 
Translation table11 
GC content35% 
IMG OID644994587 
Producthypothetical protein 
Protein accessionYP_003151999 
Protein GI257065743 
COG category[R] General function prediction only 
COG ID[COG4194] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATGAAA ATATGAGTAT TGCAGTATTT TACGGGATAA TTATTATTTT GATAAGTCTT 
ATCCAAGCCT TCGTCACATC CTACTCCAAG AGGGGCTATG TACTTGGAGT TAGACTAGTG
GAAGACTTAG AAAAGGATAG GGAAGTAAGA AAGATAGTAA AAGACTATAG GACTTTGACT
ATCCTTGTAG GATTTGCCCT AGCTTTACTT ATAGTCGGCT TAAGCTATCT TATAGAAAAT
GAGGCCTTGC TTAATTTAGC CTATATTCTA TCAATATTTC TAACTTATAT TCCCCTAGTT
CTTGCTAACA AGAAACTAAA AGTATTAGCA AAGGATCAAA AAGTAGATAA GAGAAAAGTT
GTAAGTCTAG ATTATTCAAA GATAAAAATA TTTAACAAGA AGGAATTTTT TGGCATATAC
CTAGGCCTCC TCCTTATAGT GATAATCTTT GCCATAAGAA TCCACCTAGA CTATGAAAAC
TTTCCAGATA AATTAATTAT GCATATGAAT AGCAAGGGAG AAATTGATGG GATAGCTCAT
AAATCTTACC TATCTATCCA ATCCCCAACT ATAGTAAGTT TCTTTATGCT AGCAGTGATG
TTTTTTGCAA ATCTTTCCCA ACTTCTATCA AAGATGAGAA TTAGCCCAGA TATGCCAGAA
GAGTCCTTGG ATAGGCTCTT AGAAACTAGG AGGATTTGGA CCTATTATTT TGCAACATCG
GCAATTTTAC TTATAGTTTT ATTCCAAGTA GGAATTCCTT CCTTTATGAA AACTGGAGAC
GACTCCTTGG TTAAGGTCTT AGGCATAATT GCTATTGGGT TTTCTATAGG AGGTAGCATT
CTTATAGGAA AGTTTAGGTC GGTTGACGGT TCAGCCTTAA ATAAAACTGG TAGATATGGC
TACGAAGAGG AGGATGATAA GTGGATCCTA GGTGGTCTAA TTTATTACAA TCCAGACGAT
CCAGCAATAT TTGTAGAAAA AAGAGTAGGC GTTGGAACTA CTATGAACTT CGCCAATAAT
TGGGTTAAGG TAATTTTCAT TGCAGTGATA CTTTTCCCAT TTGTTCTAGG ACTTGTGCTT
AATATGTTTG AAGGATAG
 
Protein sequence
MNENMSIAVF YGIIIILISL IQAFVTSYSK RGYVLGVRLV EDLEKDREVR KIVKDYRTLT 
ILVGFALALL IVGLSYLIEN EALLNLAYIL SIFLTYIPLV LANKKLKVLA KDQKVDKRKV
VSLDYSKIKI FNKKEFFGIY LGLLLIVIIF AIRIHLDYEN FPDKLIMHMN SKGEIDGIAH
KSYLSIQSPT IVSFFMLAVM FFANLSQLLS KMRISPDMPE ESLDRLLETR RIWTYYFATS
AILLIVLFQV GIPSFMKTGD DSLVKVLGII AIGFSIGGSI LIGKFRSVDG SALNKTGRYG
YEEEDDKWIL GGLIYYNPDD PAIFVEKRVG VGTTMNFANN WVKVIFIAVI LFPFVLGLVL
NMFEG