Gene Apre_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0471 
Symbol 
ID8397246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp538148 
End bp539392 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content36% 
IMG OID644994828 
Producthypothetical protein 
Protein accessionYP_003152239 
Protein GI257065983 
COG category[S] Function unknown 
COG ID[COG3949] Uncharacterized membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAAAA AAATTATAAG AATAATGTTA GCCTATGTAG GAGTAATCAC TGGAGCAGGA 
CTAGCATCAG GCCAAGAGCT TATGCAGTAT TTTGTATCAA TGGGTATTCC TGGCATAGTC
GGAGTAGTAG TACTCGCATT TTTACATATG TTAATAGGAG GACTCCTACT TCAACTTGGT
AGCCATTATC TAGCAAATGA CCATTCGGAA GTTTTTGATG AGATTACCAA TAAGGTCATC
AGTAAATTTA TGGACTTATC CTTGATATTC ACTTGCTTTG TAATAGGTTT TGTGATGATT
GCAGGAGCTG GTTCAAACTT AAACCAGGCC TTTGGTACAC CTAACTGGTT GGGAGCTGTG
ATCTGTGCCC TATTAATTAT AGTTGTAGGT ATGCTAGATT TTGAGAGGGT TAGTCAGATT
ATAGGATCTT TTACTCCTTT AATCCTAGTC TTTACCCTAA TCGCTTCGAT CTATACCTTT
ACCCATCACA CACCTGACTG GAAAAGCCTT GACCTTGTTG CAAGGAGTCT ACCAAGCAAC
TTCTCAAGCG TTACTCTTTC CCTATTCAAC TATTTTGGTA TGTCGATAAT GACAGCTGTC
TCTATGGCCC TAGTTTTGGG TGGAGATGAA CTTAATACAG GAGAAGCTGG TATAGGAGGT
CTTATTGGAG GTCTTCTAGT TGGAATATTG GGTATACTTA TTGTTCTTAC TTTGTTTATT
AGGGTAAATG AAGTAAAAGA TCTAGATATA CCAATGCTCT ATGTAATAGA AGATATAAGT
CCAATCCTTG GAACTGTCAT GGCACTTGTC ATATTTGGCA TGATATTTAA TACAGGAATT
TCTCTTTTCT ATGCTTTGGC ACGTAGATTT TCCGGAGGAG AAGAGAGGAA GTTTAAAATC
CTTCTCGTAA CTATAACACT TGCTGGGTTT ATCCTAAGCT TTGGAGGATT TAAGAAGCTT
GTATCAGTAT TTTATCCAAT CATAGGTTAT GCTGGTATTA TCATGCTAGT GATTTTAGTT
TTTGCCTATC ATAAGGAAAG AACTTCTATC AAGTATGAAA ACATCAGAAG ATTTGCCATC
AATCATTATA TGAGAAAGAA ACTCGATGAT GATATGGAAT ATACAAAAGA AGATAAGGAA
AAGCTTAAAA GATTAATAGA AAGAAGCCAT ATAGAAAATA AGGATATGTT AGAAAAATCT
GAAGAAGCTG TTCAAGAAAT TTTGGACGAA GAAAATGAAG AATAA
 
Protein sequence
MNKKIIRIML AYVGVITGAG LASGQELMQY FVSMGIPGIV GVVVLAFLHM LIGGLLLQLG 
SHYLANDHSE VFDEITNKVI SKFMDLSLIF TCFVIGFVMI AGAGSNLNQA FGTPNWLGAV
ICALLIIVVG MLDFERVSQI IGSFTPLILV FTLIASIYTF THHTPDWKSL DLVARSLPSN
FSSVTLSLFN YFGMSIMTAV SMALVLGGDE LNTGEAGIGG LIGGLLVGIL GILIVLTLFI
RVNEVKDLDI PMLYVIEDIS PILGTVMALV IFGMIFNTGI SLFYALARRF SGGEERKFKI
LLVTITLAGF ILSFGGFKKL VSVFYPIIGY AGIIMLVILV FAYHKERTSI KYENIRRFAI
NHYMRKKLDD DMEYTKEDKE KLKRLIERSH IENKDMLEKS EEAVQEILDE ENEE