Gene Apre_0091 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0091 
Symbol 
ID8396842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp110770 
End bp112533 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content38% 
IMG OID644994430 
Producthypothetical protein 
Protein accessionYP_003151865 
Protein GI257065609 
COG category[R] General function prediction only 
COG ID[COG5263] FOG: Glucan-binding domain (YG repeat) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAATA AGAAGATAAT AGCTTTGTTG ATATGTTTAT CTAGCTTTAC AAGCTTAAGT 
CCTTCATTTG CTGATGGAGG GGAGAAGCTA GAGGAAAATT TATATCAAGA AAGTCTTTAT
AATGAATCTA TTGATAATAT TAGCGAAGAC GCAGATGAAG TCAATATAGA AAAAGATAAT
CCGAGTGAGG ATGTTGAAGA GACTCCTTCA GAAAAATCGA TTGATGAAGG AGAAATTCGT
AAGATACTTA AAGAAATAAA GGAAGAAAAT CCGCTATCTT ACGAGCAGGA AGAGGAAATC
GAAGAGATAG AAAAGGATTC TAAATCTAAC ACAGGAGACA CAAGTAGCCT TCCTATAGTC
TATTATGACA CTTCAGATAT AGATGAAATA TTTAAGAAAA ACAATAAAAA AGCTGATGAT
GAAATTAAAG TTAGCGGAGA TTATAAATTA ATTGCTAAGA ATAATTCCAC CTATCTTTAT
GACAAAGCTG GCAAGAAACT TTCTGGTAAA AGAGATCTTT CCGGTAAGTC CTATTATTTC
GACAAAGAAA AGGGCCTAGT AAAGGAAAGA GAAGTCATAG CGGATGGAGG AAAATTTACC
GCAAACTCTA GTGGAGAGCT TACTCTAGTA GAAAAATCTA AGCCAGGCTG GATAGACCTA
GAAGGCAGGA GCTATTTCTT CCAAAATGAC GGCAAGCTTG CCAGAGGCCT TTACGATACG
GGACCTAACA GGTATTTCTT TGATAAGGAA ACTGGAGCCA TGGCCCGTAA TGAGATTAAT
ACCAAGGGCT TTGACAGAAT ATATTCTGAG GCAAACGGAA TTGCCCACTA TATAGGTTGG
GACTATTCCA AGGGAAAACT TGGATATTTC AATGAAGATG GAACCTACAC CAAGGGCCTT
AGGACGATCG ATGGAATCAC TTACGGTTTT GACGAAAATG GTAAGATTTT CAATAGGCAG
TACAAGAAGT TTGGAAATCA ATGGTATTAT TTCAATAAGT ACGGCGAAGC TAGCAAGCAT
TCAGGAAAAT TTGCCAAAGG TTGGGTTGGA GATAGGTATT ACTTCTCAGA CGGAAAGCCT
GCAGAAGGAC TACAAAAAAT TGGTGGAGTA ACCTATATAT TTGATGAGCT AACCAAGGAG
ACCTTGACCA ATACTACCAA GGTAATTAAT TTTAATAGAT ACAAGCTAGA TAGCCATGGT
CGTGCGACAT TTATCGGAAA AATCGACACT AGACAAGCGG TTAAGGGAAC CAGGGGACTC
TTTAAGCCAG GATTTTCTCA ATATCTTAAT AGGAAAACTC CTTACTTTAC CCAGAAAGAC
CCTAGATGGG CTAACAGAAG ATTTTCGAAT GGAACATTTT CAGGATATGG CTGCGGACCA
ACTGCCATGG CCATGGTCTT ATCAAGAGAG CTTCATAGGA ACGATATCTA CCCAACTAAT
ACAGCCATAG ATGCTAATGA TTACGAAAAC GACGGAACTG AATGGCAATA TTTCATGGAA
GCTCCAAGGA TGTATGGACT AAATTCTTAT GACGTGCCAG TAAATAAAAA AGCCTTCATC
CAGGCTCTAG AGACAGGAAC TATGGTAGTA AGAGTAGGGC CGGGATACTT CATAAACGGC
GGTCACTTCA TGGTAATAGA CTCCTACAAG GATGGATATT TTACAATAAA TGATCCATAC
TACTCAAGAA GAAATACCCT AGACAAACAC ACTTTCGAAA GACTAAAGGC GGAAGTGACA
GTTGGCTGGG TAATCAAAAA ATAA
 
Protein sequence
MNNKKIIALL ICLSSFTSLS PSFADGGEKL EENLYQESLY NESIDNISED ADEVNIEKDN 
PSEDVEETPS EKSIDEGEIR KILKEIKEEN PLSYEQEEEI EEIEKDSKSN TGDTSSLPIV
YYDTSDIDEI FKKNNKKADD EIKVSGDYKL IAKNNSTYLY DKAGKKLSGK RDLSGKSYYF
DKEKGLVKER EVIADGGKFT ANSSGELTLV EKSKPGWIDL EGRSYFFQND GKLARGLYDT
GPNRYFFDKE TGAMARNEIN TKGFDRIYSE ANGIAHYIGW DYSKGKLGYF NEDGTYTKGL
RTIDGITYGF DENGKIFNRQ YKKFGNQWYY FNKYGEASKH SGKFAKGWVG DRYYFSDGKP
AEGLQKIGGV TYIFDELTKE TLTNTTKVIN FNRYKLDSHG RATFIGKIDT RQAVKGTRGL
FKPGFSQYLN RKTPYFTQKD PRWANRRFSN GTFSGYGCGP TAMAMVLSRE LHRNDIYPTN
TAIDANDYEN DGTEWQYFME APRMYGLNSY DVPVNKKAFI QALETGTMVV RVGPGYFING
GHFMVIDSYK DGYFTINDPY YSRRNTLDKH TFERLKAEVT VGWVIKK