Gene Apre_0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_0473 
Symbol 
ID8397248 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp539973 
End bp541418 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content40% 
IMG OID644994830 
ProductPTS system, trehalose-specific IIBC subunit 
Protein accessionYP_003152241 
Protein GI257065985 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01992] PTS system, trehalose-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGAAAAT ATACAGATGA TGCGAAGCTT TTGCATCAAT ATATTGGAGG AGATAGTAAT 
ATCTCATCTG TTACGCATTG TGTTACCAGG ATGCGTTTTG TACTAAATGA TCCAAAGAAG
GCGGATGTAG AAAAAATCGA AGATCTTCCT TCAGTGAAAG GATCTTTCAC CCAGGCTGGC
CAATTTCAGG TTATTATTGG AAACGATGTA GATGAGTTTT ATAATGACTT CATGGCTATA
TCACACGCCA CAGAAAAGAG CAAGGATGAA GTAAAAAAGG ATGCTGTGAA AAACCAGAAC
GCCCTTCAAA GGGTATCTTC AGTCCTTGCG GAAATCTTTG CGCCTTTAAT TCCAGCTATT
ATCGTAGGTG GTTTGCTCTT AGGTTTTAGA AATATTCTTG GAGAGATGCC TTTTGATAGT
CTTGGAGGAA AGACAATCGT AGAGACTTCT GTTTTTTGGA ATGGGGTAAA TGACTTCTTG
TGGCTTATAT GTGAAGCAAT CTTCCACTAC CTACCAGTAG GGATCACCTG GTCTATCACA
AGGAAGATGG GTATAACCCA AATTCTAGGA ATTGTTCTAG GTATTTGTTT GATTTCACCT
AACCTACTTG CCAATGCATA TTCAATAGCA GGTGGGGGAG AAATTCCTGT CTGGGACTTT
GGATTCTTCA CAATAGAAAG AATTGGCTAC CAAGCCCAGG TAATCCCAGC CATGCTTGCA
GGCTTCCTCT TGGTTTATCT TGAAAGATTC TTCAAGAAGG TCATCCCTCA AGCAATATCA
ATGATTTTTG TTCCCCTATT TTCACTCATA CCAACAGTAC TTCTAGCTCA CTTAGTCCTA
GGTCCTATTG GTTGGAAGAT AGGCTCACTA ATCTCTGCAG GAGTATATAA TGGATTGACC
TCAGCCTTTA ACTGGCTATT TGCTGCAGTA TTTGGTTTCT TCTATGCGCC ACTTGTTATT
ACAGGACTTC ACCATATGAC AAATGCAATC GACCTTCAGC TTGCAAATGA CTTCGGTGGA
ACAATCCTTT GGCCAATGAT TGCCCTATCA AACATTGCCC AAGCCTCAGC AGTAGTAGCT
ATAATCTACC TACACAGAAA AGACGAGAAG GAGAAACAAA TCTCAGTTCC AGCAGCAATT
TCTGCCTATC TTGGAGTTAC AGAGCCTGCT CTTTTTGGTA TCAATATCAA ATACGGCTTC
CCATTTATAG CGGGAATGAT TGGATCTGCC CTTGCAGCGG TATTTTCTGT ATCAACTTCA
ACCATGGCCT ACAACATAGG TATAGGTGGA CTTCCTGGAA TTCTTTCAAT AATGGGAGGA
TCTAGGTTAA ACTTCGCCAT ATCCATGGCT ATTGCAATAG TTGTGCCTGT AGTACTTACT
GTAGTATTTG AAAAGAAGAA AATGTTTCAT AACAAGATAG AATTTAAGAC ACCAAGTTTT
AGCTAG
 
Protein sequence
MGKYTDDAKL LHQYIGGDSN ISSVTHCVTR MRFVLNDPKK ADVEKIEDLP SVKGSFTQAG 
QFQVIIGNDV DEFYNDFMAI SHATEKSKDE VKKDAVKNQN ALQRVSSVLA EIFAPLIPAI
IVGGLLLGFR NILGEMPFDS LGGKTIVETS VFWNGVNDFL WLICEAIFHY LPVGITWSIT
RKMGITQILG IVLGICLISP NLLANAYSIA GGGEIPVWDF GFFTIERIGY QAQVIPAMLA
GFLLVYLERF FKKVIPQAIS MIFVPLFSLI PTVLLAHLVL GPIGWKIGSL ISAGVYNGLT
SAFNWLFAAV FGFFYAPLVI TGLHHMTNAI DLQLANDFGG TILWPMIALS NIAQASAVVA
IIYLHRKDEK EKQISVPAAI SAYLGVTEPA LFGINIKYGF PFIAGMIGSA LAAVFSVSTS
TMAYNIGIGG LPGILSIMGG SRLNFAISMA IAIVVPVVLT VVFEKKKMFH NKIEFKTPSF
S