Gene Apre_0083 details


Gene Information

Locus tag: Apre_0083
Symbol:
ID: 8396834
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 104092
End bp: 105606
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 40%
IMG OID: 644994422
Product: PTS system, N-acetylglucosamine-specific IIBC subunit
Protein accession: YP_003151857
Protein GI: 257065601
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
TIGRFAM ID: [TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component
            [TIGR00826] PTS system, glucose-like IIB component
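
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(104092, 105606)
print(length)                     # 1515
print(protein_length_aa(length))  # 504
```

Both values agree with the Gene Length and Protein Length fields listed in this record.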


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 6.36511e-09
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGGAAA AACTACAAAG ATTAGGGCAA TCCCTAATGC AACCAGTCGC TGTAATGCCT 
CTTGCAGCCC TACTTTTGGG TATAGGTTAT GCAATCGACC CAGACTTATG GGGCGGAGGA
TCACCAATAG CAGCCTTCTT AATATCTGCA GGTGGATCAA TCCTCGACAA TTTAGGTATT
ATTTTCGCAG TCGGCATAGC ATTCGGTATA GCTCACGACA ACCACGGAGC AAGTGCTCTA
GCAGGTCTTG TATCATTTTT AACTATTATA AGATTGCTCG CTCCAAACAC AGTAGCCATG
CTTTCAGGCT TGGATTTAGA AGCATGGACT GCAGCTGATG AATTTAGGGC AACAGCCTTT
AGTACAATGG GCAACGGCAA TGTATTTGTC GGAATCTTAT CAGGTATTAT CGGTGGATTT
GCTTACAACA AATTCTTCTC AACAAAACTC CCTGATTTCT TGGCCTTCTT CTCAGGTAGA
AGACTTGTTC CAATCATGGC ATCATTTATG GCCATGGTCG CTTCAGGAAT TCTTTTTATC
CTCTGGCCAA TTATTTATGT AGGACTTGTT AATTTCGGAC AAATCCTCCT AAATCTCGGT
CCAGTAGGTG CAGGAATATA TGCTTTCTTT AACAGACTTT TAATCCCTAC AGGTCTTCAC
CACGCTCTTA ACCAAGTATT CTGGTTTGAC CTAGTTGGAA TCAACGACAT TCCTAACTTC
TTAGGAAATG TTCAAGAATC AATAACTAAA GTCTATCACC CAGGTATGTA TCAGGCAGGA
TTTTTCCCAA TTATGATGTT TGGTCTTCCA GGAGCCGCCC TTGCTATTAT CAAAAAGGCT
GATAACGACA AGAAAAAGTC AACCAAGGCT ATAATGATAG CAGCAGCTCT AGCATCTTTT
GCAACAGGAG TTACAGAACC ACTTGAATTT TCATTCATGT TCGCTGCTCC ACAACTTTAC
CTAATCCACG CAGCTTTTAC CGGCATATCT GCATTTATTG CAGCAAGTCT AAAGGCTTAC
GCAGGATTTG GTTTTTCTGC AGGTCTAGTA GACTTTATAC TTTCACTCAA AAACCCAATG
CATGCCAATA TCCTAATACT TATAATTATG GGTATAGTGT ATTTTGCTCT TTACTATTTT
GTCTTTAGTG CTTTGATAGA AAAATGGGAT ATAGCTACAC CAGGTAGGAA GACAGAAGAT
ACAGGAAAAG TCAGACCAGA TGATAAAAGC GCACTAGAAG AAGAAAATGA GAAAAAAATC
GTTCACTCAA ATTCTTATGA GAAAACAGCA GCAAAAATCC TAGAAGGTCT TGGTGGCAAG
GAAAATATCG ACACAACAAG CTATTGTACA ACAAGACTTA GACTAACAGT CCATGACCAA
GAAAAGGTAA ATGACGAAAG AATAAAGGAA GCTGGAGTTG CTGGAATCAT GAAACCAGGA
CCTAAAGCCG TCCAAGTAAT CATTGGACCT CAAGTCCAAG CTGTTTACGA TGAATTTATG
AAATTAATTA AATAG
 
Protein sequence
MKEKLQRLGQ SLMQPVAVMP LAALLLGIGY AIDPDLWGGG SPIAAFLISA GGSILDNLGI 
IFAVGIAFGI AHDNHGASAL AGLVSFLTII RLLAPNTVAM LSGLDLEAWT AADEFRATAF
STMGNGNVFV GILSGIIGGF AYNKFFSTKL PDFLAFFSGR RLVPIMASFM AMVASGILFI
LWPIIYVGLV NFGQILLNLG PVGAGIYAFF NRLLIPTGLH HALNQVFWFD LVGINDIPNF
LGNVQESITK VYHPGMYQAG FFPIMMFGLP GAALAIIKKA DNDKKKSTKA IMIAAALASF
ATGVTEPLEF SFMFAAPQLY LIHAAFTGIS AFIAASLKAY AGFGFSAGLV DFILSLKNPM
HANILILIIM GIVYFALYYF VFSALIEKWD IATPGRKTED TGKVRPDDKS ALEEENEKKI
VHSNSYEKTA AKILEGLGGK ENIDTTSYCT TRLRLTVHDQ EKVNDERIKE AGVAGIMKPG
PKAVQVIIGP QVQAVYDEFM KLIK
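
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: it assumes the sequence is given in the coding orientation, uses the codon assignments of the standard genetic code (which translation table 11 shares for every codon; table 11 differs in which codons may act as start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI layout); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_block = "ATGAAGGAAA AACTACAAAG ATTAGGGCAA"  # first 30 bases of the gene
last_block = "AAATTAATTA AATAG"                   # final 15 bases of the gene
print(translate(first_block))  # MKEKLQRLGQ
print(translate(last_block))   # KLIK
```

The two printed fragments match the first and last residues of the protein sequence above. Passing the complete gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.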