Gene Apre_1583 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1583 
Symbol 
ID8398395 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1719778 
End bp1721091 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content34% 
IMG OID644995947 
ProductPTS system, lactose/cellobiose family IIC subunit 
Protein accessionYP_003153325 
Protein GI257067069 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1455] Phosphotransferase system cellobiose-specific component IIC 
TIGRFAM ID[TIGR00410] PTS system, lactose/cellobiose family IIC component 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAAA CAAATAAAAT AAGCTTCATG GACAAGTTTA CTGAAGCTGC TATGAAATTT 
GGTGCTCAAG TACACCTTCG CTCACTTAGG GATGCCTTCG CAATAATGAT GCCCCTATTT
ATATTAGCAG GTCTTGCGGT ATTAATAAAT TCTGTAATAT TTCCTAAAAT ATTAAGTGAA
AGTGCCATTC AAACAGCAGG ACATTGGGCA ACTTCTATAG CAAATGCTAC CTTAAATGTT
TCTGGATTAA TCTTGTGTGG AATAATAGGT TATACTCTAT CTAAAAATAA AGGTTACAAA
AGTCCTTATA CATGTGTAAT GATAGCTATA GCAGCATTAA TTGTAATGAT GCCACAAACA
CTAAAAATAG CGGCTACTGA TGGATCTGAA GTTGAAGTGG GAGGAATCTT AACCTATGGA
AACCTTGGTA CCTCTTCAAT GTTTGCGGGT ATTATTGTAG GTTTATTGTC TACTGAAATT
TATTTGAGAC TTTCTAAAAT TGATAAGTTA AGGGTAAATA TTGGTGGAGA TGTACCACCA
CAAGTAAATG CATCCTTTAA TAATATGATT CCTGCCATGC TATCAATTAT TATATTCTCT
ATTGTAAGTT TTGTGCTATA CTCAGTATTT AATACAGACT TAATAACTTT GATAACAACA
ATGATTCAAG AGCCTTTGAG GAAAGTAAAC ACTTCTCTTG TTGGTACAGT ATTGATATAC
AGCTTTGGAA ACTTATTGTT CACATTCGGT ATTCACCAAA CAGTAGTAAA TGGAACAATC
CTTGAACCAT TGCTTCTTGT AAATATGAAT GAGAATATGG CAGCTGCAGC AGCAGGCAAA
GAGATTCCTC ATATAATTAA TTCTACATTC GTCCCAACAT TTGGCATGCT AGGTGGTACT
GGTTCAACAA TATGCTTGTT AATTGCAGCA TTCCTATTTT TCAGGAAAAA TCAACAATAC
AGTGAATTAG GGAAATTAGC TGTAGCTCCA GGATTATTTA ATATAAACGA ACCTGTTATA
TTTGGTTTCC CTATAGTGTT CAACTTGCCA ATGATAATAC CTTTCGTATT GACTCCAGCT
ATAGGAATTA TAATAGCTTA TTTTGCAACA GCAATAGGTT TTATGAATAA ATGTACAGTG
CTTGTGCCTT GGACTACTCC TCCATTATTA AATGGATTTT TAGCAACTGG AGGAGACTTC
AGAGCTATTA TAGTTCAATT AGTGATAATT ATTATAGGTG TACTATTATA CTTGCCATTT
ATGAAGATAA GTGAAAGAGT AAGCAGAAAA CAAGCAGAAG CTTTAAATAA CTAG
 
Protein sequence
MTETNKISFM DKFTEAAMKF GAQVHLRSLR DAFAIMMPLF ILAGLAVLIN SVIFPKILSE 
SAIQTAGHWA TSIANATLNV SGLILCGIIG YTLSKNKGYK SPYTCVMIAI AALIVMMPQT
LKIAATDGSE VEVGGILTYG NLGTSSMFAG IIVGLLSTEI YLRLSKIDKL RVNIGGDVPP
QVNASFNNMI PAMLSIIIFS IVSFVLYSVF NTDLITLITT MIQEPLRKVN TSLVGTVLIY
SFGNLLFTFG IHQTVVNGTI LEPLLLVNMN ENMAAAAAGK EIPHIINSTF VPTFGMLGGT
GSTICLLIAA FLFFRKNQQY SELGKLAVAP GLFNINEPVI FGFPIVFNLP MIIPFVLTPA
IGIIIAYFAT AIGFMNKCTV LVPWTTPPLL NGFLATGGDF RAIIVQLVII IIGVLLYLPF
MKISERVSRK QAEALNN