Gene Apre_1272 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1272 
Symbol 
ID8398061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1364464 
End bp1366434 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content38% 
IMG OID644995616 
Productpolysaccharide biosynthesis protein CapD 
Protein accessionYP_003153016 
Protein GI257066760 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1086] Predicted nucleoside-diphosphate sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAATT CGAAAATAAT TGACAAGAAA AAACTAATAC TAGGATTTTT TGATATATTA 
ATTATCAATT TAGCTTATTT TCTAGCCCTG TATTTCAGAT TTGATATGAA TTTCAGGGCC
ATACCTTTGG ATTTCCTTGA TGCTTTCAAG ACCTATGCTA TTTTTAATAC CATCCTTACG
ATAGCTATTT ACGGTCTACA CAAGATGTAT AACATGATAC TGGAGTATGC TTCCTACAAG
GAGCTTATTA ATATCACTCA AGCTGTTATT CTATCTCTTA TAAGTCACGC TGTATTAATG
ACAATTTTTG TCTATAGAAT GCCAATATCA TACTATATAT TCGGATTTAT GATTCAATAT
GTTTTGACTG TGGCATCAAG ATTTATCAAG AAGATGATGA TTCACAAAAG CTACAAGACT
GACAAGAAGA ACCAGTCTTA CAAGAGAGCC CTAATCGTTG GGGCAGGATC TGCAGGACAG
ACTCTAAAAA GAGATATATC TAATTCCAAC AATGGTGGCA AGACTAATAT TCTTGTTGTT
GGCTTCATAG ATGATGATCC AAAGAAGAAA AATCAATACA TAGATAATAG TAGAATCTTC
GGTGGAAGGG ATATGATAAA GGAAATTGTC GAGGAAGAAG CCATCGATGT TATTCTTGTA
GCTATTCCTT CTGTAGAGGA AGTCGAGAAG AGAGAAATCC TAAGAATATG TAACGAAACA
GGATGCGAGG TCAAAGTTCT TCCTGGTATT TACCAACTTG TTTCAGGAAA AGTTACCATG
TCTACTATGA AGGATATCCA AATCGAAGAC CTTTTAGGAC GTGATCCGGT AAAGATTTTC
TCCAACGAAA CTTTCGATTA CCTCAATGAC AAGGTAGTCC TAGTTACAGG AGGGGGAGGA
TCCATAGGAT CTGAACTTTG TAGGCAAATA GCTCAATATG GTCCAAGACT ACTTATAATA
TTCGATATAT ACGAGAACAA TGCCTATGAA ATAGAGCAAG AACTTAAGAG AAACAACAAG
GACCTCAACT TTATAACCCT AATAGGCTCT GTAAGAGACT ACAAGAGAGT GGAGAAGGTA
TTTAAGACCT ATAAACCAGA CATAGTATTT CACGCAGCAG CCCACAAGCA CGTACCGTTG
ATGGAAGTAA GTCCAGTGGA AGCTATCAAG AACAATGTCA GAGGAACATA CAATGTCGCC
CTACTTTCCC TAATCTATGA CGTTCAAAGA TTTGTTCTCA TATCAACAGA CAAGGCAGTC
AACCCAACAT CAGTTATGGG AGCAACCAAG AGAGTTTGTG AGAAGATAAT CCAAGGAATA
AATGACATAA GAGATAGCAA AGAATATAAT AATCTTGCAA AGGTAATTGT CCAAGATGGG
GACAGAAATA TTACAATTAA TCCTGAAGAC TTGCTAGAAG GGAAAAATCC AGGAACCGAA
TTTGTTGCAG TTCGTTTTGG TAATGTATTG GGATCGAATG GTTCTGTAAT ACCACTATTT
AAAAAGCAAA TCGCAGCGGG TGGACCTGTT ACAGTTACCC ACCCAGAGAT AATCAGATAC
TTTATGACAA TTAAGGAAGC TGTAAAGTTA GTCCTTCAAG CTGGATCCAT GGCCCAAGGC
GGTGAAATAT TCGTCCTAGA TATGGGAGAA CCAGTCAAGA TTGACGACCT GGCAAGACAG
CTAATAAGAC TATCAGGTTA TCAACCAGAC ATAGATATGC CTGTAGTCTA TACGGGACTT
AGACCTGGAG AGAAGCTCTA CGAGGAAAGA CTGATGGACG AGGAAGCCCT CACAGATACC
CACATTGAGG GAATATCTGT AGGCCGACCA CTAGACTTCT CTAGGGGAGA ATTTTTCGGG
AAACTTGATA AAGTAATCAA TCAAGAAAAT ATTGATGAAC TTGATATCAT AGCAACGATT
AATGAGTTAA TTACTACATT TATAGGAAAG GATAATTCGA AGGGAGAATA G
 
Protein sequence
MKNSKIIDKK KLILGFFDIL IINLAYFLAL YFRFDMNFRA IPLDFLDAFK TYAIFNTILT 
IAIYGLHKMY NMILEYASYK ELINITQAVI LSLISHAVLM TIFVYRMPIS YYIFGFMIQY
VLTVASRFIK KMMIHKSYKT DKKNQSYKRA LIVGAGSAGQ TLKRDISNSN NGGKTNILVV
GFIDDDPKKK NQYIDNSRIF GGRDMIKEIV EEEAIDVILV AIPSVEEVEK REILRICNET
GCEVKVLPGI YQLVSGKVTM STMKDIQIED LLGRDPVKIF SNETFDYLND KVVLVTGGGG
SIGSELCRQI AQYGPRLLII FDIYENNAYE IEQELKRNNK DLNFITLIGS VRDYKRVEKV
FKTYKPDIVF HAAAHKHVPL MEVSPVEAIK NNVRGTYNVA LLSLIYDVQR FVLISTDKAV
NPTSVMGATK RVCEKIIQGI NDIRDSKEYN NLAKVIVQDG DRNITINPED LLEGKNPGTE
FVAVRFGNVL GSNGSVIPLF KKQIAAGGPV TVTHPEIIRY FMTIKEAVKL VLQAGSMAQG
GEIFVLDMGE PVKIDDLARQ LIRLSGYQPD IDMPVVYTGL RPGEKLYEER LMDEEALTDT
HIEGISVGRP LDFSRGEFFG KLDKVINQEN IDELDIIATI NELITTFIGK DNSKGE