Gene Apre_1600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1600 
Symbol 
ID8398412 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1740191 
End bp1741597 
Gene Length1407 bp 
Protein Length468 aa 
Translation table11 
GC content38% 
IMG OID644995964 
ProductGlucuronate isomerase 
Protein accessionYP_003153342 
Protein GI257067086 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1904] Glucuronate isomerase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTCA TGACAGAAGA CTTTATGCTC CATAATGACT TTGGGAAGAA ACTCTATCAC 
AATTATGCGG CTAAGATGCC AATATTTGAC TTCCACTGTC ACCTTGAAGC TAAGGAAATT
TATGAAAACA AAAATATTCC TTCAATAACT GAAGCTTGGC TCGGTGGAGA CCACTACAAG
TGGAGAGTTA TGAGAGCTTG TGGAGTTGAG GAAAAATATA TCACAGGTGA TGCAGAGGAT
TTCGAAAAGT TTGAAAAGTA TGCGGAAATC ATGCCAAATC TTATAGGAAA TCCAATATAT
CACTGGACTC ACCTAGAACT TAAGAACTTC TTTGGCATAG AAGAGTGTCT TAACAAGGAA
AATGCCAGAG AAATTTATGA TAAGTGTAAC GAACTTCTAG CAAAAGATGA ATTTAGACCA
AGAGGACTAA TTGAGATGAG TAATGTTGCT GCAGTTTGTA CAACCAATGA TCCTATAGAT
GATCTTAAAT ATCACGAGCT TCTGGCCAAA GATAACTTCA AAGTAAAAGT TCTTCCAGCC
TTTAGACCAG ATAATGCCCT ATACATCGAA AAAGATAGCT TCAAGAGCTA TATGGGAGAT
CTTTCTAAGG CTAGTGGAGT AGAGATTACA AACTTTACAG ATATCAAAAA GGCCCTAAAG
GCAAGAATGG ACTTCTTTAA CGAGCACGGA GCTAAGGCAA GCGATCAAGC CTTCAAATAC
ATCCCACATA GAAGATGTGA TGAAGAAATC CTAAATAAAA TAGTTCAAAA GAAACTAAAT
GGAGAAGACA TTAGCCTAGA AGAGGAAGAA GCCTACAAGA CAGAGCTTGT AATCTTCCTT
GCCAAAGAAT ACAAAAAGCT AGACTGGGCT ATGGAAATCC ATGTTGGAGT AATCAGAAAC
AATTCCAAGC TAATGTTTGA AAAGCTTGGA GCTGACGTTG GTGGAGATAG CCAAAACGAC
CTAAACTTCG CAGAAAATCT TGCGGACCTA CTTTCTGATT TTGAAGAAAA CGAAGGCCTT
CCAAGAACTG TAATATTCCC ACTTAATCCA AAAGACCAAT TCCCAATTGC TACAGTTGGA
GGATCCTTTA ACAGGGCTAA TCCTGATGGT ATGCAAAATA TCCAACTTGG AACAGCTTGG
TGGCATCTTG ACCACAAGGA TGGAATGATA GAGCAAATGA AAGTATTCTC AAGCGTTGGA
GTCCTCTCCA AATTCATAGG AATGCTAACA GACTCAAGAA GCTTCTTGTC CTACCCAAGA
CACGAATACT TCAGAAGAAT CCTCTGCAAC TTCATAGGAG AATTAGTAGA AAAAGGCGAA
TATCCAGCTG ACGAAGAATT CTTAGGAAAA GTTGTAGAAG ATATTTGTTA CAACAACGCA
AGAAAATACA TCAAAATAGA TTTGTAA
 
Protein sequence
MKFMTEDFML HNDFGKKLYH NYAAKMPIFD FHCHLEAKEI YENKNIPSIT EAWLGGDHYK 
WRVMRACGVE EKYITGDAED FEKFEKYAEI MPNLIGNPIY HWTHLELKNF FGIEECLNKE
NAREIYDKCN ELLAKDEFRP RGLIEMSNVA AVCTTNDPID DLKYHELLAK DNFKVKVLPA
FRPDNALYIE KDSFKSYMGD LSKASGVEIT NFTDIKKALK ARMDFFNEHG AKASDQAFKY
IPHRRCDEEI LNKIVQKKLN GEDISLEEEE AYKTELVIFL AKEYKKLDWA MEIHVGVIRN
NSKLMFEKLG ADVGGDSQND LNFAENLADL LSDFEENEGL PRTVIFPLNP KDQFPIATVG
GSFNRANPDG MQNIQLGTAW WHLDHKDGMI EQMKVFSSVG VLSKFIGMLT DSRSFLSYPR
HEYFRRILCN FIGELVEKGE YPADEEFLGK VVEDICYNNA RKYIKIDL