Gene Apre_1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1601 
Symbol 
ID8398413 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1741757 
End bp1742827 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content41% 
IMG OID644995965 
Productmannonate dehydratase 
Protein accessionYP_003153343 
Protein GI257067087 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1312] D-mannonate dehydratase 
TIGRFAM ID[TIGR00695] mannonate dehydratase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.475603 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATGA CATTTAGACA CTACGGAAAT GACGATCCAA TCTCACTAGA ATATATCGCA 
CAAATTCCTG GAGTTACAGG AGTTATGGTT ATGATGAACG AATGGGAAGC AGGAGAAGTT
TGGGAGAAGG ATGTTTTCCA AGAATACGTT GATAAGTGCC ACGCAGTAGG CCTTGATTGT
GAAATCATCG AATCAATCAA CGTCCACGAA GATATTAAGA TGGGTCTTCC AACAAGAGAT
AAATATATCG AAAACTACAA AGAGTCCCTA AGAAACGTTG CGGCTTGCGG TGTAAAGACA
GTAATCTACA ACTTCATGCC AGTATTTGAC TGGGTTAAGA CAGAATTATA CAAGGAGCTT
CCTGATGGAT CTAATACCCT TGCCTTTGAC CAAGCCAAGG TAGAAGGCCT TTCTCCAAGA
GATATGGTAA ATGAAATCCT CGACGGAGCA GGAAACTTCG AACTACCAGG CTGGGAGCCT
GAAAGACTCT CTCAACTAGA GGATGTTCTT GAAAAATACA AGGATATTGA CGAAGACAAA
TTAAGAGAAA ACTACAAATA CTTCCTTGAA GCAATAATCC CAACATGTGA AGAAGTAGGA
ATCAAGATGG CAGTTCACCC AGACGATCCA GCTTGGCCAA TCTTCGATAT CCCAAGAATC
ACATCAACTC CAGAAGATCT AGAAAAAATT GTAAACCTAG TAGACTCTCC ATCAAATACC
CTATGTATTT GTACAGGATC ATTGGGATCT AGAGTTGAAA ATGACGTAGC TAAAATAATC
GGAGACTTCG CTAAAAGAGG CAAAATAGGA GCGATTCACG CTAGAAACAT CAAGTTTACC
GGCGAGAAAC AATTCTACGA ATCAGCTCAC CTTTCTAAGT GCGGTTCATT AGATATGTAC
GCTATAATGA AAGCTCTATA CGATGCTGAT TTCGACGGCT ACCTAAGACC AGACCACGGA
AGAATGATCT GGGGCGAAGA AGGAAGAGCA GGCTATGGAC TCTACGACAG AGCCCTAGGA
GTTGCCTACC TCAACGGTCT ATGGGAAGCT ATAGATAAAA ATAACAAATA G
 
Protein sequence
MKMTFRHYGN DDPISLEYIA QIPGVTGVMV MMNEWEAGEV WEKDVFQEYV DKCHAVGLDC 
EIIESINVHE DIKMGLPTRD KYIENYKESL RNVAACGVKT VIYNFMPVFD WVKTELYKEL
PDGSNTLAFD QAKVEGLSPR DMVNEILDGA GNFELPGWEP ERLSQLEDVL EKYKDIDEDK
LRENYKYFLE AIIPTCEEVG IKMAVHPDDP AWPIFDIPRI TSTPEDLEKI VNLVDSPSNT
LCICTGSLGS RVENDVAKII GDFAKRGKIG AIHARNIKFT GEKQFYESAH LSKCGSLDMY
AIMKALYDAD FDGYLRPDHG RMIWGEEGRA GYGLYDRALG VAYLNGLWEA IDKNNK