Gene BAS1717 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1717 
Symbol 
ID2850931 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1735056 
End bp1736729 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content41% 
IMG OID637504969 
Productdihydroxy-acid dehydratase 
Protein accessionYP_027982 
Protein GI49184730 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG0129] Dihydroxyacid dehydratase/phosphogluconate dehydratase 
TIGRFAM ID[TIGR00110] dihydroxy-acid dehydratase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAAGTG ACATGATTAA AAAAGGTTTT GATAAAGCGC CGCATCGTAG TTTATTAAAA 
GCAACTGGTT TGAAAGATGA AGACTTTGAT AAACCGTTTA TAGCGATCTG TAATTCTTTT
ATTGAAATTA TTCCAGGTCA TAAGCACTTA AATGAGTTTG GGAAGCTTGT TAAAGAAGCA
GTACGTGCAG CAGGTATGGT TCCATTTGAA TTTAATACAA TTGGAGTAGA TGACGGTATT
GCGATGGGGC ATATCGGTAT GCGCTATTCG CTTCCGAGTC GAGAAATTAT TGCAGATTCA
GTAGAAACGG TTGTAAATGC CCATTGGTTT GATGGCATGA TTTGCATTCC AAACTGTGAC
AAAATCACAC CCGGTATGAT GATGGCTGCA CTTCGTATTA ACATTCCAAC TGTTTTTGTT
TCAGGTGGTC CGATGGCGGC TGGAAAAACA TCTAAAGGAG ACGTTGTTGA TTTAAGTTCT
GTTTTCGAAG GAGTAGGGGC TTATCAATCT GGGAAAATTT CAGAAGAAGA ATTAAAGGAT
ATTGAAGATC ATGGCTGTCC ATCTTGTGGT TCTTGTTCTG GTATGTTTAC AGCGAACTCT
ATGAACTGTT TATGTGAAGT GTTAGGTTTA GCTCTTCCTG GTAACGGAAG TATTTTGGCT
ATTGATCCAA GACGCGAAGA ATTAATTAAA CAAGCAGCAG AAAAATTAAA GATTTTAATT
GAAAGAGATA TTAAACCGAG AGACATTGTA ACGGAAGAAG CAATTGATGA TGCGTTCGCG
CTTGATATGG CAATGGGCGG TTCAACAAAT ACAGTGTTGC ATACATTGGC GCTCGCGCAA
GAGGCTGGAT TAGATTACGA TATGAACCGT ATTGATGCCG TTTCAAGACG TGTACCACAT
TTATGTAAAG TAAGCCCTGC TTCCAATTGG CATATGGAAG ACATTGATCG TGCAGGCGGG
ATTAGTGCAA TTTTGAAAGA GATGAGCCGA AAAGAAGGGG TACTTCATTT AGACCGTATT
ACTGCTACGG GGCAAACATT AAGAGAAAAT ATTGCTCATG CAGAGATTAA AGATAAGGAA
GTGATTCATT CTCTTGAAAA TCCTCATAGT GAAGAAGGTG GATTACGTAT ATTAAAAGGA
AACCTTGCGA AAGACGGAGC AGTTATTAAA AGCGGGGCAA CTGAAGTAAA ACGATTTGAA
GGACCTTGTG TTATTTTTAA TTCACAAGAT GAGGCGCTTG CCGGCATTAT GCTTGGGAAG
GTTAAGAAAG GAGATGTAGT TGTTATTCGT TATGAAGGAC CAAGAGGCGG TCCTGGTATG
CCGGAAATGT TAGCACCAAC GTCAGCGATT GCTGGCATGG GATTAGGTGC AGATGTTGCG
TTATTAACCG ATGGTCGTTT CTCTGGTGCT TCACGTGGTA TTTCAGTAGG TCATATTTCG
CCAGAAGCAG CTGCGGGCGG AACGATTGCA CTTCTTGAAC AAGGGGATAT CGTTTGTATC
GATGTTGAGG AAAGGTTGTT AGAAGTAAGA GTTAGTGACG AAGAATTAGG TAAGCGTAAA
AAAGAATGGA AACGACCAGA ACCGAAAGTG AAAACGGGCT GGCTTGGACG TTATGCACAA
ATGGTAACAT CGGCGAATAC AGGTGCAGTC CTAAAAATCC CGAATTTTGA TTGA
 
Protein sequence
MRSDMIKKGF DKAPHRSLLK ATGLKDEDFD KPFIAICNSF IEIIPGHKHL NEFGKLVKEA 
VRAAGMVPFE FNTIGVDDGI AMGHIGMRYS LPSREIIADS VETVVNAHWF DGMICIPNCD
KITPGMMMAA LRINIPTVFV SGGPMAAGKT SKGDVVDLSS VFEGVGAYQS GKISEEELKD
IEDHGCPSCG SCSGMFTANS MNCLCEVLGL ALPGNGSILA IDPRREELIK QAAEKLKILI
ERDIKPRDIV TEEAIDDAFA LDMAMGGSTN TVLHTLALAQ EAGLDYDMNR IDAVSRRVPH
LCKVSPASNW HMEDIDRAGG ISAILKEMSR KEGVLHLDRI TATGQTLREN IAHAEIKDKE
VIHSLENPHS EEGGLRILKG NLAKDGAVIK SGATEVKRFE GPCVIFNSQD EALAGIMLGK
VKKGDVVVIR YEGPRGGPGM PEMLAPTSAI AGMGLGADVA LLTDGRFSGA SRGISVGHIS
PEAAAGGTIA LLEQGDIVCI DVEERLLEVR VSDEELGKRK KEWKRPEPKV KTGWLGRYAQ
MVTSANTGAV LKIPNFD