Gene BAS3441 details

Gene Information

Locus tag: BAS3441
Symbol:
ID: 2851388
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 3413453
End bp: 3415111
Gene Length: 1659 bp
Protein Length: 552 aa
Translation table: 11
GC content: 42%
IMG OID: 637506684
Product: urocanate hydratase
Protein accession: YP_029697
Protein GI: 49186445
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2987] Urocanate hydratase
TIGRFAM ID: [TIGR01228] urocanate hydratase
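
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `gene_length` and `protein_length` are hypothetical, and it assumes that the start and end coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon. Under those assumptions it reproduces the listed 1659 bp and 552 aa.

```python
# Values copied from the "Start bp" and "End bp" fields of this record.
START_BP = 3413453
END_BP = 3415111


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # prints "1659 552"
```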


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAAAAG TTAAACAAAC AATTCGCGCG CCAAGAGGTA CTGAGTTACA AACGAAAGGG 
TGGGTGCAAG AAGCTGCACT TCGTATGTTA ATGAACAATT TAGATCCTGA AGTTGCTGAA
AAACCAGAAG AATTAGTTGT ATATGGCGGA ATTGGCCGTG CAGCTCGTAA CTGGGAAAGC
TATCAGGCGA TTGTAGATTC ATTAAAAACG TTAGAAAGCG ATGAAACTTT ACTTGTTCAA
TCAGGAAAAC CAGTTGCTAT TTTTAAATCA CATGAAGATG CGCCTCGCGT TCTTTTAGCG
AACTCAAACT TAGTACCGAA GTGGGCGAAC TGGGATCACT TCCGTGAACT AGAGAAAAAA
GGTCTTATGA TGTACGGACA AATGACGGCA GGTAGCTGGA TTTACATCGG AACACAAGGT
ATTTTACAAG GAACTTATGA AACGTTTGGT GAAGCGGCGC GTCAACATTT CGGTGGTTCA
TTAAAAGGCA CATTAACACT TACTGCTGGT TTAGGTGGTA TGGGTGGTGC ACAACCTCTT
GCTGTAACGA TGAACGGCGG TGTTGTTATT GCTATTGATG TTGATAAGCG CAGCATCGAT
CGTCGTATTG AAAAGAGATA CTGTGATATG TATACAGAAT CATTAGAAGA AGCGTTAGCG
GTTGCGAACG AGTATAAAGA GAAGAAAGAA CCGATTTCTA TTGGTTTATT AGGAAATGCG
GCAGAAATTT TACCAGAACT AGTGAAGCGC AATATTACGC CAGACTTGGT TACAGATCAA
ACATCTGCTC ATGATCCATT AAACGGTTAT ATTCCAGTAG GCTACACGTT AGAAGAAGCA
GCAAAACTTC GTGAAGAAGA TCCAGAACGC TACGTACAAT TATCAAAAGA AAGCATGACA
AAACATGTGG AAGCAATGCT TGCTATGCAA GAAAAAGGCG CAATTACATT TGATTACGGA
AATAACATTC GCCAAGTTGC TTTTGATGAA GGTTTGAAAA ATGCATTCGA TTTCCCAGGA
TTCGTTCCAG CATTTATCCG TCCATTATTC TGCGAAGGAA AAGGACCATT CCGCTGGGTA
GCACTTTCTG GTGACCCAGA AGATATTTAT AAAACAGACG AAGTAATTTT ACGTGAGTTC
GCGGATAATG AGCATTTATG TAACTGGATT CGTATGGCTC GTCAGCAAGT TGAATTCCAA
GGCCTTCCAT CACGTATTTG TTGGCTTGGT TACGGTGAGC GTGCGAAATT TGGCCGCATC
ATTAATGAAA TGGTTGCAAA TGGTGAATTA TCAGCACCAA TCGTTATCGG TCGTGACCAT
TTAGATTGCG GATCAGTAGC ATCTCCAAAC CGTGAAACAG AAGCGATGAA AGACGGTAGT
GATTCAGTAG CTGACTGGCC AATCTTAAAT GCATTAATTA ATAGTGTAAA CGGTGCAAGC
TGGGTATCTG TTCACCACGG TGGTGGCGTT GGTATGGGTT ATTCACTTCA TGCAGGAATG
GTTATCGTTG CAGATGGAAC AGAAGCAGCA GCAAAACGTA TTGAGCGCGT ATTAACTTCT
GACCCTGGTA TGGGTGTTGT TCGTCACGTT GATGCAGGAT ATGACTTAGC TGTGGAAACT
GCGAAAGAAA AAGGCGTTAA CATTCCAATG ATGAAATAA
 
Protein sequence
MEKVKQTIRA PRGTELQTKG WVQEAALRML MNNLDPEVAE KPEELVVYGG IGRAARNWES 
YQAIVDSLKT LESDETLLVQ SGKPVAIFKS HEDAPRVLLA NSNLVPKWAN WDHFRELEKK
GLMMYGQMTA GSWIYIGTQG ILQGTYETFG EAARQHFGGS LKGTLTLTAG LGGMGGAQPL
AVTMNGGVVI AIDVDKRSID RRIEKRYCDM YTESLEEALA VANEYKEKKE PISIGLLGNA
AEILPELVKR NITPDLVTDQ TSAHDPLNGY IPVGYTLEEA AKLREEDPER YVQLSKESMT
KHVEAMLAMQ EKGAITFDYG NNIRQVAFDE GLKNAFDFPG FVPAFIRPLF CEGKGPFRWV
ALSGDPEDIY KTDEVILREF ADNEHLCNWI RMARQQVEFQ GLPSRICWLG YGERAKFGRI
INEMVANGEL SAPIVIGRDH LDCGSVASPN RETEAMKDGS DSVADWPILN ALINSVNGAS
WVSVHHGGGV GMGYSLHAGM VIVADGTEAA AKRIERVLTS DPGMGVVRHV DAGYDLAVET
AKEKGVNIPM MK
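
The gene and protein sequences above are related by translation table 11, and the GC content field is a property of the gene sequence. The following is a minimal sketch of both calculations, not the database's own method. The function names `clean`, `gc_percent`, and `translate` are hypothetical. It assumes the sequence is given in the coding orientation, begins with an ATG start codon, and contains only A, C, G, and T. The amino acid assignments of table 11 are the same as those of the standard code; the tables differ in which alternative start codons they allow, and this sketch does not handle those.

```python
# Codon table built in the NCBI ordering (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Drop the spaces and line breaks used in the listing; uppercase the rest."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed in this record.
first_line = "ATGGAAAAAG TTAAACAAAC AATTCGCGCG CCAAGAGGTA CTGAGTTACA AACGAAAGGG"
print(translate(first_line))  # prints "MEKVKQTIRAPRGTELQTKG"
```

The translated fragment matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `gc_percent` gives a value to compare against the GC content field.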