Gene BAS4109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS4109 
Symbol 
ID2848537 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp4036920 
End bp4037945 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content38% 
IMG OID637507346 
Productspore photoproduct lyase 
Protein accessionYP_030359 
Protein GI49187107 
COG category[L] Replication, recombination and repair 
COG ID[COG1533] DNA repair photolyase 
TIGRFAM ID[TIGR00620] spore photoproduct lyase 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAT TTATGCCAAA ACTCGTTTAC TTTGAACCGA AGGCACTTGA ATATCCACTC 
GGAAAAGAGC TGTATGAGAA GTTTACGAAG ATGGGACTAG AGATTCGTGA AACGACATTC
CATAATCAAA TTCGAAATTT GCCGGGTGAA AATGATTTGC AAAAGTATCG TAATGCAAAA
GCAACGCTTG TCGTTGGGGT GAGGAAGACA TTAAAGTTTG ATACGTCAAA ACCGTCAGCT
GAATATGCAA TTCCGCTTGC AACAGGATGT ATGGGACATT GTCATTATTG TTACTTGCAA
ACGACACTTG GGAGTAAGCC TTACGTTCGC GTGTATGTGA ATCTTGATGA AATATTTGAG
AAGGCACAGC AATATATGGA TGAAAGAGCA CCTGAAATAA CAAGGTTTGA AGCTGCTTGT
ACATCAGATA TCGTTGGGAT TGATCATTTA ACACATGCAT TAAAGCGCGC GATCGAATTC
ATTGGAGAAA GTGAGCATGG GCGTTTACGT TTCGTTACGA AATATTCGCA CGTTGATCAT
TTATTAGATG CAAAACATAA CGGGAAAACT CGTTTCAGGT TTAGTATTAA TTCACGTTAT
GTAATTAAAA ATTTTGAACC AGGGACATCA CCGTTTGAAG AAAGAATTGA AGCGGCTCGT
AAAGTAGCAG GCGCGGGTTA TCCACTTGGA TTTATAGTGG CGCCGCTTTA TATGCATGAA
GGATGGGAAG AAGGATATCG TGAACTATTT GAGCGATTGT ACAATGCATT AAACGATTTG
TCGATACCGA ATTTAACATT TGAATTAATT CAACATCGCT TTACAAAGCC AGCAAAAAAG
GTTATTCAAG AGCGTTATCC GAATACGAAG CTTGAAATGG ATGAAGAGAA GCGTAAATAT
AAATGGGGAC GATATGGCAT TGGGAAATAC GTATATAAAA AAGATGATGC GGAAGTATTG
GAAGAAACGA TAAGAGGTTA TATATATGAG TTTTTTCCTG ATGCAGAAAT ACAATACTTT
ACTTAA
 
Protein sequence
MKPFMPKLVY FEPKALEYPL GKELYEKFTK MGLEIRETTF HNQIRNLPGE NDLQKYRNAK 
ATLVVGVRKT LKFDTSKPSA EYAIPLATGC MGHCHYCYLQ TTLGSKPYVR VYVNLDEIFE
KAQQYMDERA PEITRFEAAC TSDIVGIDHL THALKRAIEF IGESEHGRLR FVTKYSHVDH
LLDAKHNGKT RFRFSINSRY VIKNFEPGTS PFEERIEAAR KVAGAGYPLG FIVAPLYMHE
GWEEGYRELF ERLYNALNDL SIPNLTFELI QHRFTKPAKK VIQERYPNTK LEMDEEKRKY
KWGRYGIGKY VYKKDDAEVL EETIRGYIYE FFPDAEIQYF T