Gene BAS5016 details

Gene Information

Locus tag: BAS5016
Symbol:
ID: 2852664
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4888476
End bp: 4890452
Gene Length: 1977 bp
Protein Length: 658 aa
Translation table: 11
GC content: 41%
IMG OID: 637508271
Product: excinuclease ABC subunit B
Protein accession: YP_031255
Protein GI: 49188002
COG category: [L] Replication, recombination and repair
COG ID: [COG0556] Helicase subunit of the DNA excision repair complex
TIGRFAM ID: [TIGR00631] excinuclease ABC, B subunit
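
The length fields above follow from the coordinates. As a minimal sketch, assuming Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon, the gene and protein lengths can be re-derived as follows. The function name `cds_lengths` is hypothetical and is not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is a stop codon and contributes no amino acid.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(4888476, 4890452))  # (1977, 658)
```

Under those assumptions the result agrees with the listed Gene Length (1977 bp) and Protein Length (658 aa).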


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGAACGTC AATTTGAAAT TGTCTCAGCG TATTCCCCGC AAGGTGATCA GCCGGTAGCT 
ATAGAGAAGC TTGTAGAGGG AATTAATAGT GGAAAGAAAA AGCAAGTGTT GCTTGGGGCG
ACAGGAACGG GTAAGACATT TACGATTTCA AATGTCATTA AAGAAGTGCA AAAGCCAACG
CTTGTCATGG CTCACAATAA AACGTTAGCA GGACAGTTAT ATAGTGAGTT GAAAGACTTT
TTCCCGAATA ATGCAGTTGA ATATTTTGTT AGTTATTACG ATTATTATCA GCCAGAAGCG
TATGTGCCAC AAACAGATAC GTTTATTGAA AAAGACGCGC AGATTAATGA TGAAATCGAT
AAATTGCGTC ACTCAGCAAC GTCCGCATTA TTTGAACGGG ATGATGTAAT TATTGTTGCG
AGTGTTTCGT GTATATATGG TTTAGGTTCT CCAGAAGAAT ACCGCGAGTT AGTTGTTTCA
CTTCGAGTTG GTATGGAAAA GGACCGCAAT CAATTGCTTC GTGAACTTGT TGATGTGCAG
TATGGACGTA ATGATATTGA TTTCAAGCGT GGTACATTCC GCGTGCGCGG AGATGTAGTT
GAAATCTTCC CGGCATCACT TGACGAGCAT TGCATTCGAA TTGAGTTTTT TGGCGATGAA
ATTGATCGTA TTCGCGAAGT AAATGCTTTA ACGGGAGAAG TATTAGCAGA ACGTGATCAT
GTAGCAATCT TCCCAGCATC TCACTTCGTT ACACGTGAAG AAAAGATGAA GGTCGCTATT
GAAAATATCG AAAAAGAATT AGAAGAGCGT TTAAAGGAAT TAAATGATAA CGGTAAGTTG
TTAGAAGCGC AGCGTATAGA ACAGCGTACA CGTTATGATT TAGAAATGAT GCGCGAGATG
GGCTTTTGTT CAGGGATTGA AAACTATTCC CGTCATTTAA CACTTCGTCC AGCGGGTGCA
ACGCCGTATA CGTTATTAGA CTATTTCCCG AAAGATTTCT TAATCGTTAT GGATGAGTCC
CACGTATCAG TGCCGCAAGT AAGAGCGATG TATAACGGGG ACCAAGCGCG TAAACAAGTG
CTTGTGGATC ATGGATTCCG TCTGCCATCA GCTTTAGATA ATAGACCGCT CACATTTGAT
GAGTTTGAAG AGAAAACGAA TCAAGTTATT TACGTTTCAG CAACGCCAGG ACCGTATGAA
TTAGAGCAGT CGCCAGAAGT AATAGAACAA ATTATTCGTC CAACAGGGCT TTTAGATCCG
CCAATTGATA TACGACCAAT TGAAGGGCAG ATTGACGATC TATTAGGAGA GATTCAAGAT
CGCATTGCAA AAAATGAACG TGTATTAATT ACAACTTTAA CGAAGAAGAT GTCAGAGGAT
TTAACAGACT ACTTAAAAGA TGTAGGAATT AAGGTGAATT ATCTGCATTC TGAAGTGAAA
ACGTTAGAAC GTATTGAAAT TATACGAGAT CTTCGCCTTG GTAAGTTTGA TGTTCTCGTT
GGTATTAACT TATTGCGAGA AGGATTAGAT ATTCCAGAAG TATCCCTTGT AGCTATTTTA
GATGCCGATA AGGAAGGATT CTTGCGTTCA GAGCGTTCGT TAATTCAAAC AATTGGCCGT
GCAGCACGTA ATGAAAACGG TCGCGTTATT ATGTACGCAG ATCGTATAAC GAGATCGATG
GGGATTGCGA TTGAAGAGAC GAAGCGTCGT CGTAGTATAC AAGAAGCTTA CAATGAAGAG
CATGGTATTA CGCCGAAAAC GATTCAAAAA GGTGTGCGTG ATGTAATCCG TGCAACGACA
GCTGCTGAAG AGCCGGAAAC ATATGAAGCG ACGCCAGCTA AGAAGATGAC GAAAAAAGAG
CGTGAAAAGA CAATTGCGAA GATGGAAGCA GAAATGAAAG AAGCAGCAAA AGCATTAGAC
TTCGAGCGTG CAGCTGAATT AAGAGATTTA CTATTAGAAT TAAAAGCGGA AGGGTGA
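
The GC content reported under Gene Information can be recomputed from the gene sequence above. The sketch below assumes the sequence is pasted in as a string with the same space-separated 10-base blocks. The helper name `gc_percent` is hypothetical, and the example call uses only the first 20 bases of the sequence, not the whole gene.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence above.
print(gc_percent("TTGGAACGTC AATTTGAAAT"))  # 30.0
```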
 
Protein sequence
MERQFEIVSA YSPQGDQPVA IEKLVEGINS GKKKQVLLGA TGTGKTFTIS NVIKEVQKPT 
LVMAHNKTLA GQLYSELKDF FPNNAVEYFV SYYDYYQPEA YVPQTDTFIE KDAQINDEID
KLRHSATSAL FERDDVIIVA SVSCIYGLGS PEEYRELVVS LRVGMEKDRN QLLRELVDVQ
YGRNDIDFKR GTFRVRGDVV EIFPASLDEH CIRIEFFGDE IDRIREVNAL TGEVLAERDH
VAIFPASHFV TREEKMKVAI ENIEKELEER LKELNDNGKL LEAQRIEQRT RYDLEMMREM
GFCSGIENYS RHLTLRPAGA TPYTLLDYFP KDFLIVMDES HVSVPQVRAM YNGDQARKQV
LVDHGFRLPS ALDNRPLTFD EFEEKTNQVI YVSATPGPYE LEQSPEVIEQ IIRPTGLLDP
PIDIRPIEGQ IDDLLGEIQD RIAKNERVLI TTLTKKMSED LTDYLKDVGI KVNYLHSEVK
TLERIEIIRD LRLGKFDVLV GINLLREGLD IPEVSLVAIL DADKEGFLRS ERSLIQTIGR
AARNENGRVI MYADRITRSM GIAIEETKRR RSIQEAYNEE HGITPKTIQK GVRDVIRATT
AAEEPETYEA TPAKKMTKKE REKTIAKMEA EMKEAAKALD FERAAELRDL LLELKAEG
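
The protein sequence corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). The gene starts with TTG, which normally encodes leucine but is read as methionine when it serves as the start codon. A minimal sketch of that translation follows; the function name `translate_cds` is hypothetical, and `START_CODONS` lists only three commonly used alternative starts as a simplifying assumption, not the full set defined for table 11.

```python
from itertools import product

# Standard amino-acid assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a reduced set of start codons, enough for this record.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a recognized first codon as M and stopping at '*'."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # initiator codon is translated as methionine
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("TTGGAACGTC AATTTGAAAT TGTCTCAGCG"))  # MERQFEIVSA
```

Passing the full gene sequence to the same function is one way to check it against the protein sequence listed above.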