Gene BAS0943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS0943 
Symbol 
ID2852605 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp997476 
End bp998774 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content40% 
IMG OID637504203 
ProductDNA repair exonuclease family protein 
Protein accessionYP_027217 
Protein GI49183965 
COG category[L] Replication, recombination and repair 
COG ID[COG0420] DNA repair exonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACACTA GATTTACTAT TTGTAGTAAA GTACGGAAAA AGAAAGGATC GTTATTTGTG 
AAACAAGTGA AGTTTATACA TGCGGCTGAT TTGCATTTGG ATAGTCCGTT TAAAGGAATG
GAGATGAATG TACCGCAGTC TGTTTGGGAG AGAATGAAGC AGAGTACGTT TGAATCGTTC
GAACGTATTA TTGATAAAGC GATTCAAGAG CGCGTTGATT TCGTATTGCT AGCCGGGGAT
TTGTATGATG CGGAGACGAG AAGTTTGAGG GCGCAAGTGT TTGTGCGCGA GCAAATGAAG
AGACTTTCGC AGTACGATAT CCCTGTTTTT ATTATTCACG GTAACCACGA TCATTTAGGG
GGAAGCTGGG CAGCAATTGA GTTTCCGGAA AATGTTCATG TGTTTACAGA GCCTTACGTA
GAAGAGAAAT CATTTTATAA AAATGGTGAG TTATTAGCTT CTATTTACGG ATTTAGTTAT
TTGCAGCAAG CGGTAACGGA TAATATGACA GCGCAATATA CGAAAATGAG TGATGCGCCT
TTTCATATTG GCATGCTTCA CGGAAGTGTG GAAGGCGATG CAGAGCATAA TCGCTATGCA
CCGTTTCAAA TTCGTGAGCT GAAAGAAAAG CAGTTTGATT ATTGGGCTCT TGGCCATATA
CATAAACGTG AAATTTTATT AGAAGAGCCA TACATCATTT ATCCAGGTAA TATACAAGGA
CGTCATCGTA AGGAAACGGG CGAGAAGGGT GCATACCTAA TTGAACTTAC GAAACAAGGA
TCGCACTGTT CCTTTTTCCA TACGGCGGAT GTTGTGTGGG ATGAGATAGA AGTGAATATT
GATGGACTTG AAACTGTTGA TGAACTTATG ACAAGTGTGT CAACTGCGAT GAATGAGTGC
CGAAGAGAAG AAGAAGGTAC GCAATTAACT GTCGTATTTA CAGGACAAGG GCCACTTTCT
CCTTATTTAC GTGATGAAAA GCGCGTAGAA GAGATTTTTC ATATTTTAGC AGCTGGTGAA
GAGCGAAAAG ATTTCGTATA TACGATGAAG TGGAAAAATG AGACGGTTTC TTTTGCAGAA
ATCGAGCGTT TGAAAGAAGA AAATCATTTC GTCGGTAGTG TGCTGAAGGA GTTAGAAGCT
TTCACTAATA TGGACGGCGT GTTGCGCAGT ATTTGGACAT CTCCTATAGC GCGTAATAGT
ATTGAATCTT TTACAGAAGA AGAGAAGAAA GAGATTCAAA AGGAAGCGGA AAATATTATT
TTAGAACAAT TATTCCAGCA AGAGAGGGAT AAGAAATGA
 
Protein sequence
MDTRFTICSK VRKKKGSLFV KQVKFIHAAD LHLDSPFKGM EMNVPQSVWE RMKQSTFESF 
ERIIDKAIQE RVDFVLLAGD LYDAETRSLR AQVFVREQMK RLSQYDIPVF IIHGNHDHLG
GSWAAIEFPE NVHVFTEPYV EEKSFYKNGE LLASIYGFSY LQQAVTDNMT AQYTKMSDAP
FHIGMLHGSV EGDAEHNRYA PFQIRELKEK QFDYWALGHI HKREILLEEP YIIYPGNIQG
RHRKETGEKG AYLIELTKQG SHCSFFHTAD VVWDEIEVNI DGLETVDELM TSVSTAMNEC
RREEEGTQLT VVFTGQGPLS PYLRDEKRVE EIFHILAAGE ERKDFVYTMK WKNETVSFAE
IERLKEENHF VGSVLKELEA FTNMDGVLRS IWTSPIARNS IESFTEEEKK EIQKEAENII
LEQLFQQERD KK