Gene BAS5228 details


Gene Information

Locus tag: BAS5228
Symbol:
ID: 2849276
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 5114854
End bp: 5116158
Gene Length: 1305 bp
Protein Length: 434 aa
Translation table: 11
GC content: 36%
IMG OID: 637508482
Product: HD domain-containing protein
Protein accession: YP_031466
Protein GI: 49188213
COG category: [R] General function prediction only
COG ID: [COG1078] HD superfamily phosphohydrolases
TIGRFAM ID:
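
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends with one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5114854, 5116158)
print(length)                           # 1305
print(expected_protein_length(length))  # 434
```

Under those assumptions the values agree with the listed gene length (1305 bp) and protein length (434 aa).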


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTATATT TAAACGACAA ACTCAGCGAA ACAAAAGTGT TTAAAGACCC GGTACATAAA 
TATGTGCACG TGCGCGATCG TGTTATTTGG GATTTAATCG GAACGAAAGA ATTTCAACGC
TTGCGCCGTA TTAAGCAGCT TGGAACGACA TTTTTTACAT TTCACGGTGC AGAGCATAGT
CGCTTTACTC ATTCGTTAGG TGTATATGAA ATTATTCGTC GTATGATTGA TGATGTGTTT
GATGGCAGAC CGAACTGGAA TGCTGAAGAT AGATTGTTAT GCTTATGTGC GGCATTACTT
CATGATGTCG GTCACGGCCC ATTTTCTCAT TCGTTTGAGA AAGTATTTTC ATTAGATCAT
GAGAAATTTA CGCAAAAGAT TATCGTTGGA GATACAGAAA TTAATCGCGT ATTAAGTCGT
GTGGATAAAG ACTTTCCGCA AAAGGTAGCG GATGTAATCG CAAAAACATC TAATAATAAA
TTAGCGATTA GCATGATTTC CAGTCAAATT GATGCAGATC GCATGGACTA CTTATTAAGA
GATGCGTATT TTACTGGCGT AAAGTATGGA AACTTTGATA TGGAACGTAT ACTGCGCGTT
ATGCGTCCGT ACGGAAATCA AGTAGTTATT AAAAATAGTG GTATGCATGC TGTTGAACAT
TATATTATGA GTCGTTATCA AATGTACTGG CAAGTATATT TCCATCCAGT AACACGCAGT
GCTGAAGTTA TTTTAACGAA GATTTTACAC CGTGCAAAAT CATTGCACGA GAAGTACTAT
ACATTTAAAA ATCATCCGGT TCATTTCTAT TCTTTATTTG AAGAAGAAGT AACAGTAGAG
GATTATTTAA AGTTAGACGA GAACGTTATG TATTATTACT TCCAAGTATG GCAAGACGAA
GAGGATCCAA TTTTAAGTGA TTTATGTCGC CGTTTTATGA ATCGAAACCT ATTTAAATAT
GTAGAGTTTA CAGATAAGCA CGGTTTAGAT AATTGGATGG AATTAAGTAG CTTATTCAAA
AAGATTGGAC TTGATCCAGA ATACTATTTA GTAGTTGATT CAACATCAGA CTTACCGTAC
GACTTTTACC GTGCTGGTGA AGAAGAAGAA CGTCTGCCAA TCTTACTTCT TATGCCAAAT
GGAGAGCTTA GAGAGCTTTC ACGTGAATCG GATATTGTTG AGGCGATTAC TGGTAAGAAG
AGAAGGGACC AGAAATTATT CTATCCACAT GATTTAATCT ATGAAGATGG AAGAAAAGGA
AAATATAAAG AGAGAATCAT CGAGTTACTC GAAGGAAAAA AATAA
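
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation. The following is a minimal sketch of how the GC content reported in the gene information could be recomputed; `clean_sequence` and `gc_percent` are hypothetical helper names, and how the source database rounds the value is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, copied from the record.
first_row = "GTGGTATATT TAAACGACAA ACTCAGCGAA ACAAAAGTGT TTAAAGACCC GGTACATAAA"
seq = clean_sequence(first_row)
print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this row only, not of the whole gene
```

Passing the full 1305 bp sequence through the same two functions gives a figure to compare against the listed GC content.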
 
Protein sequence
MVYLNDKLSE TKVFKDPVHK YVHVRDRVIW DLIGTKEFQR LRRIKQLGTT FFTFHGAEHS 
RFTHSLGVYE IIRRMIDDVF DGRPNWNAED RLLCLCAALL HDVGHGPFSH SFEKVFSLDH
EKFTQKIIVG DTEINRVLSR VDKDFPQKVA DVIAKTSNNK LAISMISSQI DADRMDYLLR
DAYFTGVKYG NFDMERILRV MRPYGNQVVI KNSGMHAVEH YIMSRYQMYW QVYFHPVTRS
AEVILTKILH RAKSLHEKYY TFKNHPVHFY SLFEEEVTVE DYLKLDENVM YYYFQVWQDE
EDPILSDLCR RFMNRNLFKY VEFTDKHGLD NWMELSSLFK KIGLDPEYYL VVDSTSDLPY
DFYRAGEEEE RLPILLLMPN GELRELSRES DIVEAITGKK RRDQKLFYPH DLIYEDGRKG
KYKERIIELL EGKK
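
The protein sequence starts with M even though the gene starts with GTG, which normally codes for valine. This is consistent with translation table 11, where GTG is one of the alternative start codons. The sketch below illustrates that; it assumes the NCBI definition of table 11 (same codon assignments as the standard code, with the start codons listed in the comment) and is not the database's own translation routine.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}
# Start codons of NCBI translation table 11 (assumption stated in the text above).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(raw: str) -> str:
    """Translate a CDS, reading a valid first codon as Met and stopping at a stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always read as Met
            continue
        aa = CODON_TABLE.get(codon, "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 45 bases of the gene sequence, copied from the record.
print(translate_table11("GTGGTATATT TAAACGACAA ACTCAGCGAA"))  # MVYLNDKLSE
```

The output matches the first block of the listed protein sequence.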