Gene GBAA_5470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_5470 
Symbol 
ID2819143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp4957491 
End bp4959419 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content37% 
IMG OID637792136 
Productsulfatase 
Protein accessionYP_022133 
Protein GI47530784 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000729601 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAAA CCTTGAAATC ACAATTTCAA AATGTGCGTT TCACTGTATT CGTAGCTTTA 
GCCGTATGGT TGAAGACATA TCTTATTACA CGCACAAGCT TTGATTTAAA ACTTGAATCT
TTCATGCAAG AATTCATTTT ATTCCTTAGC CCATTAGCAG CATCATTACT GCTTGTTAGT
CTTGCATTAT TTGCAAAAGG GAAAAAACGT AACTATATAG CACTTGGAAT TAATTTTGTT
TTAACAATTA TTCTTGTTGG TAACGTAATG TTCTACGGAT TCTATAATGA CTTCGTTACT
TTACCCGTAC TAGGACAAAC ATCTAACTTC GGAAGTTTAG GTTCTAGTGT GAAAGAATTA
TTTAACTACA AAATCATCCT TGCATTTGCT GATATTATCG TATTCTTCAT TTTATTGAAG
AAGATGAAGA ATTTTGCACC GACAGAACGT GTAGCACGCC CAATGCGTTC CCTATACTTC
GTGTCAACAA TTGCTATTTT CTTCGCAAAC TTAGGACTGG CAGAAGCTGA GCGTCCTGAA
CTATTAACAC GTTCATTCGA CCGCGTTATG CTCGTTAAAA ACTTAGGTTT ATATGTACAC
CAAGTGTATG ACCTTGGCTT ACAAGCAAAA TCAAGTTCAC AAAAAGCATT TGCTGACGGT
AGTAAGTTAC AGGAAACAGA GAACTACGTA AAAACAACGC AAAGCAAACC AGATCCAAAT
ATGTTTGGTA CTGCAAAAGG GAAAAACGTA ATTGTCGTCT CTCTTGAGTC ATTACAAACA
TTCTTAATTG GTGCAACAGT TAACGGACAA GAAGTTACAC CATTCTTAAA CCAATTTACG
AAAGAAAGTT ATTACTTCGA TAACTTCTTC CATCAAACTG GTCAAGGAAA AACATCTGAC
GCTGAATTCT TAGTAGATAC TTCCATGTAT CCACTAGACC GTGGTGCTGT ATTCTTCACA
CACGGTAACA ACGAATACAC AGCAACTCCA GAAATTTTAC GTGAGCAAGG ATATCACACA
TCTGTATTCC ACGCGAACAA TGCAACGTTC TGGAACCGTA ACATTATGTA TCCGGCACTT
GGTTATGACC GTTACTACAA CGAGCTTGAC TACAAGATTA CGCCAGAAAC AAAATTAAAT
TGGGGATTAA AAGATATCGA GTACTTCGAT CAATCTATCG ATATGTTAAA AGAAGTGAAG
CAACCGTTCT ACACTCGCTT CCTTACGTTA ACAAACCATT ACCCATTCAC TTATGATGAA
AGCACAAAAT TAATCGATGA ATACAATTCT GGTGATGGCG TATTTGACCG TTACATGGTA
ACTGCTCGCT ATTTAGACGA AGCAATGAAA CACTTTATTG AGCGTCTAAA AGCAGAGGGT
ATTTACGACA ACTCAATTAT CGTATTCTAC GGTGATCACT ACGGTATTTC TGAAAACCAT
AACCGTGCAA TGGCACAGTT CTTAGGAAAA GAAGAAATTA CTGCATTTGA CCATATGAAC
TTACAAAAAA CACCGATGTT TATTCACGTT CCAGGTCAAA AAGAAGGTAA AACAATTTCA
AAACCAACTG GTGAAATTGA CATTAAACCA ACAATTCTAA ACTTACTTGG TATAGATTCT
ACGAATCAAA TTCAATTTGG TCATGATGTA TTCTCACCAG AAAATAAAGG ATTTGTTGTT
CTTCGTGACG GTAGCTTCGT TACAGATAAG TACATGTATA CGAACAGTAC ATTCTACGAC
CGTGCTACTG GCGAAGTTGT ACAATTACCA AAAGAAGAAT CTCAACCACT CATTGATCGT
GCTCAAAATG AATTGAACAT GTCTGACAAA ATCATTGAAG GTGACTTACT TCGCTTCTCT
GAAAGCAACA AGACAAAAAC TGGTGAAGTA AAGACAGCTA TTAAAGAAGA AAAGAAGAGC
GCTGAGTAA
 
Protein sequence
MKETLKSQFQ NVRFTVFVAL AVWLKTYLIT RTSFDLKLES FMQEFILFLS PLAASLLLVS 
LALFAKGKKR NYIALGINFV LTIILVGNVM FYGFYNDFVT LPVLGQTSNF GSLGSSVKEL
FNYKIILAFA DIIVFFILLK KMKNFAPTER VARPMRSLYF VSTIAIFFAN LGLAEAERPE
LLTRSFDRVM LVKNLGLYVH QVYDLGLQAK SSSQKAFADG SKLQETENYV KTTQSKPDPN
MFGTAKGKNV IVVSLESLQT FLIGATVNGQ EVTPFLNQFT KESYYFDNFF HQTGQGKTSD
AEFLVDTSMY PLDRGAVFFT HGNNEYTATP EILREQGYHT SVFHANNATF WNRNIMYPAL
GYDRYYNELD YKITPETKLN WGLKDIEYFD QSIDMLKEVK QPFYTRFLTL TNHYPFTYDE
STKLIDEYNS GDGVFDRYMV TARYLDEAMK HFIERLKAEG IYDNSIIVFY GDHYGISENH
NRAMAQFLGK EEITAFDHMN LQKTPMFIHV PGQKEGKTIS KPTGEIDIKP TILNLLGIDS
TNQIQFGHDV FSPENKGFVV LRDGSFVTDK YMYTNSTFYD RATGEVVQLP KEESQPLIDR
AQNELNMSDK IIEGDLLRFS ESNKTKTGEV KTAIKEEKKS AE