Gene GBAA_3895 details


Gene Information

Locus tag: GBAA_3895
Symbol:
ID: 2815040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. 'Ames Ancestor'
Kingdom: Bacteria
Replicon accession: NC_007530
Strand:
Start bp: 3563866
End bp: 3565785
Gene Length: 1920 bp
Protein Length: 639 aa
Translation table: 11
GC content: 33%
IMG OID: 637790613
Product: sulfatase
Protein accession: YP_020532
Protein GI: 47529183
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
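
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are purely illustrative.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3563866, 3565785)
print(length)                     # 1920
print(protein_length_aa(length))  # 639
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.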


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.60461
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATA AAATCAATTT ACAAATGCAA AATATAAGCT TTGTTTTAAT AATTGCTTTA 
GCACTATGGT TAAAAACATA TCTTATTACA AGATTCAGTT TTGATTTAAA AATTGAATCT
TCTACACAAG AGCTTATTTT GTTTATTAGC CCGCTCGCTG CATCATTAGC TTTTGTCGGA
TTAGCATTAT TTGCAACCGG TGAAAAAAGA AATTATATCG CGCTATGTAT TAATTTTTTA
TTAACAATCA TACTTGTTGG AAATGTAATG TTCTATGATT TTTATAGTGA TTTCGTTACG
TTACCAGTAC TTGGACAAAC ATCAAACTTT GGCCAACTAG GCGGTAGTAT TATAGAAATA
TTGAACTACA AAATTATACT CGCATTCGTA GACATTATTT TTTTCTTTAT TTTATTAAAG
AAGAAAGCAC TAGTCTTTCA AACAGGACGT GTAACGCATC CGGCACGCTT CGTATATTTC
CTCTTAACGG TTGGTGTATT TTTTGCAAAT CTACACCTTG CAGAAAAAGA ACGACCTGAA
CTATTAACGA GATCGTTCGA CCGGGTCATG CTTGTTAAAA ATTTAGGACT ATACACGCAT
CAAGTTTATG ATTTAACACT GCAAGTAAAA GCAGGATCAC AAAAGGCACT TGCTGATAGT
AGCAAATTAC AAGAAACAGA AAACTACGTA AAGGCAAACC AAAGTGAACC GAATCCTAAT
ATGTTTGGTG CAGCGAAAGG AAAAAACGTA ATTGTCGTTA CTCTTGAATC CTTGCAAACC
TTTTTAATAG GCGCAAAAGT TAACGGAGAA GAAGTTACAC CATTTTTGAA TGAATTCATA
AATGAAAGCT ATTACTTTGA TAACTTTTTC CACCAAACTG GGCAAGGAAA AACATCCGAT
TCTGAATTTC TAATCGATAC GTCGTTATAT CCATTAAATC GAGGAGCTGT ATTTTTCACA
CACGGTAACA ATGATTATAC TGCGACGCCA GAAATTTTAC GTCAGCAAGG TTATTTCACT
TCTGTATTCC ATGCGAATAA TGCAACATTT TGGAATCGTA ACATTATGTA CTCTGCTCTC
GGTTATGATC GTTACTATAA TGAACTTGAT TACAAAATTA CGCCTGAAAC AAATTTAAAT
TGGGGCTTAA AAGATATCGA GTACTTTGAT CAATCAGTAG ATATATTAAA AACTGTTGAT
CAACCATTTT ACGCTCGTTT CCTTACTTTA ACAAACCATT ATCCATTTAC GTATGATGAA
GATACTAAAT TCATTGAACC ATATGACTCT GGTAATGGCG TGTTTGATCG TTATATCGTA
ACAGCACGTT ACTTAGACGA ATCCATTAAA AAATTTATTG AGCGTTTAAA AGCTGAGGGA
ATGTATGATG ATTCTATTAT TGTTTTATAT GGTGATCATT ATGGCATTTC TGAAAAACAT
AATCGTGCAA TGGCACAGTT TTTAGAAAAA GATCAAATAA CCGAATTTGA TACTCTGAAT
TTACAACGCA CACCTTTATA TATTCATATG CCTGGCCAAA CAGAAGGTGA AACGATTTCA
AAGCCTACAG GACAAATTGA TATAAAACCT ACTATTCTAA ATTTACTTGG GATTGATACT
ACGAACGATA TTCGTTTTGG TCATGATATG TTTTCAAATG AATATACTGG TTTTGTTGTT
TTACGTGATG GTAGCTTTAT TACAGACAAG TATGCATACA AAAACAATAT TTTCTATGAC
CGCATAACAG GTGAAATTGT AGATTTACCA AAAAATGAAG CACAAGCACT CATTAAGCGT
GCGCAAAATG AATTACGAAT GTCTGACAAA ATTATTGAAG GCGATTTATT ACGTTTCTCA
GAAAGTAATA AGATTAAAAC TGGTGAAGTA CAAACTAAAA TTAAAGAAAC TGAAAAATAA
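
The gene sequence is displayed in blocks of 10 bases, so the spaces and line breaks need to be removed before any computation. As a minimal sketch, the helper names below (`clean_sequence`, `gc_fraction`) are hypothetical and not part of any database tooling; the example input is only the first line of the listing, not the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only (60 bases).
first_line = "ATGAAAAATA AAATCAATTT ACAAATGCAA AATATAAGCT TTGTTTTAAT AATTGCTTTA"
seq = clean_sequence(first_line)
print(len(seq), round(100 * gc_fraction(seq), 1))  # 60 16.7
```

Passing the full 1920 bp listing to the same functions gives a value that can be compared with the GC content field of the record.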
 
Protein sequence
MKNKINLQMQ NISFVLIIAL ALWLKTYLIT RFSFDLKIES STQELILFIS PLAASLAFVG 
LALFATGEKR NYIALCINFL LTIILVGNVM FYDFYSDFVT LPVLGQTSNF GQLGGSIIEI
LNYKIILAFV DIIFFFILLK KKALVFQTGR VTHPARFVYF LLTVGVFFAN LHLAEKERPE
LLTRSFDRVM LVKNLGLYTH QVYDLTLQVK AGSQKALADS SKLQETENYV KANQSEPNPN
MFGAAKGKNV IVVTLESLQT FLIGAKVNGE EVTPFLNEFI NESYYFDNFF HQTGQGKTSD
SEFLIDTSLY PLNRGAVFFT HGNNDYTATP EILRQQGYFT SVFHANNATF WNRNIMYSAL
GYDRYYNELD YKITPETNLN WGLKDIEYFD QSVDILKTVD QPFYARFLTL TNHYPFTYDE
DTKFIEPYDS GNGVFDRYIV TARYLDESIK KFIERLKAEG MYDDSIIVLY GDHYGISEKH
NRAMAQFLEK DQITEFDTLN LQRTPLYIHM PGQTEGETIS KPTGQIDIKP TILNLLGIDT
TNDIRFGHDM FSNEYTGFVV LRDGSFITDK YAYKNNIFYD RITGEIVDLP KNEAQALIKR
AQNELRMSDK IIEGDLLRFS ESNKIKTGEV QTKIKETEK
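
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the amino-acid string that NCBI publishes for translation table 11, whose codon assignments are the same as the standard code. It is an illustration under simplifying assumptions: it stops at the first stop codon and does not treat alternative start codons specially, which does not matter here because the gene begins with ATG. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string (display spacing allowed) up to the first stop."""
    seq = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (20 codons).
first_line = "ATGAAAAATA AAATCAATTT ACAAATGCAA AATATAAGCT TTGTTTTAAT AATTGCTTTA"
print(translate(first_line))  # MKNKINLQMQNISFVLIIAL
```

The output matches the first 20 residues of the protein sequence listed above.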