Gene GBAA_2049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_2049 
Symbol 
ID2817661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp1921661 
End bp1923040 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content36% 
IMG OID637788923 
Productpolysaccharide biosynthesis family protein 
Protein accessionYP_018688 
Protein GI47527339 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.686608 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGGGT CACCGTTTAT ACGTGGAACG ATATTTTTAA CGATGGCAAC GATGATATCT 
AAAATGTTAG GGTTTATATA TGTTATACCA TTTACAGCTA TGGTCGGTAC GAGTGGGTAT
GTTTTATACA CGTATGCATA TCGTCCATAT ACAATTATGC TCAGCATTGC GACAATGGGA
CTACCACTTG CGGTTTCAAA GATGGTATCA AAATATGATC AATTAAATGA TTATCATACG
GTTAAGAGAG TGTTGAAAAG TGGAATTGTG TTTATGTTTA TAATGGGAGT TATTTCGTGT
TTTACGTTAT ATATGCTAGC TCCGCATTTG GCTAAACTTG TAATCGATGG AAATGATCAA
ACGGGGAATA GTGTAGCGGC GGTCACTACT AATATTCAAA TTGTAAGTTT TGCGTTAATA
CTTGTACCAG TAATGAGTTT ATTAAGGGGC TTTTTTCAAG GGTTTCAATC GATGGGGCCT
TCTGCTCTAA GTGTAGTTGT AGAGCAATTT TTTCGGGTAT TAACCATCTT GATAGGAAGC
TTTGTCGTTT TATATGTTTT AAAAGCTTCA GTCTCACTAG CTGTTGGTAT TTCAACGTTT
GGTGCTTTTA TGGGAGCAAT AGGTGGATTA ACTGTTTTAA GTGCGTATTA CATAAGGAGG
AGAAAGCACT TAAAGAAAAA AGAAATGGCG AGTATACCGC AAACAACGAA ATCTTTTTTC
TCACTATATA AGGAGCTCTT CACATATTCA ATACCATTTG TGGTAGTTGG TTTAGCAATT
CCGTTGTATC AGACGATTGA CACATTTACA ATTAATAAAT TGCTTATACA AATAGGATAT
ATGCAAGGAG AAGCGGAGAA GATTAATGCA ATAATTGGAC TTGTTCAGAT GGTTGTACTT
ATCCCAGTTT CCGTTGCGAC TGCTTTTAGT ATGTCACTTG TACCTGAGAT GACAAAAGCC
TATACAGCAG GAAATGTGAA GTTACTGTAT AAGCATTTTA CGAGGACGAA TCTATTAGTA
GTAGGGATTA CGGTGCCAGC GGCAATTGGA ATGATGGTGT TAGCAAAACC AGTGTATACT
CTTTTATTTG GTGCCGGAAA TGATCCGGAG ATGGGAAGAG TTATTTTACA GTATTACGCT
CCGGCTTGTA TACTATTTTC GCTTTTTACA GTAACGGCTG CTATGTTGCA AGGAATTAAT
CAACAACAGA AAACAGTGCT AGGGTTAGTG ATTGGCATTA TTGTGAAAAT CGTTTTAAAT
ATTGTATTGC TTCCGTATTT TGATTATGTA AGTTTTATTA TTTCAACATA CGCTGGTTAT
ACGATTTCAG TTGGCTTTAA CTTGTGGATG CTTTCTAAAT ATGTTATAAA GGCAACATAA
 
Protein sequence
MKGSPFIRGT IFLTMATMIS KMLGFIYVIP FTAMVGTSGY VLYTYAYRPY TIMLSIATMG 
LPLAVSKMVS KYDQLNDYHT VKRVLKSGIV FMFIMGVISC FTLYMLAPHL AKLVIDGNDQ
TGNSVAAVTT NIQIVSFALI LVPVMSLLRG FFQGFQSMGP SALSVVVEQF FRVLTILIGS
FVVLYVLKAS VSLAVGISTF GAFMGAIGGL TVLSAYYIRR RKHLKKKEMA SIPQTTKSFF
SLYKELFTYS IPFVVVGLAI PLYQTIDTFT INKLLIQIGY MQGEAEKINA IIGLVQMVVL
IPVSVATAFS MSLVPEMTKA YTAGNVKLLY KHFTRTNLLV VGITVPAAIG MMVLAKPVYT
LLFGAGNDPE MGRVILQYYA PACILFSLFT VTAAMLQGIN QQQKTVLGLV IGIIVKIVLN
IVLLPYFDYV SFIISTYAGY TISVGFNLWM LSKYVIKAT