Gene GBAA_2083 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_2083 
Symbol 
ID2820087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp1950971 
End bp1952179 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content35% 
IMG OID637788955 
Productglycosyl transferase family protein 
Protein accessionYP_018725 
Protein GI47527376 
COG category[C] Energy production and conversion
[G] Carbohydrate transport and metabolism 
COG ID[COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase 
TIGRFAM ID[TIGR01426] glycosyltransferase, MGT family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.38657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAATG TACTCGTAAT AAATTTCCCT GGAGAAGGTC ATATAAATCC GACTTTAGCT 
ATTATAAGTG AGTTAATTCG GCGAGGGGAA ACAGTTGTTT CGTATTGTAT TGAAGATTAT
AGAAAGAAGA TTGAGGCAAC AGGTGCAGAA TTCCGAGAGT TTGAAAATTT TCTCTCCCAA
ATTAATATTA TGGAACGAGT AAATGAAGGT GGGAGTCCTT TGACGATGCT ATCTCATATG
ATTGAAGCAT CAGAGCGTAT TGTTACTCAA ATTGTAGAAG AAACAAAAGG AGAACAGTAC
GATTACTTAC TATACGATAA TCATTTTCCA GTAGGACGTA TTATAGCGAA TGTTTTACAA
TTACCTAGCA TTTCGTCTTG TACAACGTTT GCTTTTAATC AGTACATTAC TTTTAACGAT
GAACAAGAAT CGAGACAAGT AGATGAAACG AATCCGTTAT ATCAATCTTG TTTAGCGGGA
ATGGAAAAAT GGAATAGACA GTATGGAATG AAATGTAATA GTATGTACGA TATTATGAAT
CACCCTGGTG ATATTACCAT CGTATACACT TCAAAGGAAT ATCAACCACG TTCAGATGTA
TTCGATGAAT CGTATAAGTT TGTCGGTCCA TCAATTGCTA CTCGAAAAGA AGTAGGTAGC
TTTCCTATGG AAGATTTAAA AGGTGAAAAA TTGATTTTTA TTTCTATGGG AACAGTTTTT
AATGAACAAC CTGAGTTATA TGAAAAATGT TTTGAAGCGT TTAAAGGGGT AGAAGCGACA
GTCATATTAG CTGTTGGTAA GAAGATAAAT ATAAGTCAGT TTGAAAACAT TCCGAATAAC
TTTAAGTTGT ATAATTATGT GCCACAATTA GAAGTATTAC AGCATGCTGA TGTATTCGTG
ACACACGGTG GTATGAATAG TTCGAGTGAA GCACTATATT ACGGTGTCCC GTTAGTTGTA
ATTCCGGTAA CAGGAGATCA GCCTTTAGTT GCGAAACGAG TGAATGAAGT AGGGGCAGGA
ATAAGGCTTA ATCGTAAAGA ATTAACTTCT GAATTGTTAC GTGAGACTGT AAAGGAAGTA
ATGTATGATG TAACGTTTAA GGAAAATAGT CGTAAAGTTG GAGAGTCACT TCGAAATGCT
GGTGGATATA AAAGGGCAGT TGATGAAATA TTTAAAATGA AAATGAATTC GTACTTGAAA
CTTAAATAA
 
Protein sequence
MANVLVINFP GEGHINPTLA IISELIRRGE TVVSYCIEDY RKKIEATGAE FREFENFLSQ 
INIMERVNEG GSPLTMLSHM IEASERIVTQ IVEETKGEQY DYLLYDNHFP VGRIIANVLQ
LPSISSCTTF AFNQYITFND EQESRQVDET NPLYQSCLAG MEKWNRQYGM KCNSMYDIMN
HPGDITIVYT SKEYQPRSDV FDESYKFVGP SIATRKEVGS FPMEDLKGEK LIFISMGTVF
NEQPELYEKC FEAFKGVEAT VILAVGKKIN ISQFENIPNN FKLYNYVPQL EVLQHADVFV
THGGMNSSSE ALYYGVPLVV IPVTGDQPLV AKRVNEVGAG IRLNRKELTS ELLRETVKEV
MYDVTFKENS RKVGESLRNA GGYKRAVDEI FKMKMNSYLK LK