Gene GBAA_pXO1_0199 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_pXO1_0199 
Symbol 
ID2820241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007322 
Strand
Start bp166698 
End bp167747 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content33% 
IMG OID637682862 
Producthypothetical protein 
Protein accessionYP_022460 
Protein GI47566508 
COG category[S] Function unknown 
COG ID[COG2357] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value1.3269600000000002e-32 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAATTAA ATCAGCAAGA GTTTTTTGAA ACATATAACA TAGATACACA AGAGTTTGAG 
ACTGCAGAAA TTAGTTGGAA TAGTTTATTA GAGATACATA GTGATTACTT ATCCTATAAA
GAAACATTAA TTCCTACAGC AGAACATTTA TCAATGATGT TGCGTACTCA CCCAGCATCT
CATACTGTTA GGTCTAGGGT TAAAGATGCT GGGCACTTAA TTGATAAAAT TATTAGAAAA
ACTATTAGAG AAAAGGAGAA AAATCCTGAT TATTACATCG ACGTTAATAA TTATAAATCA
GAGATTACAG ATTTAATTGG AATTAGAGTT CTGCACCTTT ACAAAGACCA AGCAGCTCCT
ATTGATAAAT TTATCCGTGA TACTTGGGAT TTAAGGGAAA AATGTACCAT CTACTACCGT
CAAGGTGATT ACTCAAAACA AGAAGAACCT AAAAATAATG ATTTATTTAA TTTCAAGGTA
CATCCATTTG GCTATCGTTC ATGGCACTAT TTAATCAGTT CGCAAGCAAC AAAAAACGTT
CACATTGCAG AAATTCAGGT AAGAACAATT TTTGAAGAAG GTTGGAGTGA AATTGACCAT
CAGCTAAGAT ATCCTAATAA TATGAACGAC GTCCAGCTAA CTAAGCAGCT ATTAGTTTTA
AATAGAGTTG CTGGAAGTGC TGACGAAATG GCGACTGTAA TCAGAGAATT AGTTGCTGAA
AATAATATAA AACAAAAATC TATTGATGAA TTAAAAACAC AGCTTGACAC ATTGATGAAG
GAAAATAATA TTGAAAAAGC TGTTAAAGAA AAATTCCAGA AGAAAGTTGA AGAGCTTCAA
GATCAACTGG CACTTAAATT ACCAAATAAT AGATGGTTTA ATACACCTAT ATATTTGGGA
GAAGGACCAA CGCCCGCTTC ATCCCCCCTA CTTAAAGCCG GCGAACAATT TGTTATTGAT
TTTAAAGCAG ATGAGCCATA TGACATTAAG CGATATTTAA GTGGTTATAC ACATCCTGCA
CTCATTAAAA AAGCAGATCC ACAGAAATAA
 
Protein sequence
MQLNQQEFFE TYNIDTQEFE TAEISWNSLL EIHSDYLSYK ETLIPTAEHL SMMLRTHPAS 
HTVRSRVKDA GHLIDKIIRK TIREKEKNPD YYIDVNNYKS EITDLIGIRV LHLYKDQAAP
IDKFIRDTWD LREKCTIYYR QGDYSKQEEP KNNDLFNFKV HPFGYRSWHY LISSQATKNV
HIAEIQVRTI FEEGWSEIDH QLRYPNNMND VQLTKQLLVL NRVAGSADEM ATVIRELVAE
NNIKQKSIDE LKTQLDTLMK ENNIEKAVKE KFQKKVEELQ DQLALKLPNN RWFNTPIYLG
EGPTPASSPL LKAGEQFVID FKADEPYDIK RYLSGYTHPA LIKKADPQK