Gene GBAA_4612 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_4612 
Symbol 
ID2816249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp4190306 
End bp4191361 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content35% 
IMG OID637791303 
Producthypothetical protein 
Protein accessionYP_021259 
Protein GI47529910 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000312577 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAGA AGCGTAGACG CATTTTTTTA TTTTCGATTA TTGCACTGCT TTTAGTTTGT 
GGTTCAGTCT ATGCGTATAT TTCATCCGCA TTAGGACCAG TTGATACCGG GAATAAAAAA
GAGATTGAAG TAGAAATTCC AAAGGGATCA TCTACTAGTA AAATTGGTGA GATTTTAGAA
GAAAAAGGTG CTGTGAAAAA CGGTACAGTT TTTAGTTTTT ATACAAAGGC TAAATCTAAA
AATTTACAAG CGGGTACATA TTTATTAAAT CCTTCAATGA GTGCGAAAGA TGTTATGGAG
CAAATGTCAT CTGGTAATGT ACATCGTCCA GCTCTTTATA AAGTGACGAT AAAAGAAGGA
GCACAAGTAA CTGAAATTGC AGAAACGGTT GCAAACGAAT TAAAGTGGAA TAAAGATGAT
GTCGTACGTC AATTAAACGA TAAAGCATTT ATTCAAAAAA TGCAGCAAAA GTATCCGAAG
TTGTTAACCG ATAAAATCTT TGATAGCAAT ATTAAATATC CGTTAGAAGG TTATTTATAT
CCTGCGACGT ACTCTTTCTA TAAAAAAGAT ACGACGTTAG AAGAAGTTGT AATTCCAATG
CTTGAAAAAA CGAATGCAAT CATTGTTCAA AACGAGGCAA AAATGAAAGC GAAAAACTGG
GATGTTCACC AGCTTTTAAC ATTGTCTTCA CTTATTGAAG AAGAGGCAAC AGGCTTTACA
GATCGTCAAA AGATCTCTAG TGTCTTTTAT AATCGTTTAG CAAAAGGCAT GCCACTGCAA
ACTGATCCGA CGGTATTATA TGCACTTGGA AAGCATAAAC AACTTGTGTT ATACGAAGAT
TTAAAGGTTA ACTCACCATA CAATACGTAT GTGGTGAAAG GATTGCCTGT CGGTCCGATT
GCAAACTCTG GCAAACATTC AGTGGAAGCG GCGTTAGAAC CCGCGCAAAC AGATTATTAT
TATTTCTTAG CTGCACCAAC TGGTGAAGTG TATTATGCGA AAACATTGGA AGAGCATAAT
GCATTAAAGC AAAAATATAT TACGAAAAAG CAGTGA
 
Protein sequence
MKKKRRRIFL FSIIALLLVC GSVYAYISSA LGPVDTGNKK EIEVEIPKGS STSKIGEILE 
EKGAVKNGTV FSFYTKAKSK NLQAGTYLLN PSMSAKDVME QMSSGNVHRP ALYKVTIKEG
AQVTEIAETV ANELKWNKDD VVRQLNDKAF IQKMQQKYPK LLTDKIFDSN IKYPLEGYLY
PATYSFYKKD TTLEEVVIPM LEKTNAIIVQ NEAKMKAKNW DVHQLLTLSS LIEEEATGFT
DRQKISSVFY NRLAKGMPLQ TDPTVLYALG KHKQLVLYED LKVNSPYNTY VVKGLPVGPI
ANSGKHSVEA ALEPAQTDYY YFLAAPTGEV YYAKTLEEHN ALKQKYITKK Q