Gene BAS4280 details

Gene Information

Locus tag: BAS4280
Symbol:
ID: 2853165
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4190678
End bp: 4191748
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 35%
IMG OID: 637507516
Product: hypothetical protein
Protein accession: YP_030528
Protein GI: 49187276
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000498286
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGGTAGAGA ATCAAGTGAA AAAGAAGCGT AGACGCATTT TTTTATTTTC GATTATTGCA 
CTGCTTTTAG TTTGTGGTTC AGTCTATGCG TATATTTCAT CCGCATTAGG ACCAGTTGAT
ACCGGGAATA AAAAAGAGAT TGAAGTAGAA ATTCCAAAGG GATCATCTAC TAGTAAAATT
GGTGAGATTT TAGAAGAAAA AGGTGCTGTG AAAAACGGTA CAGTTTTTAG TTTTTATACA
AAGGCTAAAT CTAAAAATTT ACAAGCGGGT ACATATTTAT TAAATCCTTC AATGAGTGCG
AAAGATGTTA TGGAGCAAAT GTCATCTGGT AATGTACATC GTCCAGCTCT TTATAAAGTG
ACGATAAAAG AAGGAGCACA AGTAACTGAA ATTGCAGAAA CGGTTGCAAA CGAATTAAAG
TGGAATAAAG ATGATGTCGT ACGTCAATTA AACGATAAAG CATTTATTCA AAAAATGCAG
CAAAAGTATC CGAAGTTGTT AACCGATAAA ATCTTTGATA GCAATATTAA ATATCCGTTA
GAAGGTTATT TATATCCTGC GACGTACTCT TTCTATAAAA AAGATACGAC GTTAGAAGAA
GTTGTAATTC CAATGCTTGA AAAAACGAAT GCAATCATTG TTCAAAACGA GGCAAAAATG
AAAGCGAAAA ACTGGGATGT TCACCAGCTT TTAACATTGT CTTCACTTAT TGAAGAAGAG
GCAACAGGCT TTACAGATCG TCAAAAGATC TCTAGTGTCT TTTATAATCG TTTAGCAAAA
GGCATGCCAC TGCAAACTGA TCCGACGGTA TTATATGCAC TTGGAAAGCA TAAACAACTT
GTGTTATACG AAGATTTAAA GGTTAACTCA CCATACAATA CGTATGTGGT GAAAGGATTG
CCTGTCGGTC CGATTGCAAA CTCTGGCAAA CATTCAGTGG AAGCGGCGTT AGAACCCGCG
CAAACAGATT ATTATTATTT CTTAGCTGCA CCAACTGGTG AAGTGTATTA TGCGAAAACA
TTGGAAGAGC ATAATGCATT AAAGCAAAAA TATATTACGA AAAAGCAGTG A
 
Protein sequence
MVENQVKKKR RRIFLFSIIA LLLVCGSVYA YISSALGPVD TGNKKEIEVE IPKGSSTSKI 
GEILEEKGAV KNGTVFSFYT KAKSKNLQAG TYLLNPSMSA KDVMEQMSSG NVHRPALYKV
TIKEGAQVTE IAETVANELK WNKDDVVRQL NDKAFIQKMQ QKYPKLLTDK IFDSNIKYPL
EGYLYPATYS FYKKDTTLEE VVIPMLEKTN AIIVQNEAKM KAKNWDVHQL LTLSSLIEEE
ATGFTDRQKI SSVFYNRLAK GMPLQTDPTV LYALGKHKQL VLYEDLKVNS PYNTYVVKGL
PVGPIANSGK HSVEAALEPA QTDYYYFLAA PTGEVYYAKT LEEHNALKQK YITKKQ
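
Consistency check (illustrative)

The fields above are related by simple arithmetic: End bp - Start bp + 1 gives the gene length (4191748 - 4190678 + 1 = 1071), and 1071 bp is 357 codons, i.e. 356 amino acids plus one stop codon. The Python sketch below shows one way to check a record like this one. It is not part of the database record; the function names are hypothetical, it assumes the sequence is given 5' to 3' on the coding strand with only A/C/G/T, and it assumes that translation table 11 uses the standard amino-acid assignments with TTG, CTG, ATT, ATC, ATA, ATG and GTG accepted as start codons (a start codon is read as M).

```python
"""Sanity checks for a bacterial CDS record (illustrative sketch)."""

# Standard codon table built in TCAG order; translation table 11 is
# assumed to share these amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Start codons assumed for translation table 11.
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def span_length(start_bp, end_bp):
    """Length of a feature given 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(seq):
    """Fraction of G and C bases (expects a non-empty sequence)."""
    seq = clean_sequence(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate_cds(seq):
    """Translate a coding sequence up to the first stop codon.

    The first codon is read as M when it is one of the assumed
    table 11 start codons. Codons with bases other than A/C/G/T
    raise KeyError.
    """
    seq = clean_sequence(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        codon = seq[pos:pos + 3]
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        if pos == 0 and codon in START_CODONS_TABLE_11:
            amino_acid = "M"
        protein.append(amino_acid)
    return "".join(protein)
```

For example, translate_cds("TTGGTAGAGA ATCAAGTGAA AAAGAAGCGT") returns "MVENQVKKKR", which matches the first ten residues of the protein sequence listed above. Passing the full gene sequence to gc_fraction and translate_cds allows the GC content and Protein Length fields to be compared against the record in the same way.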