Gene BAS3498 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS3498 
Symbol 
ID2851084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp3472101 
End bp3473351 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content38% 
IMG OID637506740 
Producthypothetical protein 
Protein accessionYP_029753 
Protein GI49186501 
COG category[S] Function unknown 
COG ID[COG5280] Phage-related minor tail protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGCAG GAGGAAGAAT TAAAGGAATT AGTATTTCAA TTGATGGTGA AACCACGGGA 
CTTCAAAATG CATTAAAAGA TGTTAATAAG CGTAGTAATG ATTTAACCAA AGAGCTTAAA
GATGTTGAGC GATTATTAAA ATTTGATCCA GGTAATATTG AAGCTTTAGC CCAAAAGCAA
CAGTTACTGA CTCAGCAAAT TGAAAACACC ACACAAAAGT TAGATAAATT AAAGGCAGCT
GAGCAACAAG TCCAAGCACA ATTCCAAAAC GGAAAAATTT CCGAAGAACA ATACCGCGCA
TTCAGGCGTG AAATTGAATT TACAGAAGGA TCGCTTAATG GCCTGAAGAA TAAGCTTGGA
AACATGAAGG CTGAACAAGA TAGTGTAGCA AGTTCAACAA GACAATTAGA AACATTGTTT
AGCGCTACTG GAAAAAGTGT TGATGATTTC GCGGGGGCAT TAGGAAATCG TCTTGTGAAT
GCAATTCGAA GTGGAACGGC TACCAGTAAG CAGTTAGATC AAGCAATTGG AATTATCGGA
CGAGAAGCAT TAGGAACAGA AGCCGATATT GAAAAGTTAC AACGTGCACT TCGATCTGTA
GATGCTGGTA ATACGATTCA ACAAGTACAA AACGAATTAA GAGACTTACA ACAGGAAGCT
GGCAAAACAG AGAAAAAGTT TGAAGGATTA AAAATAGGAT TAGAAAATGT TATAGGTGGA
TTGGCAGCTG GTGGCGGTAT TGCAACCGCT ATTGAAAAAG CGATGGATAT GTCAAGTCTA
CAAACAAAAA TTGATATCAC ATTTGATGTT CCTGAGTCTT CGAAAAAATC AGTGGAAGAA
GCTATTAGGG GCGTTAGTAC TTATGGTATT GACGCTGAAG AAGCATTAGA AGGTGTTCGC
AGACAATGGG CTTTAAATAA GGATGCTTCT GATGAAACAA ATGCCGCTGT GGTTAAAGGG
GCAGCGACTA TTGCAGCATC TTACGCTGGA ATTGATTTTA ATGAACTTAT ACAAGAAACC
AATGAGATTG GTGCAACGTT AGGTATTACT AACGAGGAAG CATTGGGGTT AGTTAATACA
TTATTAAAAA CAGGATTTCC ACCAGAACAA TTGGATATTA TCGCTGAATA TGGGGATCAG
ATGGTTCAAG CTGGATTTTC AGCGAAAGAA GTCCAAGGAA TCCTGTCTGC AGGAGTCGAC
ACTAAAAGTT GGAATATCGA TAACCTTTTG GATGAAAAAT TGTCCCTATG A
 
Protein sequence
MMAGGRIKGI SISIDGETTG LQNALKDVNK RSNDLTKELK DVERLLKFDP GNIEALAQKQ 
QLLTQQIENT TQKLDKLKAA EQQVQAQFQN GKISEEQYRA FRREIEFTEG SLNGLKNKLG
NMKAEQDSVA SSTRQLETLF SATGKSVDDF AGALGNRLVN AIRSGTATSK QLDQAIGIIG
REALGTEADI EKLQRALRSV DAGNTIQQVQ NELRDLQQEA GKTEKKFEGL KIGLENVIGG
LAAGGGIATA IEKAMDMSSL QTKIDITFDV PESSKKSVEE AIRGVSTYGI DAEEALEGVR
RQWALNKDAS DETNAAVVKG AATIAASYAG IDFNELIQET NEIGATLGIT NEEALGLVNT
LLKTGFPPEQ LDIIAEYGDQ MVQAGFSAKE VQGILSAGVD TKSWNIDNLL DEKLSL