Gene GBAA_4093 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGBAA_4093 
Symbol 
ID2814700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. 'Ames Ancestor' 
KingdomBacteria 
Replicon accessionNC_007530 
Strand
Start bp3768279 
End bp3769448 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content36% 
IMG OID637790803 
Producthypothetical protein 
Protein accessionYP_020738 
Protein GI47529389 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGGATGGA AAGGTGCTGG GTATAACTTC ACTTCTTGGT TTGGAAGGAA GTTTTGGGGG 
ATTGATAATG CAAAGTTAGC TACAAATGAG ACGATTTTCA GTGTGATTAG TAGATTATCT
AATACGGTAG CATCTTTGCC ATTAAAGCTT TATAAAGATT ATGACACGGT TTTTAACCAA
GTGTCTGATG TTGTGATGAA TGAACCTAAT CCAAACATGA CCGGATTTGA ATGGATAAAT
AAAATTGAAG TTTCAAGAAA TGAGACTGGG AATGGCTATG CAGCTATCAT CCGTGACATT
CGATTTCAAG TGGAATCATT AATCCCTATT GAATCCGCTT ATGTAACGCC TTTTTTAAAC
ACGGATGATA ATAATTTGTG GTATGAGGTA CGTGGGATTG AAGGTACATA TTACATCCAC
AATATGAACA TGTTTCATGT CAAGCACATC ACAGGTATTT CAAGATGGAA AGGTATTTGT
CCAATTGATG TTTTGAGAAA TACTCTTGAA TATGATAAGG CAGTACAAGA ATTTAGTTTG
TCAGAAATGC AGAAGAAAGA TAGTTTTATT TTGGATTATG CAACACAGGT AGATAATGAT
AAGAGACAAA AAATCATTGA TGATTTTAGA AGATTTTATC AAGAGAACGG TGGTATTTTA
TTCAGGGAAC CAGGTGTGAA TATAGAAGAA ATGGAGCGGA AATACTTCGC TTCAGATACG
TTAGCATCAG AACGAATTAC TCGTTCGCGA GTTGCTAACG TTTTTAATGT TCCTGTTTCT
TTTTTAAATG ATACGGAAGG TCAGAGTTAT AGTAGCAATG AGCAACTAAT GATTCAGTTT
GTTCAAATGA CTTTAACTCC TATTGCTCGG CAGTATGAAC AAGAAATGAA CCGAAAATTG
CTAAATAAAG CTGAGAGACA AGCTGGATAT TATTTTAAAT TTAATATGAG TGGTTTACTA
CGTGGCGATA CAGCAGCAAG AACACAGTTT TATCAAATGA TGCTTCGAAG TGGCGGACTA
ACACCAGATG AGGTGCGTGA ATTAGAAGAT AAACCACCAA AGGGAGGTTC AGCATCTCAA
TTGTGGATTT CTGGTGATTT ATACCCAATT GATATGGACC CATCTCAACG AAAGGGGGTG
AAAAGTAGTG GGAAAGAACA AACAGAATAA
 
Protein sequence
MGWKGAGYNF TSWFGRKFWG IDNAKLATNE TIFSVISRLS NTVASLPLKL YKDYDTVFNQ 
VSDVVMNEPN PNMTGFEWIN KIEVSRNETG NGYAAIIRDI RFQVESLIPI ESAYVTPFLN
TDDNNLWYEV RGIEGTYYIH NMNMFHVKHI TGISRWKGIC PIDVLRNTLE YDKAVQEFSL
SEMQKKDSFI LDYATQVDND KRQKIIDDFR RFYQENGGIL FREPGVNIEE MERKYFASDT
LASERITRSR VANVFNVPVS FLNDTEGQSY SSNEQLMIQF VQMTLTPIAR QYEQEMNRKL
LNKAERQAGY YFKFNMSGLL RGDTAARTQF YQMMLRSGGL TPDEVRELED KPPKGGSASQ
LWISGDLYPI DMDPSQRKGV KSSGKEQTE