Gene BAS4623 details


Gene Information

Locus tag: BAS4623
Symbol:
ID: 2851333
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus anthracis str. Sterne
Kingdom: Bacteria
Replicon accession: NC_005945
Strand:
Start bp: 4518446
End bp: 4520389
Gene Length: 1944 bp
Protein Length: 647 aa
Translation table: 11
GC content: 52%
IMG OID: 637507859
Product: triple helix repeat-containing collagen
Protein accession: YP_030869
Protein GI: 49187616
COG category:
COG ID:
TIGRFAM ID:
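
The length fields above are consistent with the coordinates if Start bp and End bp are read as 1-based inclusive positions (an assumption; the record does not state the convention). Under that reading the gene length is End - Start + 1, and the protein length is the codon count minus the stop codon. A minimal Python sketch of that arithmetic, with illustrative function names that are not part of the database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4518446, 4520389)
print(length_bp)                  # 1944
print(protein_length(length_bp))  # 647
```

Both results match the Gene Length and Protein Length fields of this record.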


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTGAAAG TAGTGGAAGG AAATGGTGGT AAATCCAAAA TAAAAAGTCC ATTAAATTCT 
AATTTCAAGA TATTGTCAGA TCTAGTTGGC CCTACTTTTC CTCCAGTTCC AACTGGAATG
ACAGGGATAA CGGGAAGTAC GGGAGCAACG GGAAACACGG GTCCAACGGG AGAAACGGGA
GCAACGGGAA GCGCCGGGAT AACAGGAAGT ACGGGTCCAA CGGGAAACAC GGGAGGAACA
GGAAGCACAG GTCCAACGGG AAACACGGGA GCAACAGGAA GTACTGGGGT AACAGGAAGC
ACCGGGGTAA CAGGAAGTAC GGGAGTAACA GGAAGTACTG GGGTAACAGG AAGTACGGGT
CCAACAGGAG AAACGGGAGG AACAGGAAGT ACTGGGGTAA CAGGAAGTAC AGGGGCAACA
GGAAGCACCG GGGTAACAGG AAATACGGGT CCCACAGGAA GTACCGGAGC AACGGGAAAC
ACAGGTTCAA TAGGAGAAAC GGGAGGAACA GGAAGTATGG GTCCAACAGG AGAAACGGGA
GTGACAGGAA GTACGGGAGG AACAGGAAGC ACCGGGGTAA CTGGAAACAC GGGTCCAACG
GGAAGCACCG GAGTGACGGG AAGCACGGGA GTGACGGGAA GCACAGGTCC AACGGGAAGC
ACGGGAGTGA CGGGAAGCAC AGGTCCAACA GGAAGTACTG GGGTAACAGG AAGCACCGGG
GTAACAGGAA ACATGGGGCC AACGGGAAGC ACCGGGGTAA CAGGAAATAC GGGATCGACA
GGAACCACGG GAGCAACCGG AGAAACAGGT CCAATGGGAA GTACGGGAGC AACGGGAACC
ACGGGTCCGA CAGGAGAAAC GGGAGAAACG GGAGAAACGG GAGGAACAGG AAGCACAGGT
CCAACGGGAA ACACGGGAGC AACAGGAAGT ACTGGGGTAA CAGGAAGCAC CGGGGTAACA
GGAAGTACGG GAGTAACAGG AGAAACAGGT CCAACGGGAA GTACGGGAGC AACGGGAAAC
ACGGGTCCAA CAGGAGAAAC GGGAGGAACA GGAAGTACAG GGGCAACAGG AAGCACTGGG
GTAACAGGAA ATACGGGTCC AACAGGAAGC ACCGGGGTAA CCGGAAATAC GGGAGCAACA
GGAGAAACAG GTCCAACAGG AAATACGGGA GCGACGGGAA ATACCGGTCC AACAGGAGAA
ACGGGAGTGA CAGGAAGTAC GGGTCCAACA GGAGAAACGG GAGTGACAGG AAGTACGGGT
CCAACAGGAA ACACGGGAGC AACAGGAGAA ACGGGAGCAA CAGGAAGTAC TGGGGTAACA
GGAAACACGG GTTCAACAGG AGAAACAGGT CCAACGGGAA GTACGGGTCC AACAGGAAGC
ACCGGAGCAA CGGGAGTGAC GGGAAACACA GGTCCAACCG GAAGCACCGG AGCAACGGGA
GCAACAGGAA GCACAGGTCC GACCGGCAGC ACCGGAACAA CAGGAAATAC GGGAGTAACA
GGAGATACCG GTCCAACAGG AGCGACCGGG GTTAGTACAA CTGCAACGTA CGCGTTTGCG
AATAATACAT CAGGAAGTGT TATTTCTGTT TTGTTAGGTG GCACGAATAT TCCGTTACCA
AACAATCAAA ATATTGGACC GGGAATAACT GTTAGTGGTG GGAATACTGT ATTTACAGTT
GCGAATGCAG GGAATTATTA TATAGCCTAT ACAATTAATT TAACAGCAGG CTTACTTGTA
AGTTCCCGTA TAACTGTAAA TGGCAGTCCG CTTGCGGGAA CGATAAACTC CCCGACAGTG
GCTACTGGTT CATTTAGTGC AACAATAATT GCTAGCTTGC CTGCTGGAGC TGCCGTTAGC
TTACAACTAT TTGGAGTAGT TGCGTTGGCT ACATTATCTA CGGCAACGCC AGGAGCTACT
TTAACGATTA TTAGATTGAG TTAA
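
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be removed before computing anything from it. The sketch below shows one way to do that and to compute a GC percentage in Python. The helper names are hypothetical, and how the database derived and rounded its reported GC content value is not stated in the record, so this is only an approximation of that calculation.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, as printed in the record.
first_blocks = clean_sequence("GTGGTGAAAG TAGTGGAAGG")
print(len(first_blocks))         # 20
print(gc_percent(first_blocks))  # 50.0 for these 20 bases only
```

To evaluate the whole gene, pass the full text of the sequence block through `clean_sequence` first.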
 
Protein sequence
MVKVVEGNGG KSKIKSPLNS NFKILSDLVG PTFPPVPTGM TGITGSTGAT GNTGPTGETG 
ATGSAGITGS TGPTGNTGGT GSTGPTGNTG ATGSTGVTGS TGVTGSTGVT GSTGVTGSTG
PTGETGGTGS TGVTGSTGAT GSTGVTGNTG PTGSTGATGN TGSIGETGGT GSMGPTGETG
VTGSTGGTGS TGVTGNTGPT GSTGVTGSTG VTGSTGPTGS TGVTGSTGPT GSTGVTGSTG
VTGNMGPTGS TGVTGNTGST GTTGATGETG PMGSTGATGT TGPTGETGET GETGGTGSTG
PTGNTGATGS TGVTGSTGVT GSTGVTGETG PTGSTGATGN TGPTGETGGT GSTGATGSTG
VTGNTGPTGS TGVTGNTGAT GETGPTGNTG ATGNTGPTGE TGVTGSTGPT GETGVTGSTG
PTGNTGATGE TGATGSTGVT GNTGSTGETG PTGSTGPTGS TGATGVTGNT GPTGSTGATG
ATGSTGPTGS TGTTGNTGVT GDTGPTGATG VSTTATYAFA NNTSGSVISV LLGGTNIPLP
NNQNIGPGIT VSGGNTVFTV ANAGNYYIAY TINLTAGLLV SSRITVNGSP LAGTINSPTV
ATGSFSATII ASLPAGAAVS LQLFGVVALA TLSTATPGAT LTIIRLS
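
The protein begins with M although the gene begins with GTG. Under translation table 11, GTG is an accepted initiation codon and is read as methionine at the start position, while the amino acid assignments for internal codons are the same as in the standard code. The Python sketch below illustrates this on fragments of the sequence. It is a simplified illustration, not the database's own pipeline: the function name is hypothetical, and it assumes the first codon is a valid start codon without checking.

```python
# Standard amino acid assignments in NCBI order (bases T, C, A, G).
# Table 11 uses the same assignments for internal codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(text: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiation codon, so it is
    written as M whatever it encodes internally (GTG would be V).
    """
    seq = "".join(text.split()).upper()
    residues = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    if residues:
        residues[0] = "M"
    return "".join(residues)


# First three blocks of the gene sequence from this record.
print(translate_cds("GTGGTGAAAG TAGTGGAAGG AAATGGTGGT"))  # MVKVVEGNGG
```

The output matches the first ten residues of the protein sequence above.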