Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BAS0757 |
Symbol | |
ID | 2848737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | - |
Start bp | 811432 |
End bp | 812592 |
Gene Length | 1161 bp |
Protein Length | 386 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 637504019 |
Product | hypothetical protein |
Protein accession | YP_027033 |
Protein GI | 49183781 |
COG category | [S] Function unknown |
COG ID | [COG3584] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAT TTATGGGTAT AGCAACAGCA GCGGTTTTTG GTCTTGGGAT TTTCACAACA TCTGCTAAAG CAGAAACAAT CGTAACGACT GATGTACTAA ATGTACGAGA AAACCCAACT ACTGAATCAA AAGTTGTAGG AAAATTATTA GATGGATATA AAGTTAACGT TTTACATACA GAAAACGGAT GGTCAAAAGT GAAATTGAAT AGCGGTAAAG AAGCTTTCAT TAGCGCTGAC TACACAAAAG ACACTTACTA CGTAACAGCT AACGTATTAA ACGTACGTGC TGGTGCAAAC ACAGACTCAG AGATTCTTGG GAAATTGAAA CAAGATGACG TAATCGAAAC AACACACCAA GTTGAAAATG GTTGGATCCA ATTTGAATAT AACGGAAAAA CAGCTTACGT TCACGTTCCT TACTTAACAG GTAAAGCTCC AGTTAAAGTT CAACCAGTAG TTAAAGCTGA AAAAACAACT ACAGTTCAAG ATACAGCTAA AGCCGTGGCA ACAACTAAAG CTCGTGAAGT AGCTGAAACG CAAGCAAAAG CTAAAGCGGA GGAAGCAACT AAAGCTCGCG AAGTAGCTGA AGCTCAAGCA GCGGCTAAAG CTCGTGAAGC AGCTAAGGCT CAAGAGGCAG CTAAAGCTCA GGCAGAAGCT AAAGCTCAAG AAGCAGCTGA AGCTCAAGCA GCGGCTAAAG CTCAAGAGGC AGCTAAAGCT CGTGAAGCAG CTAAAGCTCA GGCAGAAGCT AAAGCTCAAG AAGCAGCTGA AGCTCGTGAA GCGGCTAAAG CTCAAAAACC AGCTACACAA CAACCTGTTG CAAAAGAAAC TGAAACAAGT GCACCATCAT CTTCTCGTGA GTTACGCGTT GTAGCAACAG CTTACACAGC AGATCCACTT GAAAATGGTT ATAAAGCAGG CGACCAAGTA AAATCAGCTT TAGGTCACAA CTTAACAGCT AATCCAAACA TGAAACTAAT CGCAGTTGAT CCAAGTGTCA TTCCATTAGG TTCAAAAGTA TGGGTTGAAG GTTACGGAGT AGCAATCGCT GGTGATACTG GTGGAGCTAT TAAAGGAAAC AAAATCGACG TTTTAATGCC AGACAAAGGT ACATCAAGTA ACTGGGGACG TAAAACAGTT ACAGTTAAAG TATTAAACTA G
|
Protein sequence | MKKFMGIATA AVFGLGIFTT SAKAETIVTT DVLNVRENPT TESKVVGKLL DGYKVNVLHT ENGWSKVKLN SGKEAFISAD YTKDTYYVTA NVLNVRAGAN TDSEILGKLK QDDVIETTHQ VENGWIQFEY NGKTAYVHVP YLTGKAPVKV QPVVKAEKTT TVQDTAKAVA TTKAREVAET QAKAKAEEAT KAREVAEAQA AAKAREAAKA QEAAKAQAEA KAQEAAEAQA AAKAQEAAKA REAAKAQAEA KAQEAAEARE AAKAQKPATQ QPVAKETETS APSSSRELRV VATAYTADPL ENGYKAGDQV KSALGHNLTA NPNMKLIAVD PSVIPLGSKV WVEGYGVAIA GDTGGAIKGN KIDVLMPDKG TSSNWGRKTV TVKVLN
|
| |