Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GBAA_3841 |
Symbol | |
ID | 2815459 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. 'Ames Ancestor' |
Kingdom | Bacteria |
Replicon accession | NC_007530 |
Strand | + |
Start bp | 3514729 |
End bp | 3516165 |
Gene Length | 1437 bp |
Protein Length | 478 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637790563 |
Product | hypothetical protein |
Protein accession | YP_020476 |
Protein GI | 47529128 |
COG category | |
COG ID | |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.000174288 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTCGTT ATGACGACAG TCAAAACAAA TTCTCCAAAC CATGCTTTCC AAGTAGCGCT GGACGAATCC CGAATACTCC ATCAATCCCA GTTACTAAGG CACAACTTAG AACATTTCGC GCAATCATTA TTGATTTAAC AAAAATAATC CCAAAACTTT TCGCAAATCC ATCTCCCCAA AATATTGAAG ATCTAATCGA TACATTGAAC CTACTAAGTA AATTTATTTG TTCACTAGAC GCTGCTTCCT CCCTGAAAGC ACAAGGATTA GCTATTATTA AAAACTTAAT AACTATATTA AAAAACCCAA CTTTCGTAGC AAGTGCTGTA TTTATCGAGC TTCAAAATCT AATTAATTAT TTACTATCCA TTACAAAACT ATTCCGAATT GACCCTTGCA CACTTCAAGA GCTTCTTAAA TTAATAGCAG CATTACAAAC CGCTTTAGTT AATTCTGCTT CATTCATTCA AGGACCTACT GGACCTACTG GACCTACTGG GCCAGCTGGT GCTACCGGTG CTACTGGACC TCAAGGTGTT CAAGGACCAG CAGGCGCTAC CGGTGCCACT GGACCTCAAG GTGTTCAAGG ACCAGCAGGT GCTACTGGCG CTACTGGACC TCAAGGTGCT CAAGGACCAG CAGGTGCTAC CGGTGCTACT GGACCTCAAG GTGCTCAAGG ACCAGCAGGT GCTACTGGTG CCACTGGACC TCAAGGTATT CAAGGACCAG CAGGTGCTAC CGGTGCTACT GGACCTCAAG GCGTTCAAGG GCCAACGGGT GCTACTGGTA TAGGAGTTAC CGGACCTACT GGGCCTTCTG GTGGGCCTGC TGGTGCTACT GGACCTCAGG GACCTCAAGG TAATACAGGT GCTACTGGAC CTCAAGGTAT TCAAGGGCCT GCTGGTGCTA CTGGTGCCAC TGGACCTCAA GGTGCTCAAG GACCGGCTGG TGCTACCGGC GCTACTGGAC CTCAAGGTGT TCAAGGGCCA ACGGGTGCTA CTGGTATAGG AGTTACCGGA CCTACTGGGC CTTCTGGACC TAGCTTCCCT GTAGCAACAA TTGTTGTAAC AAACAACATT CAACAAACAG TACTCCAATT TAACAACTTC ATTTTTAATA CTGCAATTAA CGTAAACAAC ATTATCTTCA ACGGCACAGA TACAGTTACT GTTATCAACG CTGGTATTTA TGTCATTAGC GTATCCATCT CTACAACTGC ACCAGGATGT GCACCACTCG GAGTAGGAAT TTCAATAAAT GGAGCAGTCG CAACTGACAA CTTCTCTTCA AATCTAATAG GCGACTCACT TTCATTCACT ACGATCGAAA CGTTAACTGC CGGCGCGAAC ATTTCTGTCC AATCCACTCT TAATGAGATT ACGATCCCTG CAACAGGAAA CACTAATATT CGTCTAACTG TATTTAGAAT CGCTTAA
|
Protein sequence | MSRYDDSQNK FSKPCFPSSA GRIPNTPSIP VTKAQLRTFR AIIIDLTKII PKLFANPSPQ NIEDLIDTLN LLSKFICSLD AASSLKAQGL AIIKNLITIL KNPTFVASAV FIELQNLINY LLSITKLFRI DPCTLQELLK LIAALQTALV NSASFIQGPT GPTGPTGPAG ATGATGPQGV QGPAGATGAT GPQGVQGPAG ATGATGPQGA QGPAGATGAT GPQGAQGPAG ATGATGPQGI QGPAGATGAT GPQGVQGPTG ATGIGVTGPT GPSGGPAGAT GPQGPQGNTG ATGPQGIQGP AGATGATGPQ GAQGPAGATG ATGPQGVQGP TGATGIGVTG PTGPSGPSFP VATIVVTNNI QQTVLQFNNF IFNTAINVNN IIFNGTDTVT VINAGIYVIS VSISTTAPGC APLGVGISIN GAVATDNFSS NLIGDSLSFT TIETLTAGAN ISVQSTLNEI TIPATGNTNI RLTVFRIA
|
| |