Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BAS2236 |
Symbol | |
ID | 2850634 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus anthracis str. Sterne |
Kingdom | Bacteria |
Replicon accession | NC_005945 |
Strand | - |
Start bp | 2238940 |
End bp | 2239878 |
Gene Length | 939 bp |
Protein Length | 312 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 637505484 |
Product | inosine-uridine preferring nucleoside hydrolase family protein |
Protein accession | YP_028497 |
Protein GI | 49185245 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG1957] Inosine-uridine nucleoside N-ribohydrolase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAAAAG TGTACTTCAA TCATGATGGT GGTGTAGATG ATTTAGTTTC ACTATTTTTA TTACTCCAAA TGGATAATGT CGAACTTACA GGTGTTTCAG TTATCCCAGC AGATTGCTAT TTAGAGCCAG CAATGTCTGC AAGTCGAAAA ATTATTGATC GCTTCGGCAA AAATACTATT GAGGTAGCAG CTTCTAATTC TCGTGGAAAA AATCCGTTTC CAAAAGACTG GCGGATGCAT GCATTTTATG TAGATGCTTT ACCCATTTTA AATGAGTCTG GAAAAGTTGT AACGCACGTA GCAGCAAAGC CTGCGCATCA TCATTTAATT GAGACTCTTC TACAAACTGA AGAGAAAACA ACTTTATTAT TTACAGGTCC TCTTACCGAT TTAGCCCGTG CACTATATGA AGCACCTATA ATCGAAAATA AAATTAAACG TTTAGTTTGG ATGGGCGGTA CATTTCGTAC TGCAGGCAAT GTACATGAAC CTGAACATGA TGGAACAGCC GAATGGAATT CGTTTTGGGA CCCTGAAGCA GTAGCTCGCG TATGGGAAGC AAATATAGAA ATCGACTTAA TAACGCTAGA AAGTACAAAC CAAGTTCCCC TAACTATAGA CATACGTGAA CAATGGGCAA AAGAGAGAAA GTATATCGGT ATTGATTTCC TTGGTCAATG TTATGCAATT GTTCCCCCTC TTGTTCACTT TGCAAAGAAC TCTACCTACT ATTTGTGGGA TGTATTAACT GCTGCCTTTG TTGGGAAAGC TGATCTAGCA AAAGTACAAA CGATCAATAG TATCGTTCAT ACATACGGGC CAAGCCAAGG GCGTACAGTG GAAACTGATG ATGGGCGGCC GGTACATGTT GTTTATGATG TAAACCACGA TCGATTTTTC GACTATATAA CTCGGTTAGC AAAGAAAGTC TCTACTTAA
|
Protein sequence | MKKVYFNHDG GVDDLVSLFL LLQMDNVELT GVSVIPADCY LEPAMSASRK IIDRFGKNTI EVAASNSRGK NPFPKDWRMH AFYVDALPIL NESGKVVTHV AAKPAHHHLI ETLLQTEEKT TLLFTGPLTD LARALYEAPI IENKIKRLVW MGGTFRTAGN VHEPEHDGTA EWNSFWDPEA VARVWEANIE IDLITLESTN QVPLTIDIRE QWAKERKYIG IDFLGQCYAI VPPLVHFAKN STYYLWDVLT AAFVGKADLA KVQTINSIVH TYGPSQGRTV ETDDGRPVHV VYDVNHDRFF DYITRLAKKV ST
|
| |