Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BRA1185 |
Symbol | |
ID | 1165638 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Brucella suis 1330 |
Kingdom | Bacteria |
Replicon accession | NC_004311 |
Strand | - |
Start bp | 1186840 |
End bp | 1188327 |
Gene Length | 1488 bp |
Protein Length | 495 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637332272 |
Product | sulfatase |
Protein accession | NP_700338 |
Protein GI | 23500898 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.653201 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGTCC TCTATATCGA CATTGACAGC CTGCGCCGCG ATCATCTGGG CTGCTACGGA TATCACCGCA ACACGTCACC GACTATCGAC GCTATCGCGC GGGAAGGCGT GCGGTTTGAG AATATCTATG TTTCGGATGT TCCCTGCCAT CCGTCGCGCA CGGCGCTGTG GTCGGGCAGG CACGGTTTTC GCACCGGAGT GGTTGGCCAT GGAGGCACGG CCTGCGAGCC TTTTCGGGAG GGGGCAAGTC GCGCCTGGGC AGGCACCTTC TATGAAGAAG GCTGGATGAA AGCTCTTCGC GATCTGGACT ATTACACCAC CACCATATCC AGCTTCGGCG AGCGGCATGG CTGCTGGCAT TGGTATGCCG GTTTCAACGA GGTCATGAAT TGCGGCAAGG GCGGGATGGA AAATGCTGAT GAAATCGTGC CGATGGCCAT AGACTGGATT GCGCGCAACA AGTCGCGCAA ATGGTTCCTG CATGTCAATC TGTGGGATCC GCATACCCCT TATCGTGTGC CCGAGGAATG GGGCGATCCT TTCGCCGGTG AGCCGCTGCC CGCATGGATG ACAGAGGAGG TTCTGGCGAG AAGCATCGCT GGTTACGGTC CGCACAGCCC ACAGGAACCG ACCGGCTTTT CCGCGGACAA CCCACATGCG CATTACACCA GAATGCCAGC CCCGATAGAT TCGATGGAGA AAGTTGCGGC CTGGTTCAAC GGTTATGATG CCGGTGTTCG CTACGCAGAC GAGCACATCG GCCATCTGAT CGCGGCGCTG AAGGAGCATG GTCTTTACGA CAATACGATC ATTGTGATTG GAGCCGATCA TGGCGAAAAT CTGGGTGAGC TGAATGTCTG GGGAGATCAT CAGACGGCGG ATGAATTCAC CTGCAATGTG CCGCTTATCA TTCGCTGGCC GGGCCAGGCT CCGGGCGTAA ATCATGGGCT GCACTACCAT TTCGACTGGC CTGCAACATT GATTGACGGG CTTGGCGGAA AAGTACCGTC GGTTTGGGAT GGAAAGTCAT TCGCACCGGC AATGCGCCGG AACGAGAATG GCGGGCGCGA TTTCCTCGTG CTTAGTCAGG GTGCCTGGGC CGTGCAGCGC GGTGTGCGGT TCCGTCATCA GAATGAAGAT TGGCTGATGC TTCGCACATG GCATGACGGG CTGAAGGATT TCGGGCCTGT CACCCTGTTC AATCTCACAT CCGATCCGCA TGAGCAGACG GATCTATCCC AAAGCCGCCC CGACATTGTC GCACATGCCA GCCGTATGCT CGATGAATGG TATGGCACAA TGGCCCAATG CAGCGATCAG GATGTCGATC CGCTCATGAC GGTTCTGCGC GAGGGCGGCC CTTATCATAC GCTGGGTGAG CTTCCGGCCT ATCTAGAACG GCTCCGCGCC ACGGGACGGG GAGACCATGC CGACGCTCTT GCCGCGCGTC ATCCGCAGCG TGAGAAGGAA AAGGCAACTT TGCGATAA
|
Protein sequence | MNVLYIDIDS LRRDHLGCYG YHRNTSPTID AIAREGVRFE NIYVSDVPCH PSRTALWSGR HGFRTGVVGH GGTACEPFRE GASRAWAGTF YEEGWMKALR DLDYYTTTIS SFGERHGCWH WYAGFNEVMN CGKGGMENAD EIVPMAIDWI ARNKSRKWFL HVNLWDPHTP YRVPEEWGDP FAGEPLPAWM TEEVLARSIA GYGPHSPQEP TGFSADNPHA HYTRMPAPID SMEKVAAWFN GYDAGVRYAD EHIGHLIAAL KEHGLYDNTI IVIGADHGEN LGELNVWGDH QTADEFTCNV PLIIRWPGQA PGVNHGLHYH FDWPATLIDG LGGKVPSVWD GKSFAPAMRR NENGGRDFLV LSQGAWAVQR GVRFRHQNED WLMLRTWHDG LKDFGPVTLF NLTSDPHEQT DLSQSRPDIV AHASRMLDEW YGTMAQCSDQ DVDPLMTVLR EGGPYHTLGE LPAYLERLRA TGRGDHADAL AARHPQREKE KATLR
|
| |