Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bind_0555 |
Symbol | |
ID | 6198516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Beijerinckia indica subsp. indica ATCC 9039 |
Kingdom | Bacteria |
Replicon accession | NC_010581 |
Strand | - |
Start bp | 614747 |
End bp | 616207 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641704546 |
Product | sulfatase |
Protein accession | YP_001831696 |
Protein GI | 182677550 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGGTC TCTTTTCTTT CAACCGCCGC AATCTGCTCA TCGGAACGGC GGCAGCGACT GTTACTTCTT TGCCCCAGAT CGCACGGCCT GCAACGGGTG ATAGAGCGCC GAATATTATC TTCATCCTTG CCGATGATCT CGGCTATGCG GATGTTTCCA TTTATGGACG GCCCGATCTC TCCACACCTA ATATCGACGG CATCGGGCTC AAGGGAGCAC GTTTGCTTCA GGCTTACGCA AATTCCGCTG TCTGCTCGGC GACACGGACG GCTTTGCTCA CCGGCCGCTA TCAATATCGC GAGCGGGTGG GCCTTGAGGA GCCGATCGCC GGCAATATCC ATGTCGGCCT GCCACCGCAG CGCCCGACCT TGCCCTCGCT TTTGAAAAAA GCAGGTTACA CCACGACTCT CATCGGTAAA TGGCATCTCG GCACATTGCC GGACTTCGGC CCACTGCAAA GCGGCTATGA TCATTTCTAC GGCTTTCGCG GCGGTGCCGT CGATTATTAT TCACACAAAG GCACCGATGA TCAGGACGAT CTGTGGGATC AAGATACAAA GGTTCACCAA ACCGGTTATT TGACGGAATT GCTCGGCGAC CGCGCCATCG AAACCATCAA CGCTTCAGCC AAAACCGGCC AGCCTTTCTT CATCAGCCTG CATTTCAATG CGCCCCATTG GCCCTGGGAA GCGCCCGGGG ATGAAGCGGA ATCCGCGCGT GTGGCAGGGA CGCGCCTGTT CGACTTCGAT GGCGGATCAC AAGCGACCTA TCGCGGCATG ATCGCAGCGA TGGATCTCCA AATCGGGCGC ATTGTGCAGG CTCTGCAAGC CAATGGGATC AGCGAGAATA CGATTGTCAT CTTCACAAGC GATAATGGCG GTGAGCGTTT TGCCGATACA TGGCCGTTTA CCGGCCGTAA GACGGAACTA CTCGAAGGCG GATTGCGCAT CCCTGCCCTC GTCTCTTGGC CGGCGCGGAT CAAAGCAGAT CAAACCATCG ATCAGGTCAG CATCAGCATG GATTGGCTGC CGACTCTCTT AGCGGCCGCT GGGAGCGAAC CCGATCCAAA TTTTCCCTCC GACGGGATTA ATCTGCTGCC TTTCCTGAGC GAAAGCAAAG CCGCTATCCC TCGCAAATTG TTCTGGCGCT ACAAAGCCAA TGCCCAGCGC GCAGTGCGCG ATGGCGATTA CAAATATCTC AAAATCCGGG ACAATGATTT TCTCTTCAAC GTGGTCGATG ATCCGCTGGA ACGCGTCAAT CTGAAAGAGC GCCACAAAGA TATTTACAAT CGCCTTCTCG CCGAATGGCT CGAGTGGAAC AGCACTATGC TACCCGAAAT CACTGAGAGC TTTACGCACG GCTTCACGGG TCACGAACTT GCCGATCACT ATGGCGTGAC CGCACCAACC ACAGAACCTG ACAATCCCGC GCCTCTTCGG GCGATGCGCC GCGATGATTA A
|
Protein sequence | MPGLFSFNRR NLLIGTAAAT VTSLPQIARP ATGDRAPNII FILADDLGYA DVSIYGRPDL STPNIDGIGL KGARLLQAYA NSAVCSATRT ALLTGRYQYR ERVGLEEPIA GNIHVGLPPQ RPTLPSLLKK AGYTTTLIGK WHLGTLPDFG PLQSGYDHFY GFRGGAVDYY SHKGTDDQDD LWDQDTKVHQ TGYLTELLGD RAIETINASA KTGQPFFISL HFNAPHWPWE APGDEAESAR VAGTRLFDFD GGSQATYRGM IAAMDLQIGR IVQALQANGI SENTIVIFTS DNGGERFADT WPFTGRKTEL LEGGLRIPAL VSWPARIKAD QTIDQVSISM DWLPTLLAAA GSEPDPNFPS DGINLLPFLS ESKAAIPRKL FWRYKANAQR AVRDGDYKYL KIRDNDFLFN VVDDPLERVN LKERHKDIYN RLLAEWLEWN STMLPEITES FTHGFTGHEL ADHYGVTAPT TEPDNPAPLR AMRRDD
|
| |