Gene Information |
Locus tag | TM1040_2113 |
Symbol | |
ID | 4076427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Ruegeria sp. TM1040 |
Kingdom | Bacteria |
Replicon accession | NC_008044 |
Strand | + |
Start bp | 2217848 |
End bp | 2219566 |
Gene Length | 1719 bp |
Protein Length | 572 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 638007432 |
Product | sulfatase |
Protein accession | YP_614107 |
Protein GI | 99081953 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
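The length fields above are consistent with 1-based, inclusive coordinates: End bp minus Start bp plus one gives the gene length, and the codon count minus one stop codon gives the protein length. A minimal Python sketch of that arithmetic follows. The function names are hypothetical, and the inclusive-coordinate convention is an assumption inferred from the values in this record, not something the record states.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(2217848, 2219566)
print(length)                     # 1719
print(protein_length_aa(length))  # 572
```

Both results match the Gene Length and Protein Length fields reported for TM1040_2113.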
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.367046 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCACCCA GCGGCATGTT TTGCCAAAGG GTTTTTGTGC CTCCCGACCT AACCTATCGT CGTTTTGGCA CTCGGAACAA GGCATGCGCT ATGAATATTC TCTTTATCAT GTTCGACCAG CTCCGGTTCG ACTACCTGAG CTGCGCGGGC CACCCGCATC TCAAGACGCC TCATATCGAC CGGCTGGCCG AGCGCGGCGT GCGGTTCACT AACGCTTATG TGCAATCCCC GATCTGCGGC GCCGCACGGA TGAGCTGCTA TACCGGGCGC TATGTGTCCA GCCACGGGGC GCAATGGAAC AACTCGCCCT TGCGGGTGGG CGAATGGACC ATGGGGGATC ACCTGCGCAT GGCGGGCATG GGCTGCTGGC TCATTGGCAA GACCCATATG AATGCCGACA GCGAGGGCAT GGCCCGCCTC GGCCTCAGCC CCGACAGCGT GATTGGCGCA CGCCAGGCCG AATGCGGCTT TGACGTCTGG ATCCGCGACG ATGGCCTCTG GGCCGAGGGG CCGGACGGGT TTTATGACCA AAAACGCAGC CCCTATAATG AATACCTTAA ATCAAAAGGT TACGCGGGCC ACAACCCCTG GCACGACTTC GCCAATGCGG GTCTTGAGGG CGAAGAAATG GCGTCCGGCT GGTTCATGGC CAATGCGCAG AAGGCCGCGA ATATTGCCGA GGAAGACAGC GAAACCCCGT GGCTCACCAC CAAGACCATC GAGTTCATTG AACAGGCCGA GGGGCCATGG TGCGCGCATG TGAGCTATAT CAAGCCGCAT TGGCCCTATA TCGTGCCCGC GCCCTATCAC GACATGTACG GACCGGAGCA TGTGCTGCCC GCCGTCAAAG ACCCCGCGGA GCGCGAAGAC CCGCACCCCG TTTACGGTGC CTTCATGGGC AATGCCATCG GTCAGGCCTT CTCGCGCGAG GAAGTCCGAC AGGCCGCCAT CCCCGCCTAT ATGGGCCTCA TCAAGCAATG CGACGACCAG ATGGGGCGGC TGTTTGAGTA CCTTGAGGAC ACCGGCCGGA TGGATGACAC GATGATCGTG ATCACCTCTG ACCACGGCGA CTATCTGGGC GATCACTGGT TGGGCGAGAA GGATCTCTTT CACGAACCCT CCGTCAAAGT GCCGATGATC ATCTATGATC CCCGCCCCGA CGCCGACGCC ACCCGAGGCA CCACCTGCGA CGCGCTGGTG GAAAACATCG ACCTGCTGCC CACTTTCGTG GAGGCCGCAG GCGGCGAGGT CGCAGATCAC ATTCTGGAGG GGCGCGCGCT TACACCATGG CTGCATGGTC AGACACCCGA GGTGTGGCGG GACTACGCAA TCAGCGAATA CGACTATTCC GGCACGCCGA TGAGTGTGAA GCTTGGCAGC GCCCCCCGCG ATGCGCGGCT GTTTATGGTG ACGGACACAC GCTGGAAATT CATGCACGCC GAGGGCGGCC TGCCGCCAAT GCTATTTGAT CTGGAAAACG ACCCGCAGGA ATTTCACGAC CTTGGCCGCA GCCCAGACCA CACCGAGGTG ATCGATATGA TGTATGCGCG CCTCGGTCAG TGGGGGCGGC GCATGTCGCA ACGCATCACC CGCTCGGACG CGCAGATCAT TGCGGGGCGC GGCGCTTCAC GCGGCAAAGG CATTTTGCTT GGGGTCTATG AACCTGAGGA CGTCCCCGCT GAGTTAACCG TAAAATATCG CGGCAAACCG CCGACCTGA
|
Protein sequence | MPPSGMFCQR VFVPPDLTYR RFGTRNKACA MNILFIMFDQ LRFDYLSCAG HPHLKTPHID RLAERGVRFT NAYVQSPICG AARMSCYTGR YVSSHGAQWN NSPLRVGEWT MGDHLRMAGM GCWLIGKTHM NADSEGMARL GLSPDSVIGA RQAECGFDVW IRDDGLWAEG PDGFYDQKRS PYNEYLKSKG YAGHNPWHDF ANAGLEGEEM ASGWFMANAQ KAANIAEEDS ETPWLTTKTI EFIEQAEGPW CAHVSYIKPH WPYIVPAPYH DMYGPEHVLP AVKDPAERED PHPVYGAFMG NAIGQAFSRE EVRQAAIPAY MGLIKQCDDQ MGRLFEYLED TGRMDDTMIV ITSDHGDYLG DHWLGEKDLF HEPSVKVPMI IYDPRPDADA TRGTTCDALV ENIDLLPTFV EAAGGEVADH ILEGRALTPW LHGQTPEVWR DYAISEYDYS GTPMSVKLGS APRDARLFMV TDTRWKFMHA EGGLPPMLFD LENDPQEFHD LGRSPDHTEV IDMMYARLGQ WGRRMSQRIT RSDAQIIAGR GASRGKGILL GVYEPEDVPA ELTVKYRGKP PT
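The two sequence fields can be cross-checked against each other. The sketch below translates a nucleotide string using the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which start codons it permits), and also computes a GC fraction. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical; only the first 30 and last 15 bases of the gene sequence are used here as short, checkable inputs.

```python
from itertools import product

# Standard codon assignments in NCBI's TCAG ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 and last 15 bases of the gene sequence in this record.
print(translate("ATGCCACCCA GCGGCATGTT TTGCCAAAGG"))  # MPPSGMFCQR
print(translate("AAACCGCCGACCTGA"))                   # KPPT
```

The two outputs match the first ten and last four residues of the Protein sequence field. Passing the full Gene sequence field to `gc_fraction` would give a value to compare against the reported GC content.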