Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Taci_0369 |
Symbol | |
ID | 8630179 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermanaerovibrio acidaminovorans DSM 6589 |
Kingdom | Bacteria |
Replicon accession | NC_013522 |
Strand | + |
Start bp | 395361 |
End bp | 397247 |
Gene Length | 1887 bp |
Protein Length | 628 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | sulfatase |
Protein accession | YP_003316888 |
Protein GI | 269791984 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00151431 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTTCA GGGGTAATCC GGGTGGGATC CGGGGGTTGG CGGGGCACCT CATATGGCCC CTGGCGGTCT GGACCGTCAG CTCCTTCAAG TTCCTGGCCC TCTGCGACTC CATGAGGGGG GGATCCCAGT GGTGGCCCCA GGTTCTCTGG TCCACCATCC TGGGTGCCCT GGCCCTTTCG GGGCTGCCCC TTCTGCTCCC CAGGAGGGTG AGGCCCCTGG GGTTCGTGGC GGTGAGCGCC CTGGTGTCCG CCCTCTGTCT GGCGGACCTG CTCTACTATA GGTACTACAC GGACCTCTTC ACCGTTCGCA GCCTCTCCCT GGCGGGACAG CTGGGGGACG TGTGGGACTC GGTCCGGTCC CTCTTCTCCA TGGGGGACCT GCTGTTCCTC ATGGACCTGC CGGTCCTCCT GGCGGTGGCC CTCTTGGACT TGAGATCCAG GGAGAGGGAC CTCAGGTTCC GCTTCCGGGC CGCGGGGACC TGCCTGTGCC TGTTGGGCCT TTTGCCCTTC GGCTTCCAGA CCGGGATCGT CAGGCTCAGG GTGCCCGGCT ACGTGAGGGC CATGTGGGAC CGGCCGGCGG TGATGTTCAC GCTGGGGCCC GTGGGGTACC ACCTGGCGGA CCTCATTAAC GCCTCGTCGG ACCTCTTCGC CTCCCGGGCG GTGGCCGCCC AGGACGAGGA GGCCCTCCTG GATTGGGCGG TCAAGCGCCG CCAGCAGGTG CCGGTCTCCC AGCGGGACTC CCTCCACGGG GCCGGCAAGG GGCTCAACCT GATAGTGGTC CAGGTGGAGG CCCTCCAGGG GTTCGTCATG AACCTGAAGG TGGGGGGCCA GGAGGTGACC CCCAACCTTA ACCGCCTGGC CCGGCGGAGC CTCTACTTCC CCAACGTCTA CAACCAGACC GGTGCGGGCA ACACCTGCGA CGCGGAGTTC ATGGTCCAGA CCTCCCTCTT CCCCTCCGCC ACCGGGGTGG CCTACGTGAG GTTCTCGGGC AACTACTTCG AGTCCCTCCC CCGGATCCTC CAGAGGAACG GCTACGAGAC CCTATCCATG CACGGCAACA GGGCCTCCTT CTGGAACAGG CACCGGATGC ACCCCGCCCT GGGTTTCCAG GAGTTCTTCA GCAAGGAGAG GCTGAGGAGC GATGAGGAGA TAGGCCTGGG CCTGTCGGAC CGGAGCTTCT TCGAGCAGGG AGCCCGGATC CTATCGGAGC GGAAGGGGCC CTTCTACGCC TTCATGGTGA CCTTGAGCAG CCACCACCCC TTCTCGTTCC AGGGGATACC CAGGACGCTC AAGCTGGACG GGTCCCTGGA GGGGACCTTT CTGGGGGATT ACCTCAACGC CATTCACTAT GCGGATGCTC AGATAGGCAG GTTTCTGCAG GATCTCAGGA GGAGGGGGAT CCTGGACAGG TCGGTGCTGG TGGTCTACGG GGACCACACC GCCATACCCA ACGCCAACGG GTCGGAGCTG GAGGTCCTCC TGGAGCAGGA TCTAAAGGAC CCGTTGAGGT GGAGGGCCCT GCAGAAGGTG CCGCTGATGA TACGGCTCCC CAAGGGCAGG GGTGCCAGGG TGGTGGAGAC CACCGGGGGG CACGTGGACA TAGGCCCCAC CGTGGCGTCC ATCATGGGGG TGAACATGCC CCTGGCGTTC GGCCAGGACC TTCTAACCGC CCGGGTGGGC ACCGTGGTCT TCCGAAACGG GACCTTCATA AGGGGCGGGG TCTTCGTGGA CCCCACCGCC CGGATGGCCT GGAGCATGGG GACCTTGTCG GAGGTGCCCT TCGACGCCTA CCGGGACCTG GCGGAGGGGG CGTCGGAGGC GCTGCGCCTG TCGGACCTGC TACTGGAGAA GGACATGGCC CAGAGGGTTT GCGAGAGGCT GCCTTGA
|
Protein sequence | MKFRGNPGGI RGLAGHLIWP LAVWTVSSFK FLALCDSMRG GSQWWPQVLW STILGALALS GLPLLLPRRV RPLGFVAVSA LVSALCLADL LYYRYYTDLF TVRSLSLAGQ LGDVWDSVRS LFSMGDLLFL MDLPVLLAVA LLDLRSRERD LRFRFRAAGT CLCLLGLLPF GFQTGIVRLR VPGYVRAMWD RPAVMFTLGP VGYHLADLIN ASSDLFASRA VAAQDEEALL DWAVKRRQQV PVSQRDSLHG AGKGLNLIVV QVEALQGFVM NLKVGGQEVT PNLNRLARRS LYFPNVYNQT GAGNTCDAEF MVQTSLFPSA TGVAYVRFSG NYFESLPRIL QRNGYETLSM HGNRASFWNR HRMHPALGFQ EFFSKERLRS DEEIGLGLSD RSFFEQGARI LSERKGPFYA FMVTLSSHHP FSFQGIPRTL KLDGSLEGTF LGDYLNAIHY ADAQIGRFLQ DLRRRGILDR SVLVVYGDHT AIPNANGSEL EVLLEQDLKD PLRWRALQKV PLMIRLPKGR GARVVETTGG HVDIGPTVAS IMGVNMPLAF GQDLLTARVG TVVFRNGTFI RGGVFVDPTA RMAWSMGTLS EVPFDAYRDL AEGASEALRL SDLLLEKDMA QRVCERLP
|
| |