Gene Fjoh_4647 details


Gene Information

Locus tag: Fjoh_4647
Symbol:
ID: 5094101
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Flavobacterium johnsoniae UW101
Kingdom: Bacteria
Replicon accession: NC_009441
Strand:
Start bp: 5619188
End bp: 5621110
Gene Length: 1923 bp
Protein Length: 640 aa
Translation table: 11
GC content: 38%
IMG OID: 644744068
Product: sulfatase
Protein accession: YP_001196965
Protein GI: 146302374
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3119] Arylsulfatase A and related enzymes
TIGRFAM ID:
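
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 5619188
end_bp = 5621110
gene_length_bp = 1923
protein_length_aa = 640

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # True
print(remainder == 0)                            # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under these assumptions, the span is 1923 bp, which is 641 codons: 640 amino acids plus one stop codon.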


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAA TTATGAATAA CGCTGTAATT AAAAGCTCTA AAAAAGGGCT TTTAATTACT 
GCAATTCTGG TTGCACAACT AGGAATTGCA CAAGAAAAAG AAGAATTTAA AGGTACAATT
GGCAAAACAC TGGCTGATTC TAAAGAATAC TGGCCAGATC CGGTAAAAGC GCCAAAAGGC
GCTCCAAACA TTGTCTGGAT ATTATTGGAT GATGTTGGAT TTGGAGCTTC AAGCGCTTTT
GGAGGTTTAA TCAATACGCC TACTTTTGAT AATCTGGCGA ATAACGGTTT ACGTTATACA
AACTTTCACA CTACAGCTAT TTGCGCGCCA ACACGTGCTG CATTATTAAC CGGAAGAAAT
TCTGGAAGAG TTCACGTAAG CGGTTTTTCT CACACTGTTT TATCAGCAGG TTTTCCCGGC
TGGGACGGAA GAATTCCTTC TGATAAAGGA ACAATTGCCG AAATTTTACG CGACAATGGC
TATAACACTT TTGCCGTTGG TAAATATGGT GTAACTCCTG ATGAAGAAGC TACAGACGCA
GGACCATTTG ACAGATGGCC GACTGGAAAA GGTTTCGATC ATTTTTACGG ATTCCTGGGT
TCGCAGACAG ACCAATACAA TCCTGATTTA GTGGAAGATC AGACTCATAT TAAACCTGAC
GGACGTCATT TAAATGAATT GATTACAGAC AAAGCCATCA GCTATATTAA AACACAGCAG
AAAGCGGCGC CGGGAAAACC TTTCTTTTTA TACTATGCAC CGGGAGCTGT TCACGCACCT
CATCAGGTAG CTGAAAAATG GAGCGACGCA TACAAAGGTA AATTTGACGA AGGTTATGAT
GTGTACCGCG AAAAAGTTTT AGCGAATCAA AAGAAATTAG GAACTGTTCC GGCTAATGCT
GTATTGCCAG AACGCAATCC GCTTATTACA GAATGGAAAA AATTAACTCC TGACCAAAAG
AAAGTATATG CAAGATTCAT GGAAGTTTAC GCCGGATTCC TGACTTATAC TGATTATGAA
ATTGGAAGAG TGGTTAATTA TTTAAAAGAA ACCAATCAGC TTGATAATAC TCTGATTTTT
GTGGCAATTG GAGATAATGG AGCCAGTAAA GAAGGAACTT TGCAGGGAAC GATCAACCAG
AGTTTATTTT CACAAGGAAA ATCAGATGAA GAAAATCTTC AAAGCAACTT AAACAACATT
GGCGAAATTG GAACTGCAAA AGGTCTTAAT ACCAACTATC CTTTAGGATG GGCACAGGCT
ACAAACGTTC CTTTCAAAAA CTGGAAACAA GATGCACATT CTGAAGGAGG AACACGCAAT
CCGCTGATTG TATTTTATCC AAACGGAATT AAAGACAAAG GCGGTATCAG AAATCAATAC
AGCCACGTAA CCGATTTACT GCCAACGACT TTAGATATTG CAGGAATTAC TGCTCCGGAA
TATATCCGAA ATATTAAACA AGATATTATT CAGGGCTCAA CATTCAAAGC TTCATTGGAT
AATCCAAAAG CAGAATCTTT ACACAAAGTT CAGTACTACT ATATCTTTGG CAACAGAGCT
ATTTATAAAG ACGGATGGAA AGCGGCAGCT GCACATTTAC CAGATTCATT TGCCGTAAAA
CAATCATTAG GGAAAAATGA AAAACCTGCT GCAAGTAATT TTGATACTGA TGTTTGGGAA
TTATACAACC TGAACGAAGA TTTTAACGAA CGTAACAATC TGGCTAAAAA GTATCCGGAA
AAACTGGCTG AACTTCAAAA ATTATTTGAC GAACAGGCAA AAGAAAACAA TGTTTATCCG
TTAATTGACT GGCAGGACGT GTACAACAGA AGAATTCACA ATACTGCTGC TGACAAAGGT
AAAACATTAC AGGATTTGGT AAAACAAGTA ACCAAACCAG CTGATACAGG AAGCAGTAAT
TAA
 
Protein sequence
MKKIMNNAVI KSSKKGLLIT AILVAQLGIA QEKEEFKGTI GKTLADSKEY WPDPVKAPKG 
APNIVWILLD DVGFGASSAF GGLINTPTFD NLANNGLRYT NFHTTAICAP TRAALLTGRN
SGRVHVSGFS HTVLSAGFPG WDGRIPSDKG TIAEILRDNG YNTFAVGKYG VTPDEEATDA
GPFDRWPTGK GFDHFYGFLG SQTDQYNPDL VEDQTHIKPD GRHLNELITD KAISYIKTQQ
KAAPGKPFFL YYAPGAVHAP HQVAEKWSDA YKGKFDEGYD VYREKVLANQ KKLGTVPANA
VLPERNPLIT EWKKLTPDQK KVYARFMEVY AGFLTYTDYE IGRVVNYLKE TNQLDNTLIF
VAIGDNGASK EGTLQGTINQ SLFSQGKSDE ENLQSNLNNI GEIGTAKGLN TNYPLGWAQA
TNVPFKNWKQ DAHSEGGTRN PLIVFYPNGI KDKGGIRNQY SHVTDLLPTT LDIAGITAPE
YIRNIKQDII QGSTFKASLD NPKAESLHKV QYYYIFGNRA IYKDGWKAAA AHLPDSFAVK
QSLGKNEKPA ASNFDTDVWE LYNLNEDFNE RNNLAKKYPE KLAELQKLFD EQAKENNVYP
LIDWQDVYNR RIHNTAADKG KTLQDLVKQV TKPADTGSSN
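
The relationship between the gene sequence, the protein sequence, and the GC content field can be reproduced with a short script. This is a rough sketch rather than the method used to build this record. It relies on the codon-to-amino-acid assignments of translation table 11 being the same as those of the standard genetic code (the tables differ in which start codons are permitted). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; table 11 shares these with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases into blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
first_bases = "ATGAAAAAAA TTATGAATAA CGCTGTAATT"
print(translate(first_bases))  # MKKIMNNAVI
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.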