Gene Dfer_3855 details

Gene Information

Locus tag: Dfer_3855
Symbol:
ID: 8227450
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 4691162
End bp: 4693162
Gene Length: 2001 bp
Protein Length: 666 aa
Translation table: 11
GC content: 51%
IMG OID: 644931696
Product: sulfatase
Protein accession: YP_003088224
Protein GI: 255037603
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1368] Phosphoglycerol transferase and related proteins, alkaline phosphatase superfamily
TIGRFAM ID:
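
The start and end coordinates, the gene length, and the protein length listed in this record agree with one another. The sketch below checks that arithmetic in Python. It assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # the stop codon encodes no residue


length_bp = gene_length(4691162, 4693162)
print(length_bp)                  # 2001
print(protein_length(length_bp))  # 666
```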


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value: 0.593214
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.690048
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTTCAT ACACAGGCGC TTCCGGTTTC GGATGGCGCA TTCATCAGGT GCTGCTGATC 
AGGCTGCTCG GCGTCATGAT TTTGTTCAGC ATTTGCCGGA TTCTCTTCTA CTTGTTCAAC
CAATCGTTTT TTCCGGAGGT AAATCTGGCC CTGGCGCCCC GGCTGTTTCT CGGAGGTGTG
CGGTTCGACC TTGTGGCGGT GCTCTACACC AATATCCTTT ACGTTTTCCT GACGCTCATT
CCGGTAACCT TCAAATACAG GCCGGCATTT CAAACGTTTC TGGATTACCT GTTCATCATC
ACCAACAGCA TTGCCCTGCT GGCCAATTGC GCGGATTTTA TCTACTACCG TTTCACCATC
AAACGGAGCA CGTGGTCGGT TTTGCAGGAA TTTTCGCATG AAAACAACAT GCATAAGCTC
TTTGGCGGGT TCATTGTAAA CTATTGGTAT GTGGTGCTGG TGTGGATCAT GCTCGTGGCG
CTGGTGGTGT ACATCACACG CCGCTGGCGC GTGAGTACCA GGCCCGCCCG TCCGGTCTGG
CAGATATTTC TCGTGCACAG TGTATTGATG ACCGTTTCGG TGTTTCTTTT CATCGGCGGG
GTAAAAGGCG GCTTCCGGCA CAGTACCCGG CCGATCACAC TCAGTAATGC AGGCGAGTAC
GTGGATAAGC CCAAGGAGAT GTTTATCGTA CTGAACACGC CGTTCTGCAT TTTCAAAACT
TTGAAACGCT CGGATTACCA GCGGGCGGAT TATTTCAGAT CTGAGCAGGA AATGAATGAC
GTTTATTCAC CCATTCACAC GCCTGAGCCG GACGCGCCTG CATTTCGTCC GAAAAACGTG
GTGATCCTGA TCTGGGAAAG TTTTGGGAAG GAAATGGTAG GGACTTACAA TAAAGATTTG
GAAAATGGCA CCTACAAGGG CTACACGCCA TTTATCGACT CGCTGATGCC CCATAGCAAG
GTGTTCTGGT ATTCGTTTGC AAATGGCGCG AAGTCCATCG AAGCGATTCC GTCGGTACTT
ACGAGCATTC CGGGTATTCT GGAACCCTTC ATTCTGACAC GCTATACCGA CAATAAACTG
CCCAGCCTGC CCGAAATGCT GCAAGGCAAA GGCTACCATA CCTCCTTTTT CCACGGGGCA
CCCAACGGAT CGATGGGTTT CAAGGCATTC ACGAACCTGA TCGGCATTAA AGATTATTAT
GGCAAAACGG AATACAACAA CGATGCCGAT TACGACGGCA TCTGGGGCAT TTGGGACGAG
GAATTCATGA AATTCTGGGG CAGAAAGCTC GATACATTCC AGGAGCCGTT TATGAGCACG
CTGTTCACGG TGTCGTCGCA CGACCCGTAC AAGGTGCCGG CGCGCTACCG GGGCAAATTT
CCGAAAGGCC CGCTGCCGAT CTACGAAACG ATGGGTTATA CCGACAATGC ATTGCGCCGT
TTCTTCGATT CCGTGAAGGA TAAGCCATGG TTTAAAAACA CCTTGTTCGT TATCACGGCA
GATCACGCCG CCACATTTGC GCATTATCCC AAATACCAGA CCTCGGTCGG AAACTTCTCC
ATCCCCATCA TTTTCTATGC GCCGGGCGAC GCCTCGCCGG GCGAGGTGCC GACAGGAGAA
GAACCGAAAG TGGAAGTTCT AATGAAAGGC GTCGACAGCA CGCGCCTCGT GCAGCAAATC
GACATTATGC CTTCCATTTT GGGGTATCTG CATTACGACA AACCCTATTT TGGTTTCGGG
AAAAACGTTT TCGGGAACCC TCCGATCAAC TTCGCTGTCA ACTATGACGG GGTCTATCAA
TGGTTCAATG GGCCATATGT TTTGCAATTT GATGGCCGGA AAACCGTAGG TTTGTATAAA
TATCAGGAAG ACAAGCTCTT GAAAAATAAC CTTGCAGGCC GTATTCCCGA GGTTCAGGGG
CCTATGGAAT TGCAGGTGAA GGCATTTATC CAGCAATATT CCAACCGGAT GCTGGACGAC
AAGCTGACGG TAACACCTTA G
 
Protein sequence
MSSYTGASGF GWRIHQVLLI RLLGVMILFS ICRILFYLFN QSFFPEVNLA LAPRLFLGGV 
RFDLVAVLYT NILYVFLTLI PVTFKYRPAF QTFLDYLFII TNSIALLANC ADFIYYRFTI
KRSTWSVLQE FSHENNMHKL FGGFIVNYWY VVLVWIMLVA LVVYITRRWR VSTRPARPVW
QIFLVHSVLM TVSVFLFIGG VKGGFRHSTR PITLSNAGEY VDKPKEMFIV LNTPFCIFKT
LKRSDYQRAD YFRSEQEMND VYSPIHTPEP DAPAFRPKNV VILIWESFGK EMVGTYNKDL
ENGTYKGYTP FIDSLMPHSK VFWYSFANGA KSIEAIPSVL TSIPGILEPF ILTRYTDNKL
PSLPEMLQGK GYHTSFFHGA PNGSMGFKAF TNLIGIKDYY GKTEYNNDAD YDGIWGIWDE
EFMKFWGRKL DTFQEPFMST LFTVSSHDPY KVPARYRGKF PKGPLPIYET MGYTDNALRR
FFDSVKDKPW FKNTLFVITA DHAATFAHYP KYQTSVGNFS IPIIFYAPGD ASPGEVPTGE
EPKVEVLMKG VDSTRLVQQI DIMPSILGYL HYDKPYFGFG KNVFGNPPIN FAVNYDGVYQ
WFNGPYVLQF DGRKTVGLYK YQEDKLLKNN LAGRIPEVQG PMELQVKAFI QQYSNRMLDD
KLTVTP
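
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11, as listed in the gene information. The Python sketch below illustrates that translation. It assumes the codon assignments of NCBI table 11, which to my knowledge match the standard code and differ only in the permitted start codons (not modeled here). `CODON_TABLE` and `translate` are illustrative names, and the code is not the pipeline that produced this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base blocks used in this record) is ignored.
    Any character other than A, C, G, or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: translation ends here
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues.
print(translate("ATGAGTTCAT ACACAGGCGC TTCCGGTTTC"))  # MSSYTGASGF
```

Passing the last 21 bases of the gene sequence (`AAGCTGACGG TAACACCTTA G`) returns `KLTVTP`, the final six residues, with the terminal `TAG` read as the stop codon.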