Gene Dfer_1039 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_1039 
Symbol 
ID8224609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp1225806 
End bp1227233 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content55% 
IMG OID644928900 
Productprotein of unknown function DUF1501 
Protein accessionYP_003085453 
Protein GI255034832 
COG category 
COG ID 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.214334 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.240283 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATATCC AATGGAGTAG GAGAGAGTTC TTGCAACGCG CCAGCGCGGC CACGATGGCT 
GCATTGGCAG CGGGAGCTCC CGTATCAAAC CTGCTAACCT CGTGCCGGGG TAAAGCGGGA
GCCGATTCGA CGGCGGACAC GGTGATACTA TTATGGATGG CGGGCGGTAT GGCGCACACC
GAAACTTTCG ACCCCAAAGC CTATACGCCA TTCGAAAAAG ACATGGAAGG AAACCGTGTT
TTGAGCACAT TCAAATCGCT TCCTACCAAG CTCGACGGCA TCCATTTTTC GGAAGGCCTG
CAATCGATAG GGCAAGTGAT GGACAAGGGA ACACTCATCC GCTCTTACGT GGCGGCCGAC
ATGGGACATA TCCTGCATTC GCGGCACCAG TACCACTGGC ATACCTGCTA CGAGCCGCCA
CAAACGGTGG CCGCACCGCA TATGGGTTCG TGGATTGCAA AGGAGCTGGG ACCCAAAAAT
CCGGTAATCC CCGCATTCGT GGACATCGGT CAGCGCTTCA CGGTAGGCGA GGCCGAAGAG
TTGAAAGCAT TCCATACGGC GGGCTTCCTC GGCAACGAGT TCGGACCGTT CTTTATCCCC
GACCCGAGCC AGGGCCTCGA AAGCGTGCGT CCACCCGTGG GCATGGATGC GAAGCGTTTT
GAACGTAGAA ACCAGCTGTA CAACGAGCTG ATCAATAACA GCCCGGTGGG GGAATTTGGC
AGCGACTACC AGCGCGAATC CCTCAAACGC TCCATGGAGC AGGCTTATGC ATTGCTCAAT
TCGCCGGAAT CCAAAGCATT CGACCTCAGC ACCGAACCTA AGAAAAGCTA CGACATTTAT
AACACCGGCC GCTTCGGGCT CGGTTGCCTG CTCGCACGCC GCCTGACCGA ACAAGGTGCC
CGGTTCATCA GCGTGACCAC CGAATATGAG CCGTTCAAAG GCTGGGACAC GCACGAAAAT
GGTCATACGC GTTTGCAGGA AATGAAAAAG CAGATCGACG GTCCGGTGGC CCAGCTTATT
AAAGACCTCG ATGAAAAAGG CCTGCTCGAC CGCACTATGG TTGTCCTCGC GAGCGAATTC
AGCCGTGATA TGATGGTGGA AGGTCGCCCG GATGCGAAAG TGAAGGAACA GGTAGCGCAG
CCGGACATCC TTTCGGACCT CAAATTCTAC GGCATGCACC GCCATTTCAC CGACGGCTGT
TCCATGCTCA TGTTCGGTGG CGGCATTAAA AAGGGCTTTG TATACGGCAA AACCGCCGAC
GAACGCCCAT GCAAAACGAT TGAGAACCCG ATCAAGATCG AAGGCGTTCA CCAAACCATC
TACCACGCGC TCGGCATTCC GCCGGACACG CAATATGAAA TCGAAAAGCG GCCGTTCTAC
ACGACACCGG ATGGTAAGGG GCTGGCGGTG AAGGAATTGT TGATATAG
 
Protein sequence
MNIQWSRREF LQRASAATMA ALAAGAPVSN LLTSCRGKAG ADSTADTVIL LWMAGGMAHT 
ETFDPKAYTP FEKDMEGNRV LSTFKSLPTK LDGIHFSEGL QSIGQVMDKG TLIRSYVAAD
MGHILHSRHQ YHWHTCYEPP QTVAAPHMGS WIAKELGPKN PVIPAFVDIG QRFTVGEAEE
LKAFHTAGFL GNEFGPFFIP DPSQGLESVR PPVGMDAKRF ERRNQLYNEL INNSPVGEFG
SDYQRESLKR SMEQAYALLN SPESKAFDLS TEPKKSYDIY NTGRFGLGCL LARRLTEQGA
RFISVTTEYE PFKGWDTHEN GHTRLQEMKK QIDGPVAQLI KDLDEKGLLD RTMVVLASEF
SRDMMVEGRP DAKVKEQVAQ PDILSDLKFY GMHRHFTDGC SMLMFGGGIK KGFVYGKTAD
ERPCKTIENP IKIEGVHQTI YHALGIPPDT QYEIEKRPFY TTPDGKGLAV KELLI