Gene Dfer_3862 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_3862 
Symbol 
ID8227457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp4698175 
End bp4701081 
Gene Length2907 bp 
Protein Length968 aa 
Translation table11 
GC content53% 
IMG OID644931703 
Productexcinuclease ABC, A subunit 
Protein accessionYP_003088231 
Protein GI255037610 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.609984 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.59901 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAAC TAGCTGTGCC CCGGATCGAG GCCGATCAGT TCGATGAACT GGATCCGAGA 
CAATACATCC TGATCAAAGG TGCCCGAGTT AACAATCTGA AAAACATTGA TGTGGCGATC
CCGCGCAACA AGCTGGTCGT CATTACGGGC CTCTCCGGTT CCGGCAAGTC CTCACTGGCT
TTCGACACGC TTTTCGCGGA GGGCCAGCGC ATGTACGTGG AAAGCCTCAG CAGCTACGCC
AGGCAATTTC TGGGGCGGAT GGAAAAGCCC GAAGTGGAGT ATATCAAGGG CATTTCACCG
GCTATTGCCA TTGAGCAGAA GGTCAATACC CGTAATCCAA GGTCTACCGT GGGCACATCC
ACTGAGATTT ACGATTATCT TAAACTGCTT TTCGCGCGGA TTGGGCATAC CTATTCGCCC
GTTTCCGGCG AAATCGTGCG GAAAGATTCG GTGACCGACG TGGTGAATTA CATGCTTGCG
CACGATGAGG GCACGCGTGT GATGATTCTC GCGCCGCTGC TCATCCGCGA CGGCCGCTCG
CAGATCGAGG AGCTCACCAT TCTGCTATCC AAAGGCTACA ACCGCATTGT GCTCAACGAT
GAGGTAGTGT CGATAGAAGA CATTCTCGAA AATCCGGACC AGTTTCCGCA AGCCGGGAAC
CGGGTTTTTA TCCTCATCGA TCGCGCCGCG ATCCGCCACG ACGACGAAGA CACGCAATAC
CGCCTCTCCG ACTCCGTGCA AACCGCGTTT TTTGAAGGTG AAGGCACGTG TACAGTGCAT
ATTGTAGGCG GTGAAAAAAG GGTTTTTTCC GATAAGTTCG AGCTCGACGG CATTACGTTC
GAAGAACCGA CCGTGAACCT TTTCAGCTTT AACAATCCAT TCGGCGCCTG CAAAAGGTGC
GAAGGATTTG GAAAAATACT AGGCATTGAC CCCGACCTGG TGATCCCCGA TAAAAATCTT
TCGGTGTACG ACGGTGCCAT TGCGCCCTGG CGCACCGAAA AAATGGGCGA ATGGCTGGTG
CCGCTCGTCC GCAACGGCGT CAAGTTCGAT TTCCCGATCC ACAGACCTTA CAAAGACCTA
ACGCCGGTCC AAAAAGAGCT TTTATGGACC GGTAACCAAT ATTTTCAGGG AATCAACGAT
TTCGTTTCAG AACTTGAATC ACAGACACAC AAAATCCAGT ACCGGGTAAT GCTCTCGCGT
TTCCGTGGGC GCACCACCTG CCCCGACTGC CGCGGCTCGC GCTTGCGGAA GGATGCTTCG
TATGTCAAAA TCGGCGGCAA ATCCATTACC GATCTGGTGT TAATGCCTAT CCAGGATGTT
TCGGTGTTTT TCAAACAGCT CGAACTCGGC GACCACGACC GGCAAATCGC CCGCCGTGTC
CTGACCGAAA TCGAATCGCG CCTCGACTAC ATGAACCGCG TGGGCCTCGG CTACCTGACA
TTGAACCGGT TGACGAGCAC ATTGTCGGGT GGCGAGTTCC AGCGCATTAA GCTGGCCACT
TCGTTGGGAA GCGCGCTGGT GGGCTCGATG TACATTCTGG ATGAACCGAG CATTGGCTTG
CACCCGCGCG ATACGCAGAG GCTGGTAGGC GTATTGGAAT CGCTGCGCGA TCTCGGCAAT
ACGGTGATTG TGGTGGAACA CGAGGAGGAA GTTATGCACG CCGCAGATCA GATCATCGAC
ATCGGCCCCG AAGCGGGAAC CGGCGGCGGC CACGTCGTTT TTCAGGGAAA TCAAAGCGAT
ATCGAGGAGC TGAAAATCAA TGGTTCAGGG GCAGGGAATA GTCAGGAGGG CATTGGTAGT
CAGGAGGTCA TGAGTAGTGG GGCCGGGCAT ATGCGGTCAC ATACTTTGGA TTTTCTTTTG
GGAAATGATT CCATTCCCGT TCCTTCTGTC CGCAGAAAGT CTTTGCATTT CCTGGAAATT
AAAGGTGCCA GGGAGAATAA TTTGAAGGAG CTGGACGTGA AAATTCCTTT GAACAACCTG
ACTGTCGTAA CGGGCGTGAG TGGTTCAGGG AAATCTACTT TGATCCGTAA AATCCTGTAT
CCCGCTTTAA TGCGCTTGAA AGGCGAATTC AGCGAGGATG TGGGGCGTTT TGACGCGCTC
ACGGGCAGTA TCGATCGCAT TGAGGCGGTG GAGATGATCG ACCAGAATCC GATCGGCAAG
TCGTCGCGTT CCAACCCGGT GACTTATATT AAGGCATATG ATTATATCCG GCAAATGATG
TCGGAGCAGC CGCTCTCGAA AGCGCGCGGC TATAAACCTT CGCATTTCTC CTTTAACGTC
GACGGCGGCC GCTGCGAAAT ATGCCAGGGT GAAGGGGAAG TGAAGGTCGA AATGCAGTTT
ATGGCCGACA TTTACCTGAC TTGCGAGGGC TGCAACGGCA AGCGTTTCAA GCAGGAAATA
CAGGAAGTGC GCTATCACGA CAAAGATATT GCCGAAATCC TGGACATGAC TGTTGATGAA
GCGATCGACT TTTTCCGGGA AACGGAACCT AAGCTGGCCG ACAAGCTGCT GCCTTTGCAG
GAAGTGGGTC TCGGGTACGT CGGTCTGGGG CAGTCGTCGA ACACCTTGTC CGGCGGTGAA
GCGCAGCGTG TGAAACTGGC CTCTTTCCTC GGCAAAGGCT CTTCCAACAA AGGCAAAACG
CTGTTTATTT TCGATGAACC AACGACCGGT CTGCATTTCC ACGACATTAA AAAGCTGCTG
AAAGCCATTA ATGCGTTGGT GGAACAGGGC GATAGCGTGA TCATCATCGA GCACAATATG
GAGGTGATCA AAAGCGCCGA CTGGATCATC GACCTTGGTC CGGAGGGCGG CGAAAACGGG
GGTAACCTCA CCTTCACTGG CACACCGGAG GAGATGCTTA AACTGGACGG GAATTACACG
GCGGAGTTTT TGAAGGAAAA GATTTGA
 
Protein sequence
MMQLAVPRIE ADQFDELDPR QYILIKGARV NNLKNIDVAI PRNKLVVITG LSGSGKSSLA 
FDTLFAEGQR MYVESLSSYA RQFLGRMEKP EVEYIKGISP AIAIEQKVNT RNPRSTVGTS
TEIYDYLKLL FARIGHTYSP VSGEIVRKDS VTDVVNYMLA HDEGTRVMIL APLLIRDGRS
QIEELTILLS KGYNRIVLND EVVSIEDILE NPDQFPQAGN RVFILIDRAA IRHDDEDTQY
RLSDSVQTAF FEGEGTCTVH IVGGEKRVFS DKFELDGITF EEPTVNLFSF NNPFGACKRC
EGFGKILGID PDLVIPDKNL SVYDGAIAPW RTEKMGEWLV PLVRNGVKFD FPIHRPYKDL
TPVQKELLWT GNQYFQGIND FVSELESQTH KIQYRVMLSR FRGRTTCPDC RGSRLRKDAS
YVKIGGKSIT DLVLMPIQDV SVFFKQLELG DHDRQIARRV LTEIESRLDY MNRVGLGYLT
LNRLTSTLSG GEFQRIKLAT SLGSALVGSM YILDEPSIGL HPRDTQRLVG VLESLRDLGN
TVIVVEHEEE VMHAADQIID IGPEAGTGGG HVVFQGNQSD IEELKINGSG AGNSQEGIGS
QEVMSSGAGH MRSHTLDFLL GNDSIPVPSV RRKSLHFLEI KGARENNLKE LDVKIPLNNL
TVVTGVSGSG KSTLIRKILY PALMRLKGEF SEDVGRFDAL TGSIDRIEAV EMIDQNPIGK
SSRSNPVTYI KAYDYIRQMM SEQPLSKARG YKPSHFSFNV DGGRCEICQG EGEVKVEMQF
MADIYLTCEG CNGKRFKQEI QEVRYHDKDI AEILDMTVDE AIDFFRETEP KLADKLLPLQ
EVGLGYVGLG QSSNTLSGGE AQRVKLASFL GKGSSNKGKT LFIFDEPTTG LHFHDIKKLL
KAINALVEQG DSVIIIEHNM EVIKSADWII DLGPEGGENG GNLTFTGTPE EMLKLDGNYT
AEFLKEKI