Gene Dfer_4541 details


Gene Information

Locus tag: Dfer_4541
Symbol:
ID: 8228144
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 5478360
End bp: 5481350
Gene Length: 2991 bp
Protein Length: 996 aa
Translation table: 11
GC content: 47%
IMG OID: 644932387
Product: hypothetical protein
Protein accession: YP_003088907
Protein GI: 255038286
COG category:
COG ID:
TIGRFAM ID:
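
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the stated gene length includes the stop codon; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information entries above.
start_bp = 5478360
end_bp = 5481350
gene_length_bp = 2991
protein_length_aa = 996

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
span_bp = end_bp - start_bp + 1

# Assumption: the gene length includes one 3-bp stop codon that is not translated.
expected_protein_aa = gene_length_bp // 3 - 1

print(span_bp == gene_length_bp)                 # True
print(expected_protein_aa == protein_length_aa)  # True
```

Under those assumptions, the 2991 bp span corresponds to 997 codons, of which 996 encode amino acids and the last is the stop codon.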


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.444049
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.286675
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACAAAT GTACTTTTAC AAAAATCTTC CTCACCTTCT TAATCTTTGG CACGTGTGTG 
GCGCAAATTG CACTCGGTCA GGGAACACCA TGGAATGGCT TTTACGGGAA CGAATGGCTT
CAGGGGAAAT ATAGCCAGAA GTGGTTGCGC ATAGGAATAT CTCAAAAAGG TATACATCGT
GTTACGCTTC CGGCTGGTTT TATGGTGGGG TCCGACCCCA ATGTGTTACA CCTGTATCAC
CGCGGCGTAG AAGTGGCGCT TACTGCTGCA AGCACAACCC AAATCGAATT TTACGGCGTA
CCCAACGACG GGGCTTCAGA TTCGTTGCTT TACCGTCCTT ATTCCGCACG GACGAACCCA
TATTACAGCC ACTTTTCCGA CGAAAGTTCT TATTTTCTGA CAATCGGTTC ACCTAATGGG
AAAAGGGCTG CGGTGGAGAA TTTGACCGAC CCTGCCGGGT CTGACCTGCT GACCTACCAC
ATGCAAACCG ATAATGTGGC ATATCCAAAC GATTACACCC ATGCCACCAA TTATCCGACC
AGGCCAACGA ACTTGAACAG CTTCATGGAA GACGGCCAAA CCAGGAGTTC GACAAGGTTT
AATGATACGG ATGCGCACCC GATTTTTTCG GAGTTCCCTA TTTCAGTGAA AAAGCAGGTG
GGATCTGATC CGCCGCAGGC CGAAATTCTC GTTCACGGTC GCTCGAATTA TTCGCCTAAT
GGCATTTATC CGCGAAATAT TCATGCCTAT GTCGGGAAAG ATGCAGGCTC GTTGAGAAAA
GTGGATCAGA AGTCGATCGA CGGATTTATG TACGCGAAAT TCAATTTCCC GATTCAACAG
ACAGACCTCA ATGCGGGTGT AGGACGATTT GGATTTCAGC TGGACGCGCC GTTACAAGGT
GCGGCGAACG ACAGATTTTC TGTAACCTAT TATAATGTAA CATATCAGCA GCAAATCGAT
ATGTTCGGCA GCAATTCTTA TCTGTTTACA TTTCCATCGG CTGCTCAGGG TTCAAAAAGC
AGAATTGCGG TGACAACACC TCCGGGAGGC ACTGTGAAGT TCTATGATAT AACTGACCCG
GCGAATCTCC GTGTGGTTAA TGGAACAACT GCGAGTGTCA TTTTCAGCCG ACCGAATACG
AAACCGCTGT TAATGCTGGC CACCAACCAA ACCTTTGATG TAGACGGAGG CAAAGTTTCT
GTGGTGACAT TCCCTCAGAT CAACAAGTCG GACTACGACT ACATGATCGT GAGCAGTGAA
TCGCTGATCG AAGCCGCGGA CTCCATTGCA TTGTACAGAA GCGCTGAGAC GGCGGGAAGG
AAATTTAAGA CAGGTATTTT TAAGATCAGG GATATCTATA ATCAATTTAA CTATGGAGAG
CCGAGCCCTA TTGCTATCAG GCGATTTGTT GACTTTATGA TTTCTGATTT GAAAAAGGAT
AATCTTGGCA GGCTGGATAA GTATTTATTG CTGCTTGGAA TATCTGTTAC CCGTAATGAC
CGAATCAATA AAGAGCTGCT GAACGAAGTT CCGACGTTCT CTTTCCCCGG TTCGGATCTC
CTGTTGATCG AGGGTCTCCA AGGCACGCCC AGGGACGTTC CGGCTATTCC TGTCGGTCGA
ATTCCTGCGA CCAACAATGC ACAGGCACTG GCTTATCTGA GGAAAGTGAA GCAGTATGAA
AATGCGACAA CCAATTTGGC CTGGCGGAAA AATGTGGCAC ATATCAGCGG AGGGAAAACC
TCTTCCGAGA TCAACGACCA CGTTTCCAAC CTGAACACGG CAGCGCTCAA AGTGAAAGCG
AATGTTTTTC AGGGAAGAGT TTATACGGCT AAAAAACCCG TTGCAACGGA TCAGGTTCTT
CAAATGGATA CATTGGCCAA GCACGTCAAC GTAAACGTCA CTGCTGCTGC CGATAGCGCC
GGTGGGCTGG GTATGATCAC CTATTTCGGG CACAGCGTTC CCTATCAGAC AGACTATAAC
TTCGGATACG TATCCGATGG TGCGAAAAAG TATAACAACC CGAACAGATA CCCGATCATT
TTCTACAATG GTTGTGATAT CCTGAATGTT TTCAGCAACA ATTTCAATGA AACTGTGAAT
ACGAACACAT CCCGGCCGCA GTCTCTTGAC TGGCTTTTGA GCGCCAACAA AGGAGCAATC
GCTGTGTTTG GAAACTCGTG GGCAGGTTAT GCTTCAAGCT GTAATAATTA CATGCAAAAG
ATCTACACGG AGCTGTTTAC CAAAAATGAC GCCGACAGAG ATTTGATGGG CCGTGTTCTG
CAAGTAGTAG CTGCGCAGAC CAAAGCGGGC GGCGGATTCA GAATGGGATT TGAAAATGCA
AGAACGGCGG AGGTTTATGT TGCCGATCAG GCGCAGATTC ATCAGACTGT TCTCCTCGGT
GATCCTGCTC TGAAAGTGCT TGTTTCAACA GAAGGAGGTT TGCCCGTTAC ACTCGTTTCA
TTTAATGCGC AAGCTGCCGG TAACCAGGTT AATGTAAACT GGAAAACTAG CTCAGAGGTG
AACAATAGCC ATTTCGTAGT GGAAAGAAGC TACAATGCTA AGAATTTCGA GGTGGTGGGA
CGTGTAGAGG GCAAAGGTAC GATCAACGAA GAGTCGGTTT ACAACTTTAT CGACACCAAG
CCGCTGCCAG GTGTGAGCTA CTACCGTTTG AAACAGGTTG ACTATGTAAC AACCGGCGCC
GACGGAAAGC AGATTGATGG CAAAAGTACC TATTCTCGCA TTGTATCTGT TGAGCGCGAA
GGAACCAGCT TGCTGACGGT GTATCCGAAC CCGGTGACTG ATGTAGTAAA CATTACTCTC
AACAATGTGG TGAAACTGAA AAAGTGGAGC CTGATTGGCT TGGACGGCGC CGCTAAAAAG
TCGGGTACCG GAGCGAAAAT CGACCTTTCG AACTACCCAT CAGGGACTTA TATGGTTGAG
ATAACCACCG TAAATGACGA CGTGTATTAT CAGAAGGTCG TTAAAAAGTA G
 
Protein sequence
MNKCTFTKIF LTFLIFGTCV AQIALGQGTP WNGFYGNEWL QGKYSQKWLR IGISQKGIHR 
VTLPAGFMVG SDPNVLHLYH RGVEVALTAA STTQIEFYGV PNDGASDSLL YRPYSARTNP
YYSHFSDESS YFLTIGSPNG KRAAVENLTD PAGSDLLTYH MQTDNVAYPN DYTHATNYPT
RPTNLNSFME DGQTRSSTRF NDTDAHPIFS EFPISVKKQV GSDPPQAEIL VHGRSNYSPN
GIYPRNIHAY VGKDAGSLRK VDQKSIDGFM YAKFNFPIQQ TDLNAGVGRF GFQLDAPLQG
AANDRFSVTY YNVTYQQQID MFGSNSYLFT FPSAAQGSKS RIAVTTPPGG TVKFYDITDP
ANLRVVNGTT ASVIFSRPNT KPLLMLATNQ TFDVDGGKVS VVTFPQINKS DYDYMIVSSE
SLIEAADSIA LYRSAETAGR KFKTGIFKIR DIYNQFNYGE PSPIAIRRFV DFMISDLKKD
NLGRLDKYLL LLGISVTRND RINKELLNEV PTFSFPGSDL LLIEGLQGTP RDVPAIPVGR
IPATNNAQAL AYLRKVKQYE NATTNLAWRK NVAHISGGKT SSEINDHVSN LNTAALKVKA
NVFQGRVYTA KKPVATDQVL QMDTLAKHVN VNVTAAADSA GGLGMITYFG HSVPYQTDYN
FGYVSDGAKK YNNPNRYPII FYNGCDILNV FSNNFNETVN TNTSRPQSLD WLLSANKGAI
AVFGNSWAGY ASSCNNYMQK IYTELFTKND ADRDLMGRVL QVVAAQTKAG GGFRMGFENA
RTAEVYVADQ AQIHQTVLLG DPALKVLVST EGGLPVTLVS FNAQAAGNQV NVNWKTSSEV
NNSHFVVERS YNAKNFEVVG RVEGKGTINE ESVYNFIDTK PLPGVSYYRL KQVDYVTTGA
DGKQIDGKST YSRIVSVERE GTSLLTVYPN PVTDVVNITL NNVVKLKKWS LIGLDGAAKK
SGTGAKIDLS NYPSGTYMVE ITTVNDDVYY QKVVKK
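
Both sequences are printed in blocks of ten characters, so the whitespace has to be removed before any computation. The following is a minimal sketch, not the database's own method, of how the gene sequence could be cleaned, its GC content computed, and its translation checked against the protein sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons). The function names `clean`, `gc_percent`, and `translate` are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard codon assignments, enumerated in T, C, A, G order.
# Translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks of the 10-character block layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Ambiguous bases such as N are not handled and raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAACAAAT GTACTTTTAC AAAAATCTTC"
print(translate(first_blocks))  # MNKCTFTKIF
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `gc_percent` and `translate` would allow the listed GC content and protein length to be checked in the same way.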