Gene Dfer_4941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_4941 
Symbol 
ID8228547 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp5969343 
End bp5972462 
Gene Length3120 bp 
Protein Length1039 aa 
Translation table11 
GC content55% 
IMG OID644932789 
ProductTonB-dependent receptor 
Protein accessionYP_003089306 
Protein GI255038685 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAAAA ATCTACTTCA CAACAACTTG AAGTTACTAT GCCTTTTCCT CATGCTCTGC 
TGCCAGATGG CGCGGGCACA GGACGGGCAG GTAACCGGGA AAGTTACAGA CAAGGACGGA
ATCGAAATCC CGGGTGCGAA TATTGCCGTA AAAGGAACTT CACAAGGCAC TTCGGCTGAC
GGCAACGGCC GGTACACCAT TCAGGCACCA CCAGGGGCCA CGCTGGTATT CAGTTTTATT
GGTTTCAAAA AGCAGGAGGT AGTGCTGGGT TCTCAACGTA CCAATGTGAA TGTACAAATG
GAACCCGACG TTTCCATGCT CAACGAAGTG GTGGTGACGG CACTCGGACA AACGCAGGAA
AAACGGGCGA TCGGCTACTC AGTCCAGTCG ATCCGGTCGG ATGAAATCCG GGAATCGGGT
AACCCGAACA TGATCGGGGC ATTGCAGGGC AAGATCGCCG GCGCGGTGAT TACCGGGTCG
GGCGGGGCGC CCGGAGCGGG AGTGAACATC ATTCTCCGGG GCATCACCTC GCTCAGCGGC
AGCGCCGACA ACCAGCCGCT GTTTGTGATC GACGGCATCA TCATCAGCAA TGCCACCACA
GCCGGAAACC CGCTGCCGAG CGCCGGATCG GCCTCGCCGG GCGCCTCCGA GCAATTTGCG
AATACCAATC GCGCGGCGGA TATCAATCCA GACGATGTCG AAAGCGTTTC GGTCCTGAAA
GGCCCTGCCG CAACGGCGCT TTACGGCCTG CGCGCTTCCA ATGGGGCCAT TATTATCACT
ACCAAACGCG GCAAATCGGG AAAATTGAAC GTGTCGGTGT CGCTTTCGGG CGGGGCGGAT
GTACTGGGGA AGTCGCCCGA CATTCAAACG CGCTTCATTC AGGGACGTTT CGGCGAGTTC
ATTTCCCCCA CCGAAGTGCG CCAGCGGACG CCCTACCAGT CGTTCGGGCC GTCGATCATC
GGCAATAATA CCGACCGTAT TTACGACAAC TTCCGCGGAT TTTACCAAAC AGGCTTCCGC
ACAAACAACA ACATTACCCT GACGAAGGGC AGCGAAAAAG GCAATATCTA CGTCTCGGCG
GGTCATAATT ACCGGCAGGG GATCGTGCCG GCCACGCATT TTGAGCGGAC ATCGGCCAAA
ATAGCAGGCA CGTATCATTT CACCAGGCGA TTTTCGGCTT CCGGCTCCTT CAACTACATC
CGCTCCGGCG GGAAACGGCC GCCGGCGGGC GATAAGTCGG TGTTCAGCGC ACTTTTCTAC
TGGCCTAATA CCTATGACGT GAACGATTAC CTGAATGCGG ACGGCTCTTA CAAAAACCTC
CTGCCCGGTT TCACCGACAA CCCGGTGTAC CTGGTCAAAA AGAGTCCGCG GGTCGACGCC
GTGAACCGCT ATATAGCCGA CCTGACATTA AACTACAACA TTACCGACTG GCTTACAGCA
AAGTATCAGA TCACCCTGGA CGCATTCAAC GAACGCCGTA ATCGCCAGGT AGACAGTACT
TTCGATGTCG GAACGGCGGT GAAAGGCTTC CTGATCAAAG AGTACATTAA TTATCGTGAG
GTCAATTCCA ATTTGTATGT GACGGCCTCG AAACAGTTCA ACGAAAGCTG GAACGGCTCG
CTGATGGTCG GTAACTCGGT GGTCGGCAGC AAGCGGCCCG ATAGCTATTA TGAGCGGGGA
GAGGGCTGGA ATGCGCCTTT TACGGACGAG ATCAGCAGTT ATCGCAACCA GCAGAAGCGT
TTCTATTCCC CGCTGCAATA CCGGATCGTC AGCTTCTTTG CCGATGCGAA ACTGAGCTTT
CGGGAAATGC TCTATTTCAA TGCGACGGGC CGTAACGACA TCGTTTCGAC ATTGCCCAAA
GCCAACAATT CGTTCTTCTA CCCATCGTTC AGTGCCGGCT ACATTTTTAC CGAAAACCTT
CCCAAAAACA ACATCCTCAG TTACGGCAAG CTGCGTGCTT CGTGGGCGCA GGTGGGCAAG
GGCACCGATC CGTACGTGAT TGGCGTGTAC TACGAGCTGG CCGATAACTT CCCGTTCGGT
AACACGGTTC CCGGCTACAT CCGCCGGTCC ACCACCGCGG CCGCCAACCT GAAACCCGAG
CGGACCACGT CCATCGAATT CGGTACCGAG CTGCGTTTTT TCAGCAACCG GCTTATGCTC
GATGCCACCT ATTTTACGAT GGACAGCAAG GACCAGATCG TCCGCGCGCC GGTTTCCAAT
GTTTCAGGCT ATTCATTTTA TTATACCAAT ATCGGTTTGA TTCGCAACAA AGGCGTGGAG
CTGCTTGCGA CGTTTAAGCC GGTGCAAAAG CCGCGTTTCA GCTGGGATAT GTCTTTGAAT
TTTACCAAAA TGTCGGGGAA GGTGATCGAA ATGCCGGACG AGCTCGAAGA AATCTCCTAT
TTCGACAATG GAAGTCGCGG TGTGCTGAAA GTGCGGGAAG GCAGCAAGCT GGGCGAATTG
TGGGGCCTCG ATTACCTGCG CGCCCCGGAT GGCCAGCTAC TCATCCAGGC GAATGGTTTT
CCGCTCACAA GCCAGGTGAC GGTGCCTTGG GGTAATGCAT TGCCCGACTG GACGGCCGGT
TTGACCAACA CATTCAACTA CCGCGGCCTG GGCCTGTCGT TCCTGCTCGA ATGGCGGCAC
GGCGGCGACG TGGTCGATCT TGGTGAGCGG AATGCATTCC GGTCGGGTTC CATCGAAATC
ACCGGGCGGA GATACGAGCA GGTCGTTTTC AAAGGTGTGG TCGAGCAAAA AGGGGCCGAC
CAGTCCGTAA CCTACGTACC CAACACCAAA GCGGTGATCC TCGACGACGC ATTCTATAAC
CCATCGACGG CGCGCTATAT GGGCAATTCG GCTCAGTTCA ACATTCAGGA TGGCTCCTGG
TTCAGGCTGC GAACGGTAGG CCTTTCGTAC GCGATCCCGA AAACCGCATT GGCGAAATCG
GTTTTCAAGG GCGGCGTGCG GTTCCACTTT ACAGGCACCA ATCTTTTCCT GAATACACCC
TTCCGCGGCT ACGACCCCGA AGCGCTCACA TTCGGCTCGG GCACGAATAT CATTGGTTTT
GTGGGCAGGA ACAATCCGGC CTCGCGCAGT TTTCAGCTAG GTGTTAATGT AAATTTTTAG
 
Protein sequence
MMKNLLHNNL KLLCLFLMLC CQMARAQDGQ VTGKVTDKDG IEIPGANIAV KGTSQGTSAD 
GNGRYTIQAP PGATLVFSFI GFKKQEVVLG SQRTNVNVQM EPDVSMLNEV VVTALGQTQE
KRAIGYSVQS IRSDEIRESG NPNMIGALQG KIAGAVITGS GGAPGAGVNI ILRGITSLSG
SADNQPLFVI DGIIISNATT AGNPLPSAGS ASPGASEQFA NTNRAADINP DDVESVSVLK
GPAATALYGL RASNGAIIIT TKRGKSGKLN VSVSLSGGAD VLGKSPDIQT RFIQGRFGEF
ISPTEVRQRT PYQSFGPSII GNNTDRIYDN FRGFYQTGFR TNNNITLTKG SEKGNIYVSA
GHNYRQGIVP ATHFERTSAK IAGTYHFTRR FSASGSFNYI RSGGKRPPAG DKSVFSALFY
WPNTYDVNDY LNADGSYKNL LPGFTDNPVY LVKKSPRVDA VNRYIADLTL NYNITDWLTA
KYQITLDAFN ERRNRQVDST FDVGTAVKGF LIKEYINYRE VNSNLYVTAS KQFNESWNGS
LMVGNSVVGS KRPDSYYERG EGWNAPFTDE ISSYRNQQKR FYSPLQYRIV SFFADAKLSF
REMLYFNATG RNDIVSTLPK ANNSFFYPSF SAGYIFTENL PKNNILSYGK LRASWAQVGK
GTDPYVIGVY YELADNFPFG NTVPGYIRRS TTAAANLKPE RTTSIEFGTE LRFFSNRLML
DATYFTMDSK DQIVRAPVSN VSGYSFYYTN IGLIRNKGVE LLATFKPVQK PRFSWDMSLN
FTKMSGKVIE MPDELEEISY FDNGSRGVLK VREGSKLGEL WGLDYLRAPD GQLLIQANGF
PLTSQVTVPW GNALPDWTAG LTNTFNYRGL GLSFLLEWRH GGDVVDLGER NAFRSGSIEI
TGRRYEQVVF KGVVEQKGAD QSVTYVPNTK AVILDDAFYN PSTARYMGNS AQFNIQDGSW
FRLRTVGLSY AIPKTALAKS VFKGGVRFHF TGTNLFLNTP FRGYDPEALT FGSGTNIIGF
VGRNNPASRS FQLGVNVNF