Gene Dfer_4139 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDfer_4139 
Symbol 
ID8227737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDyadobacter fermentans DSM 18053 
KingdomBacteria 
Replicon accessionNC_013037 
Strand
Start bp4998475 
End bp5001864 
Gene Length3390 bp 
Protein Length1129 aa 
Translation table11 
GC content55% 
IMG OID644931982 
ProductBeta-galactosidase 
Protein accessionYP_003088507 
Protein GI255037886 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.852691 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTTT CTACCCCCCG CGCATTTCTG ACGCTGTTTT TTTTAATACA AATCGTGCAC 
CACGCGGCTC ATGCACAGGC CATTCCCGAA TGGCAGGACC CGCAGGTGAT CAGCATCAAT
ACCGAAAAGC CCCGCGCCGA TTTTTTCCCG TACACCACGG AAAAAGCCGC ATTGGCAATG
GACAAAAAAG GCTCGTTCGT GCAGTCGCTG AACGGCAGCT GGAAGTTCAA ATGGGCACCG
CATCCGTCGA AAGCGCAGCT GAATTTCTAC GATCCCAAAG TCTCCGACGC AAGCTGGGAT
AACATCCCCG TTCCGTCCAA CTGGCAGGTC GTCGGCGCGC GCGAAGGCCG CAAATACGAC
CGCCCGATTT TCAGCAACAT CAAACACCCG TTTAAGGCCA CGCCGCCGCG CATTAATGCG
GATACCAATG CGGTAGGCAT GTATCGAACC ACGTTCACGG TCGCAGATGT GAAGGATAAG
CAAATATTCC TGCATTTCGG TGGCGTGCAG TCTGCCTGCT ATGTGTGGCT GAATGGTGTA
GCGATCGGTT ATCACGAGGA CGGCATGACG CCATTCGAGT TCGACGTGAC AGAAGATGTA
AAGGCCGGCG TGAACAACCT GGCTGTGGAA GTTATCAATT GGTCAGACGG CAGCTACCTC
GAAGACCAGG ATTACTGGCG GCTGTCCGGC ATTTTCCGGG ATGTGAACCT GCTGCTTTTG
CCAAAGGTAG TTTTGACAGA TTATTCGGTA AGGACGATCC TGGACGCCAA CCACGACAAT
GCGACTTTGA AACTCAGCGC ATTTGTAAAA AACTACGGCC AGCAGCCCAT TCATGCGCAC
CAGGTGCTGT TTACATTGTA TGATGCTGCC AAAAACGTGG TGACTACGCC TGTGAGCCAG
ATGGTAGGCA CGCTTGAAAT GGGCCGCGAA GGTGCCGTCC GTGCCGAAAT GCCGGTGCCG
AGCCCCGCCA AATGGAGTGC CGAAACGCCG AATTTGTACA TGCTCACCGT CCAGCTCATG
AACTCCGACG GCAAGGTGAT CGAGGCGACC AGCCAGCGCG TGGGCTTCCG GGATGTAAAA
ATCAAAGGCG GACAACTCCT TGTGAATGGC AAAGCGATAA CCATCAAAGG CGTGAACCGC
CACGAGTTCG ATCCCGAAAC CGGCCGTGTG ATCAGCCGCG AGTCGATGAT GCGGGACATT
ACTTTAATGA AGCAGCACAA TATCAATGCC GTAAGGACCT CGCATTATCC CAATGCTTCC
GAATGGTATG ACCTGTGCGA CCAATATGGC CTTTATGTGA TGGATGAGGC CAATATCGAG
AGCCACGAGC TGTGGAGCAA AGGCATTATC CTCGCCGACA ACCCGCAATG GCGCTCGGCA
TTTCTCGCAC GCGGCAATGC GATGGTGGAA CGCGACAAAA ACCACCCGTC GGTGATCATC
TGGTCGCTGG GTAACGAGTC GGGAATGGGG CAGAATTTCG TGGATATGGG CGATTTCATC
AAGCTCGCCG ACCCTACCCG GCCGATCCAT TACGAGGGCC GGAAAGATTA CAAACCAACC
ACGCTGAGCA GTTTCGACAT TATTTCCGTC ATGTATCCAT CCACGCAGGA TATGACCGAG
CTGGTTAAAA AGGACAAAAC CCGCCCGCTG ATCGTATGCG AATATGCGCA CGGCATGGGC
AATAGCGTCG GTAACCTCAA AGAATATTGG GACGTAATCG AAAAATATCC CACCATGCAA
GGCGGCTTCA TCTGGGACTG GGTGGACCAG GGACTCAAAC TGAAACGCCC CGACGGCACC
GATTACTGGG ATTACTTCAA TTACCTCGAC GGCGCCAATG CCGGCGACGG CCTCGTGAAC
CCCGACCGCA CGCCCCAGCC CGAGCTGAAC GAAGTGAAAA AGGTGTATCA ATATGTCAAA
TTTGAAATGC CCGACACGCT GAAAACTGGC GAAAAAGCCC TTACGCTGCA CAATACCTAC
GACTTTCAGT CGCTGAATGC ATTTGAACTG GTTTGGTCAG TGATCGAAAA CGGCAAGCCG
GCAGGCAAAG GCGGCAGCAT TGCAAACCTG AATGCATTGC CGCGCCACAA ACAGCAGCTT
ACCATCCCCT ACGAGCTGCC TGCCGCTTCC AAACCGAATG CGGAATATTT TCTGAATCTG
AGTTTGAGAC TAAAAGATGC CACGCTGTGG GCGCCGAAAG GCCATGAAGT GGCCTGGCAT
CAGGTGCCGG TGGTGAAACC GGCCACACCG CGGCCAAACC TGAGCCTGTA CGGCGAACGC
CCGCTCCGCA TTGCGCAAAT CAGCTCGGCC CGCGTGCAGG TTGCCGGCCA GGATTTCACG
GTGGTGTTCG ACAAAAACGA AGGCCGCATG ATTTCTTTTA AAAACAAAAA AGAAGAAATG
CTCGAAAGCG GGCCATATGC CAACTTCTGG CGCGTCCCGA CCGATAACGA CGAAGGCGGC
GCAGCCAAAA GCTACGCCAC GCAATGGCGC AATTTCGGTC TGGATACATT GGAGCGCGTT
TCCTCCGAAA TGAAAACCCA GCGGCTCACG GCACAGATTT ACAAGGTGAC GCTCAGCCAG
ACATTAAAGC AGCCAAAAGG TGAAATGGAT GTGCAATCGG TATATATGGT TTATGCTTCG
GGCGACATTC ATGTACAAAA CACTTTCACG CCCCGCGGCG AATGGCCCCC GCTCGCCAAA
ATTGGCATGC AGCTCCGAAT GCCGGCCACA TTTACCAAAA CCCAATGGTT CGGCAACGGC
CCCCACGAAA CCTATGCAGA TCGTAAAACG AGCGCAAAAG TGGGCATTTA CGCCGGAACG
GTGGCCGAGC AGCATTTCCC TTACATTACG CCGCAGGAAA ACGGCAATAA AACCGGCATT
CGCTGGGCAA CCGTCACCAA TGCGGAAGGC ACCGGCCTGC TCGTGCTGAG CGATACCGCG
TTCAATTTCA ACGTGCACGA TTATACAGAC AAAGACCTCC TGGCAGCAAA ACGCCGCGCA
GCGGTTTTGG CCCGGGGAAC GTCCACGACC GTTAACATCG ACCTCGCGCA AATGGGCCTG
GGCGGCGACG ATAGCTGGTC GCCGCGCGTG CATGAAGCGT ACCTGCTGCC CGCGAAAACC
TATTCATACG CATTCAGGCT GAGGCCGATC GAAAGCACTT CCAATATCGA GCAGATAGCG
GCCGTGCGCC TGCCTTATGT GGATCAGAAA GAAACCAATG AGAGCGTCTC GACCGCGGAA
ACCGCTGCGG CGACCGAAGA AGCAGTAACA GAAGACGAAG AGGAAGAAGA GGCGGTAACA
GCACCGGTAC GCAAAACGAC GGTCAAAAAA GCGCCTGTTC GCAAAAAGGT CGTGCGGAAG
AAGAAACCGA CCCGGCGCCG CAGGAGATAA
 
Protein sequence
MQFSTPRAFL TLFFLIQIVH HAAHAQAIPE WQDPQVISIN TEKPRADFFP YTTEKAALAM 
DKKGSFVQSL NGSWKFKWAP HPSKAQLNFY DPKVSDASWD NIPVPSNWQV VGAREGRKYD
RPIFSNIKHP FKATPPRINA DTNAVGMYRT TFTVADVKDK QIFLHFGGVQ SACYVWLNGV
AIGYHEDGMT PFEFDVTEDV KAGVNNLAVE VINWSDGSYL EDQDYWRLSG IFRDVNLLLL
PKVVLTDYSV RTILDANHDN ATLKLSAFVK NYGQQPIHAH QVLFTLYDAA KNVVTTPVSQ
MVGTLEMGRE GAVRAEMPVP SPAKWSAETP NLYMLTVQLM NSDGKVIEAT SQRVGFRDVK
IKGGQLLVNG KAITIKGVNR HEFDPETGRV ISRESMMRDI TLMKQHNINA VRTSHYPNAS
EWYDLCDQYG LYVMDEANIE SHELWSKGII LADNPQWRSA FLARGNAMVE RDKNHPSVII
WSLGNESGMG QNFVDMGDFI KLADPTRPIH YEGRKDYKPT TLSSFDIISV MYPSTQDMTE
LVKKDKTRPL IVCEYAHGMG NSVGNLKEYW DVIEKYPTMQ GGFIWDWVDQ GLKLKRPDGT
DYWDYFNYLD GANAGDGLVN PDRTPQPELN EVKKVYQYVK FEMPDTLKTG EKALTLHNTY
DFQSLNAFEL VWSVIENGKP AGKGGSIANL NALPRHKQQL TIPYELPAAS KPNAEYFLNL
SLRLKDATLW APKGHEVAWH QVPVVKPATP RPNLSLYGER PLRIAQISSA RVQVAGQDFT
VVFDKNEGRM ISFKNKKEEM LESGPYANFW RVPTDNDEGG AAKSYATQWR NFGLDTLERV
SSEMKTQRLT AQIYKVTLSQ TLKQPKGEMD VQSVYMVYAS GDIHVQNTFT PRGEWPPLAK
IGMQLRMPAT FTKTQWFGNG PHETYADRKT SAKVGIYAGT VAEQHFPYIT PQENGNKTGI
RWATVTNAEG TGLLVLSDTA FNFNVHDYTD KDLLAAKRRA AVLARGTSTT VNIDLAQMGL
GGDDSWSPRV HEAYLLPAKT YSYAFRLRPI ESTSNIEQIA AVRLPYVDQK ETNESVSTAE
TAAATEEAVT EDEEEEEAVT APVRKTTVKK APVRKKVVRK KKPTRRRRR