Gene Dfer_5566 details

Gene Information

Locus tag: Dfer_5566
Symbol:
ID: 8229181
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dyadobacter fermentans DSM 18053
Kingdom: Bacteria
Replicon accession: NC_013037
Strand:
Start bp: 6691158
End bp: 6692369
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 50%
IMG OID: 644933412
Product: histidine kinase
Protein accession: YP_003089921
Protein GI: 255039300
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID:
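
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive coordinates (an assumption that is consistent with the listed 1212 bp) and that the protein length excludes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by a CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(6691158, 6692369)
print(length)                  # 1212
print(protein_length(length))  # 403
```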


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.430665
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCGG AAGCACCCCA TTTACACACC ATTTCCCATG ACAATATCCC GCTTTGCCGG 
GTAGGCGGCA GCGTCTTAAA CGCCGGCCCA AAAGCGCTTC CGGAGCAGCT ATCTGAATTT
TTTACCGGGA TTTTCAGCAC CTCCGAATGG CCGGCGAGGT GGTACTGCGG CTACTGGTCC
GATTTCCATG GCTGGCTTTA CATCGTTTCC GATCTTTTCA TCTGGGCGGC CTACTTTGTG
ATCCCCACTT TCCTGGTCCG GTTTATATTA AAACGGAAAG ACTTTCCATT CCCGAAAACC
ATCTGGCTCT TCGTGGCCTT CATCTTTTTC TGCGGCGCCA CGCATTTAAT CGACGCGCTG
ACTTTCTGGG TGCCGGTGTA CCGTTTCAGC TCATTGGTAC GGTTCGCGAC GGCCATTGTA
TCGCTCACCA CGGTTTATTA TCTGTTCAAA ATATTCCCGA ACGTCCTCCT GCTCAGGTCC
GTGGCCGATC TGCAACGGGA GATCGACGAG CGGACGACGA TCGAAGAGAA GATGGCGCAG
AAGAACAGCC AGCTGCAAAG CTTCACTCAC ATCCTCTCGC ACAACCTGCG GAACCACGCG
AGCAATATTG CGCTCCTCAC CGATTTCGTG GACGAATCGA CGCTCTCGAA GGATAATGAA
GAGCTTTTCC AGAAGATTAA AACCGTATCC AAACACCTCA ATACCACCCT CGACGACCTG
TCGCAGGTGA TCAAAATCCG CGACAACCAG CTGGAAGGCG AACAGCTGGA CATCCGGGAA
GTGACCGAAC GGGTGTTGGG CGTGCTCGAC GAAAGCCTGC ACACGAGCCA GGCCGAGGTA
CGGATGGATT TCAGCGAGCG GGAAATCGTG TTTCCGCACA TTTACCTGGA AAGCATTCTG
ATGAACCTGA TCTCGAACGG TATAAAATAC AAAAAAGATG GTGAACCTCC TTTGATCACA
TTGCGTTTCT ACCGTAACGA AAATGGCCTT AAAGTACTGG AATACAGCGA CGAAGGAAAG
GGCATCGACC TTTCGCTTCA TTCGGACAAG ATCTTTGGTT TGTATAAAAC TTTCCATAAA
CACCGGGATG CTCACGGAGT TGGACTATTT TTGATAAAAA ATCAGATTGA AGCCCAGGGC
GGGAACATTG AGGTATTCAG TAAGGTCGAC GCGGGTATTA CTTTTAAAAT TACATTCAAC
GAAAATGCTT GA
 
Protein sequence
MNPEAPHLHT ISHDNIPLCR VGGSVLNAGP KALPEQLSEF FTGIFSTSEW PARWYCGYWS 
DFHGWLYIVS DLFIWAAYFV IPTFLVRFIL KRKDFPFPKT IWLFVAFIFF CGATHLIDAL
TFWVPVYRFS SLVRFATAIV SLTTVYYLFK IFPNVLLLRS VADLQREIDE RTTIEEKMAQ
KNSQLQSFTH ILSHNLRNHA SNIALLTDFV DESTLSKDNE ELFQKIKTVS KHLNTTLDDL
SQVIKIRDNQ LEGEQLDIRE VTERVLGVLD ESLHTSQAEV RMDFSEREIV FPHIYLESIL
MNLISNGIKY KKDGEPPLIT LRFYRNENGL KVLEYSDEGK GIDLSLHSDK IFGLYKTFHK
HRDAHGVGLF LIKNQIEAQG GNIEVFSKVD AGITFKITFN ENA
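
The relationship between the gene sequence and the protein sequence above can be illustrated with a short sketch. It is not the database's own pipeline: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the codon-to-amino-acid mapping is the one listed for NCBI translation table 11, which is the same as the standard code for internal codons. Table 11 also allows alternative start codons, which this sketch does not handle; that does not matter here because the gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (NCBI translation table 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid lookup once.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First block of the gene sequence listed above.
print(translate("ATGAACCCGG AAGCACCCCA TTTACACACC"))  # MNPEAPHLHT
```

Passing the full gene sequence to `translate` and `gc_content` would allow the listed protein sequence and GC content to be compared against values computed directly from the nucleotides.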