Gene CHU_2301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCHU_2301 
Symbol 
ID4185212 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCytophaga hutchinsonii ATCC 33406 
KingdomBacteria 
Replicon accessionNC_008255 
Strand
Start bp2669155 
End bp2670129 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content39% 
IMG OID638072301 
Producttranscriptional regulator 
Protein accessionYP_678904 
Protein GI110638695 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0634001 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCAAG TAGTACTAGT TGTTCCTAAA GGGAATATAA ATATGAGCAG CATTACAGGC 
TCGTTTGAAA TTCTGTCGCG CGCTAATGCG TATTGGAAAA AGATGGGAAA CAACGCGGTG
TTTGAGATTT GTATTGCAGG GTATGAACCG GAACTCACGT TAGGCAGTGG TTTCTTTTCA
CTTCATCCGG TGGCTATAGC TTCTATAAAA AAAGCAGACC TGATCGTGAT TCCTTCTCTA
TCCTACGATT ATGAACAGGT ACTTAAAGAT AACAGCATAC TGATTGACTG GATAAGAGAA
CAGTATAAAA ATGGTGCAGA AGTTGCCAGT ATCTGTACTG GTGCATTTTT ACTGGCTGCA
ACAGGATTAC TTGACGGTAA ATCCTGTTCA ACACATTGGA ATGCTGCAAA CGATTTCAGA
AAGATGTTTC CGGATACCGA TCTTCAGGTT GACAAACTGA TTGTTGCTGA AAAAGGAATT
TATACAAATG GCGGCGCTTA TTCGTTTCTA AACCTGATTC TCTTTTTAGT TGAAAAATAT
TTCAATCGGG AAACAGCTAT ATATTGTTCA AAGGTTTTTC AAATTGAAAT TGACCGTACT
TCACAATCAC CTTTCACTAT TTTCCAGACA CAGAAAAATC ACGGCGATGC CATCATCAGC
AGGGCACAAA CGTATATAGA GGAACATGTA GGTGAAAAAA TTTCGTTTGA AGAACTAGCT
TCTCTGCTTG CCGTAAGCAG ACGTAATTTC GACAGACGTT TTATTAAAGC AACCAACAAT
ACACCGGTAG AATATTTGCA GCGTGTAAAA ATAGAAGTGG CCAAAAATAG TCTGGAAAAG
GGCCGCGCAA CTGTTTCTGA AGTTATGTAT GAAGTTGGTT ATGCGGACGA CAAAGCATTC
AGAGAAGTAT TTAAAAAGAT CACCGGTATT TCTCCGCAGC AATATCGTGC AAAATATACC
CGCGACCTTT TGTAA
 
Protein sequence
MKQVVLVVPK GNINMSSITG SFEILSRANA YWKKMGNNAV FEICIAGYEP ELTLGSGFFS 
LHPVAIASIK KADLIVIPSL SYDYEQVLKD NSILIDWIRE QYKNGAEVAS ICTGAFLLAA
TGLLDGKSCS THWNAANDFR KMFPDTDLQV DKLIVAEKGI YTNGGAYSFL NLILFLVEKY
FNRETAIYCS KVFQIEIDRT SQSPFTIFQT QKNHGDAIIS RAQTYIEEHV GEKISFEELA
SLLAVSRRNF DRRFIKATNN TPVEYLQRVK IEVAKNSLEK GRATVSEVMY EVGYADDKAF
REVFKKITGI SPQQYRAKYT RDLL