Gene Bind_1720 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1720 
Symbol 
ID6199692 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1944057 
End bp1945052 
Gene Length996 bp 
Protein Length331 aa 
Translation table11 
GC content61% 
IMG OID641705711 
ProductDeoR family transcriptional regulator 
Protein accessionYP_001832839 
Protein GI182678693 
COG category[K] Transcription 
COG ID[COG2390] Transcriptional regulator, contains sigma factor-related N-terminal domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.925466 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0186791 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGAAAC GGTTGAGTGA GCGCGGGCAG AATATCGAGC AGAAGCACCA GGACCAGAAA 
CGTTTCGATC TCGCGGCGCG GGCGGCTTGG CTCGCTTATG CGCTTGGCCG GACTCAGGAT
GAAATCGCGG CGGAATTGAA TGTTTCGCGC CAGAACGCGC AGCGTCTCAT CGCGCTCGCC
AGCGCGGCCG GCCTCGTCAA ATTCCGCCTC GATCATCCGC TCGCCGATTG CATCGCCAAA
GCGCAGAAAC TGCGGGATAA ATTCAGCCTC AAACATGTCG AGGTCGTCCC CTCGGCGCAA
AGCGATGAGG ACAATAGGCT TTCCGTGGGG ATCGCCGTGG CCTCCTATAT CGAGACTCTT
CTCTCGCAGA CGGAGCCGCA GATTCTGTGT CTTGGTACCG GCCGCACCTT ACGCTCGGCC
GTGCATCAAA TGCCCCTGAT GGAAAAGCCG AAGCACAAGA TCGTATCCCT CATCGGCACG
GTGGGACCCG ATGGTCGCGC AAGTCCCTAT GATGTCGTCA TGCGGCTTGC CGACCGTGTT
GGGGCACAAT GCTACCCCCT GCCCATGCCG GTCCTGGCCG ATGGCCCTGA AGAACGACGC
ATGCTCCAGT CCCAGAAGGG TTTGCGCGCC CTTCATGATC TCGCCGAAGA GGCACGGACA
TGGATCGTGG GCGTCGGCGA CCTTGGATCC CAAGCCTCCC TGCATGTCGA TGGATTTATC
ACAGACGAGG AACTTGCCGA ATTGCAGGCC AAGGGCGCGG TTGGAGAAAT TCTCGGCTGG
GCTTTCGATG GGCAAGGTGG ACCGGTCAAG ACCTCGATCC ATGACAGGCT GATCGCCGTG
GGGCTTGCCA CGCCCATCGC CTCGCACCGA ACGATTATCG CGGCGAGCGT CGGCGCGCAT
AAGATCGCCC CCTTGCTCGG CGCGCTCCGA GGCGGGTTGG TCAATGGCGT GCTCACGGAT
GAGAGGACGG CGCAAACCTT GATCGAAGCG CCTTGA
 
Protein sequence
MQKRLSERGQ NIEQKHQDQK RFDLAARAAW LAYALGRTQD EIAAELNVSR QNAQRLIALA 
SAAGLVKFRL DHPLADCIAK AQKLRDKFSL KHVEVVPSAQ SDEDNRLSVG IAVASYIETL
LSQTEPQILC LGTGRTLRSA VHQMPLMEKP KHKIVSLIGT VGPDGRASPY DVVMRLADRV
GAQCYPLPMP VLADGPEERR MLQSQKGLRA LHDLAEEART WIVGVGDLGS QASLHVDGFI
TDEELAELQA KGAVGEILGW AFDGQGGPVK TSIHDRLIAV GLATPIASHR TIIAASVGAH
KIAPLLGALR GGLVNGVLTD ERTAQTLIEA P