Gene Bind_0042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_0042 
Symbol 
ID6200919 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp43489 
End bp44523 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content64% 
IMG OID641704038 
ProductAraC family transcriptional regulator 
Protein accessionYP_001831190 
Protein GI182677044 
COG category[K] Transcription 
COG ID[COG2207] AraC-type DNA-binding domain-containing proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGG TCGACAAGTG CAAGATCCCG CAGGCGTTCT GGCGGGCAGC TGAGCAGTTC 
GACATCCCGT CGGCCGCGCT GTTGCGGCAG GCGCGGCTGC CGGCAACGCT TCATCTGGGC
ACACAAGTCT TCGTCACCAC GGCGCAGTAT TTCTCGCTGA TGCAGGCGAT GGCGGACCTG
TCCGGCGACT CCGCGCTTGG CATCCGAATG GTGCAATCCG TCGATACGGC GGTCCATCCG
CCGTCGAGCC TCGCCGCCTT CTATGCCCGC GACTATCGCG ACGGGCTGAC CCGGCTCGCA
CGGTTCAAGC GCCTGTGCAC CCCAGAGCAG TTGCAGGTCG TCGAGGCGGG TGGCGACTGC
ACCATCTCCA CCGAATGGCC CTTCGCCGCG GCAGCCGAAC CCAGCATATC CGTCGATATT
ACTTTCGCCA CGTTGGTAGA ACTGGGACGG CGCGCTACCG GGCGTACCAT CGTGCCACGT
CGGTTGGAGC TGACCCGGCC GGGACCGATA GACGCAATTC ATGCGGAATA TTTCGGCTGC
CCGATCCGTA CCAAGGCCCC GCGGAACCTT TTGGTGCTCG ACGCCGCCGA TCTCGATCGT
CCGTTCCCGG GACACAATCC CGAGATGCTG GAGATGCTGA CGCCGGCCCT CGGGGCGGCG
CTCGGTGAGT TGGAGGCGCA GAGTTCGATC GCCGAACAGG TGAAGATCGT GGTGAAACGC
AGTTTGGCGA GCGGCCAGCC CGGCCTCTCC GACGTGGCAA AGCAACTCGG CATGAGCGAT
CGAACCCTCC AGCGGCGTAT CACCGAGGAA GGATCGACCT TTCGTGATCT GCTGTCGGAA
GCTCGCCGGG ATCTTGGTCG CCATCTCCTG ACCGACCCCG CCACGGACAT CGATGAAGTG
GCCTGCCTGC TCGGCTATCA GGACACCACG TCCTTCTACC GCGCTTTCCG GGAATGGGAA
GGCATGCCGC CGAACCGCTG GCGCGAGACG AATATGAACA GGCCCCGCGC ACTTGAAACC
GCCGGTCTCC ATTGA
 
Protein sequence
MAQVDKCKIP QAFWRAAEQF DIPSAALLRQ ARLPATLHLG TQVFVTTAQY FSLMQAMADL 
SGDSALGIRM VQSVDTAVHP PSSLAAFYAR DYRDGLTRLA RFKRLCTPEQ LQVVEAGGDC
TISTEWPFAA AAEPSISVDI TFATLVELGR RATGRTIVPR RLELTRPGPI DAIHAEYFGC
PIRTKAPRNL LVLDAADLDR PFPGHNPEML EMLTPALGAA LGELEAQSSI AEQVKIVVKR
SLASGQPGLS DVAKQLGMSD RTLQRRITEE GSTFRDLLSE ARRDLGRHLL TDPATDIDEV
ACLLGYQDTT SFYRAFREWE GMPPNRWRET NMNRPRALET AGLH