Gene Bind_1991 details


Gene Information

Locus tag: Bind_1991
Symbol:
ID: 6201182
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Beijerinckia indica subsp. indica ATCC 9039
Kingdom: Bacteria
Replicon accession: NC_010581
Strand:
Start bp: 2275663
End bp: 2276790
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 55%
IMG OID: 641705979
Product: LuxR family transcriptional regulator
Protein accession: YP_001833103
Protein GI: 182678957
COG category:
COG ID:
TIGRFAM ID:
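
The start, end, gene length, and protein length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2275663
end_bp = 2276790

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop
# codon, which does not contribute a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1128 375
```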


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00358246
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGATTGATC CCGTCGTCGA AAATTTGATC CGCGCCATCT ATGATTGCGT CATCGATCCA 
TCGGGTTGGG AAGACGTCCT GCAAAGGATC GTCACCCATA CCCATGGCGT TGCCGCCGCT
TTGGAAGCGG AGATGTCAGA GACAAAGCCT AAGAAGATCG CAAGCTATAA TTTCGATCCT
TTTTATGACT TTGCTTATCG AAGCCATTTT CATGCCTTGA ATCCGTTCAT CCCTGGCCGA
TTGTCCCAAA TGGCCGAAAC CGTTTCTATT GGCAATTCCA TAACCGATAC CGCTGCTTAT
CGTGCGTCAT CCTTTTATAA TGAATTCGCC AAGCCCCAGG GATGGGAAGC CTTTATCAGT
GTGAATCTCA ATGGACCTGG GGGCGCCGAT GTCTTCGCCC TCATGCGAAG CCAAAAAACC
GATTTCGCTC AGACGGGCAT CGAACATTTT CTGACTCTTC TCGCGCCGCA TTTGCGGCGT
GCCTATGATC TTTCCAGCCT CCTGGCCCAT AGTCGTCAGA CGGCGGAGTT TCTAGGGCGG
GCGATTGCCA CTGCGGGGTT CGGCACTATT CTCCTGAGCG AAAAATGCCG GATCGTCTAT
GCCAATGAGG TTGCCGAGGA GCTGCTTCGT CAGCAACAGG GGCTGGCTTT CATTCGAGGT
GAACTCGTCG CGGAGGCAAC GACCCTGACC TCCCGACTTG CGGCCATGGT GCGCGCCTGT
GTCGACCCGC GGGCCTTGAC CGATCCGCTC ACCACAATGC TCGAGCTCCC GCGCCGCGGT
TCGGATCAGC CCATCCGCGT GCATGTCTTG CCCCTTCAGG AAAAGACGGC GGCGATGGTG
GCCCATCGGG CACGGCCCGT TGCCGCGCTT TTCCTGGTCA ATCCGCAGCA TGATCTTTCC
ACTCGGATGC AAAGTTTTGC CGATGCTTAT TCCTTAACGT CGATCGAGAT CGCAATCCTG
GGGGAACTCA TTCACAGTGA GTCGCTGACA TTGGTCGCTG CGAAGCTTGG CGTGTCTGCC
TCGACCTTGC GGACGCATAT GGGACGCTTG ATGGCCAAGA CCGGGACAAG AAATCGACTT
GAACTTCTCC GCAGCTTCTT CGAAATGTTC TGCTTCGCTT CGCGGTAA
 
Protein sequence
MIDPVVENLI RAIYDCVIDP SGWEDVLQRI VTHTHGVAAA LEAEMSETKP KKIASYNFDP 
FYDFAYRSHF HALNPFIPGR LSQMAETVSI GNSITDTAAY RASSFYNEFA KPQGWEAFIS
VNLNGPGGAD VFALMRSQKT DFAQTGIEHF LTLLAPHLRR AYDLSSLLAH SRQTAEFLGR
AIATAGFGTI LLSEKCRIVY ANEVAEELLR QQQGLAFIRG ELVAEATTLT SRLAAMVRAC
VDPRALTDPL TTMLELPRRG SDQPIRVHVL PLQEKTAAMV AHRARPVAAL FLVNPQHDLS
TRMQSFADAY SLTSIEIAIL GELIHSESLT LVAAKLGVSA STLRTHMGRL MAKTGTRNRL
ELLRSFFEMF CFASR
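
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes NCBI translation table 11 (the value given in the Gene Information fields), in which the amino acid assignments match the standard code and alternative start codons such as GTG are read as methionine at the first position. The function and constant names are hypothetical, and this is not the pipeline that produced the record.

```python
from itertools import product

# Codon order follows the NCBI genetic code listing: bases T, C, A, G,
# with the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for table 11; each is read as Met when it initiates a CDS.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGATTGATC CCGTCGTCGA AAATTTGATC"))  # MIDPVVENLI
```

The ten residues produced here match the first block of the protein sequence, and the leading GTG accounts for the initial M.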