Gene Bind_3598 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3598 
Symbol 
ID6201318 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp4076135 
End bp4077568 
Gene Length1434 bp 
Protein Length477 aa 
Translation table11 
GC content57% 
IMG OID641707552 
ProductXRE family transcriptional regulator 
Protein accessionYP_001834642 
Protein GI182680496 
COG category[R] General function prediction only 
COG ID[COG3800] Predicted transcriptional regulator 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.149473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTCAAA AATTTTTTGC CGGAACCCAG ATCCGACGCC TCAGGGAGGC TCACGCCCTG 
ACTCAAGGCG CCTTTGCCGA GCGTCTTGGG ATCTCGCCTA GTTATTTGAA TCAGATTGAA
AATAACCAGC GTCCCCTTTC CGCCTCTGTC CTGCTCAGTC TCGCTCAATC CTTTTCCGTC
GATCTCAGCG AATTTGCGCA GGAAGATACC GATCGCCTCA TCGGCGATCT CAAGGAAGCC
CTGGCCGATC CGCTCTTTTC AGGACTGACG CCCAGTGGTC AGGATCTGAA GATGATCGCT
GGCAATGCGT CCTGGTTCGC ACATGCCTTT CTCCAATTGC ATACGGCTTT TCGCCGCACC
AATGAACGCA TGCAGACGAT GGACGATGAG GACATCCTCC ATCGGCCGGC GGGCACGGCC
TCGGATTCCG GCATGCTCTT GCCTTACGAG GAAGTCCGGG ATTTCTTCCA TTATCGCGGC
AATTACATCG ATGTGCTCGA TGTCGCGGCT GAGCGAATCG CCGAGGAAGA ACTCGGCATC
GGCGAGGGGA CGAAAGTCAA CGAACTCGCG GATTATCTCT TGCGCCGCCA TGCCGTGCGC
GTCGAATGCG AAAAGGTCGA GCCGACCTCG CGCTTCATGC GCCGCTATGA TCGCGTCGGC
CGCGTGCTCT CGATCCGGGA TGGTCTCGAT CCGACGAGCC GGGCGTTTTT GATTGCGCAT
CAGATCGCCC ATCTTGAGGA AGGCGATACA TTCGAGGCGA TCATCGCCGA GGCCGGTTTC
CGGGCGAGCG GCGCGGCCGG CATCACCAGG ATCGCGCTTG CCAATTATTT CGCCGGCGCC
TTGATCATGC CTTATGAGCG TTTCCTGAAT GCGGCGCGTT CGACACGTTA TGATGTCGAA
CGCCTCTGCT TGATGTTCGG TGCGAGTTTC GAACAGATCG GCCATCGTCT GTCCACCCTG
CAAAGGCCGA ATGCGCGCGG TGTCCCTTTC TATTTCGTGC GCATCGATCG GGCGGGAAAT
ATTCTGAAAC GTCATTCATC GACGCGTTTC CAATTTGCTC GTTTCGGTGG CACCTGTCCT
TTATGGAATG TGCATGAGGC ATTCGAAATG CCGAACCGCA CCTTGGTCCA GATCGCCGAA
ATGCCGGATG GCGTGCGCTA TTTGTGTGTC GCCCGTTCAG CCACCAAATC GGTTGGCAGC
CATCTCGCGC AACCCCGTCA TTACGCGCTC GGCATTGGAT GCGAGATCAG CTATGCGCAT
GATGTTGTCT ATTCCGACGC CATCGATCTA AAAACCACTC CGGTCGCCAA GATCGGTGTC
AGTTGCCGTA TCTGCGAACG CACCGATTGT CCGCAACGGG CAGCGCCGCC GATCGATCGC
GGCCTTCTCG TCGATCCGGA CCGCCGCGAT TTCGTTCCTT TCCGGTTTAC ATGA
 
Protein sequence
MRQKFFAGTQ IRRLREAHAL TQGAFAERLG ISPSYLNQIE NNQRPLSASV LLSLAQSFSV 
DLSEFAQEDT DRLIGDLKEA LADPLFSGLT PSGQDLKMIA GNASWFAHAF LQLHTAFRRT
NERMQTMDDE DILHRPAGTA SDSGMLLPYE EVRDFFHYRG NYIDVLDVAA ERIAEEELGI
GEGTKVNELA DYLLRRHAVR VECEKVEPTS RFMRRYDRVG RVLSIRDGLD PTSRAFLIAH
QIAHLEEGDT FEAIIAEAGF RASGAAGITR IALANYFAGA LIMPYERFLN AARSTRYDVE
RLCLMFGASF EQIGHRLSTL QRPNARGVPF YFVRIDRAGN ILKRHSSTRF QFARFGGTCP
LWNVHEAFEM PNRTLVQIAE MPDGVRYLCV ARSATKSVGS HLAQPRHYAL GIGCEISYAH
DVVYSDAIDL KTTPVAKIGV SCRICERTDC PQRAAPPIDR GLLVDPDRRD FVPFRFT