Gene Rleg_4778 details

Gene Information

Locus tag: Rleg_4778
Symbol:
ID: 8007031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012848
Strand:
Start bp: 149797
End bp: 150819
Gene Length: 1023 bp
Protein Length: 340 aa
Translation table: 11
GC content: 59%
IMG OID: 644821708
Product: transcriptional regulator, AraC family
Protein accession: YP_002972968
Protein GI: 241113133
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
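
The coordinates and lengths in this record are mutually consistent, which the following minimal Python sketch illustrates. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(149797, 150819)   # 1023
protein = protein_length_aa(length)       # 340
```

Both values agree with the Gene Length and Protein Length fields of the record.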


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.0612761
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCGTTA ATTCTCATAT ACATAACGAT GGCAGCCTCG CCCAACAGCT GGCTGTGCAG 
AGCATGTTGT TTCGGGAAAC GGCGCATAGT GGAACGGACC CGGATGAGCT CTCGGTAGTA
TTGTCGACGC CAAACTCCCC GATCAAGGTT GTGGCGGAAG GCAATATGCC GATTGCATAC
CACTGCAATT TCGTCTCCGT CGGAGAAGAG GTAACCGTCG CCGACTGTAC CTACGAAGGC
ACAATCCTGA TCAGGCGGGA AGCGCCCAGC GACAGGATGA TCGTTTTTCT ACCGATGGCA
GGGAACGCAT CCTTCGAAGG CATGCGGGAG CAAATATATT CTGTTCCCGC CCGTGGCACG
ATCCTTGAGG CAGGTCGTGC CGCGGGTGCT CGCCTATTCG GCCCCCGCCG TCATTTCGGC
CTGTTCGTCG ATCAGGCCAA GATCACTAGC CACCTCACGC ATATGTTCGA GAGAACGATC
AGCGGCGACG TCGATTTTCA TCCGCACATC GATCTGACGA CCGGCCCGGG GTTTGTATTG
CAGCAACTTG TCTCGAGCCT CCATCGCGGC CTCAGCCGGA ACGGACCGCT GCAACGGTCG
CCGCTGGCCG CCGGTTCGCT CTGCGACGCG GCGATCTATC TGCTTCTGGA GACCTGCCCC
AATCGTTATT CGAACGAGCT TGCGCTGCCT GCTCCGGCGC CGGCCCCGCG CCATGTGAAA
TGGGCCATCG ACTTCATGCA GGAACACGTC GCCGAGCCGA TTTCGCTCAA CGATATCGCG
ATGGCAGCCA AGGTCAGCGT TCGGACTTTG CAACAGGGTT TCCGGCAATT CAGGGATACG
ACCCCGATGT CCTACCTGCA TGACCTTCGG ATGGCCGCCG CCCATCGCGA TTTGCTGGAA
TCCGACCGGA AGCAGGTCAT CGCCGATGTC GCGCTCAGAT GGGGGTTTTC GCATCTGGGG
CGATTTGCAG CCGAATACAG GAAGCGTTTC GGGCAACTGC CGTCACAGAC CTTGAAGCGC
TGA
 
Protein sequence
MSVNSHIHND GSLAQQLAVQ SMLFRETAHS GTDPDELSVV LSTPNSPIKV VAEGNMPIAY 
HCNFVSVGEE VTVADCTYEG TILIRREAPS DRMIVFLPMA GNASFEGMRE QIYSVPARGT
ILEAGRAAGA RLFGPRRHFG LFVDQAKITS HLTHMFERTI SGDVDFHPHI DLTTGPGFVL
QQLVSSLHRG LSRNGPLQRS PLAAGSLCDA AIYLLLETCP NRYSNELALP APAPAPRHVK
WAIDFMQEHV AEPISLNDIA MAAKVSVRTL QQGFRQFRDT TPMSYLHDLR MAAAHRDLLE
SDRKQVIADV ALRWGFSHLG RFAAEYRKRF GQLPSQTLKR
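
The relationship between the gene sequence, the protein sequence and the GC content field can be checked with a short Python sketch. It is a minimal illustration, not the method used to build this record. It assumes the sequence is pasted in the space-separated 10-base blocks shown above, and it uses the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard table (table 11 differs only in its permitted start codons). All names below are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean_sequence(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTCCGTTA ATTCTCATAT ACATAACGAT"
print(translate(first_blocks))  # MSVNSHIHND
```

The translated prefix matches the first block of the protein sequence. Applying `translate` and `gc_percent` to the full gene sequence allows the same comparison against the complete protein sequence and the GC content field.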