Gene Rleg2_5631 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_5631 
Symbol 
ID6977022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp17584 
End bp18714 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content56% 
IMG OID643393088 
Producthypothetical protein 
Protein accessionYP_002277906 
Protein GI209546016 
COG category 
COG ID 
TIGRFAM ID[TIGR02308] RNA ligase, T4 RnlA family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.280064 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGACC GCATACACCC TGCACGCGAA ATCCCGTTTC CCGACCTTAT CGCTGGCTTG 
AAGCGAGCCC AAGGGCTTGG CCATGTCCAT CGCCGTCAGA ACGCAACCGG TACTTTGCAG
CTCTACATCT ATACCCCCCG GTGCGTATAT GAGGATGGTT GGGATCAGTT TTCGCTGATC
GCTCGCGGTC TCATTGTGGA CGAGGGCGCT GGTCGGGTCG TTGCCACGCC GTTTCCGAAG
TTTTTCAATG TCGGCGAGCG GCATGGCGAA GTGCCCGATC TGCCGTTTGA GGCGTTCGAA
AAGCTCGATG GTAGTCTGAT AATCGTGTTC AATGATGCTG GCCGTTGGCA CGCAGCCACC
AAAGGCGCGT TCGACTCCGA ACAGGCCCTA TGGGCTCAAG CACGCTTGGA TGCCCACGAT
CTCTCCGGTC TGTCGCCGGA TACGACATAT CTGTTCGAGG CGGTATATCC GGAAAACAGA
ATTGTCGTGC GATATGCGGA GCCTGCCATG GTGATGTTGG CGGCCTACCA CGCTTCAGGT
CTTGAAGTAA CCTACGACGA GGTTCGAACG ACCAGCCAAG CGTTGGGATG GCGTGCGGCC
GAACGCCATG AGTTCGGGAA TATGGCGGAC ATGATGCTCC ATACTGCAAC GCTCCCACGC
GACAACGAGG GGTTTGTCGT TAGATTCACA AATGGCTTGC GCCTCAAACT CAAAGGCTCC
GAGTACCGTC GTATCCATGC GTTGATCTCA CGCTGCACGC CATTGGCAAT GTGGGAAGCA
ATGGCCGCTG GGGACGACAT GGCCGCGATT CGTCGTGATT TGCCGGAAGA GTTTTGGAGC
GATTTTGACA ACATCGTACG CCTCCTGACG AAGGAATACG CGGCGATGGA AAGGAAGGTC
GCTGCACTGG CAGCATCTGT CGCCCATCTT TCCGATAAAG AGTTGGGATT GTCGCTCAAT
TCACTGCCTG CTGACGTGGG TCCTTACGTT TTTGGCTTGC GAAAAGCAGG TGCAATCGCA
GGTAAGTCCC GAGACGCGTT GATGCGTTCC ATCAGACCCA CTGGCAACGT GTTGCCAGGT
TACCAGCCGT CATATGCCAT GGGGCGTGTG ATTGATGAGG CAACATCGTA G
 
Protein sequence
MNDRIHPARE IPFPDLIAGL KRAQGLGHVH RRQNATGTLQ LYIYTPRCVY EDGWDQFSLI 
ARGLIVDEGA GRVVATPFPK FFNVGERHGE VPDLPFEAFE KLDGSLIIVF NDAGRWHAAT
KGAFDSEQAL WAQARLDAHD LSGLSPDTTY LFEAVYPENR IVVRYAEPAM VMLAAYHASG
LEVTYDEVRT TSQALGWRAA ERHEFGNMAD MMLHTATLPR DNEGFVVRFT NGLRLKLKGS
EYRRIHALIS RCTPLAMWEA MAAGDDMAAI RRDLPEEFWS DFDNIVRLLT KEYAAMERKV
AALAASVAHL SDKELGLSLN SLPADVGPYV FGLRKAGAIA GKSRDALMRS IRPTGNVLPG
YQPSYAMGRV IDEATS