Gene Rleg2_3786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_3786 
Symbol 
ID6982549 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp3913568 
End bp3914980 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content65% 
IMG OID643398508 
Productprotein of unknown function DUF1338 
Protein accessionYP_002283274 
Protein GI209551357 
COG category[S] Function unknown 
COG ID[COG5383] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCACG CCTTCGTTTC AACTGACCGT ATCCGCTCGC TCTTCACCGA AGCGATGTCG 
CAGATGTATC GGGCGGAGGT GCCGCAATAT GGCACGCTGA TCGAACTGGT GGCGGATGTG
AATGCCGGCT GCCTCAAAAA TAATCCCGAT CTGCGCGAAC GGCTTGCCGG CGCCGGCGAA
CTGGAGCGCA TCGATGTCGA GCGCCACGGC GCCATCCGGC TCGGTACGGC GGAAGAGCTT
TTCACCATCC GCCGGCTGTT TGCGGTCATG GGCATGCAGT CGGTCGGCTA TTACGATCTC
TCGGTCGCGG GCGTGCCTGT TCATTCCACC TGTTTTCGGC CGATCGACGA GGCCGCACTC
AACATCAATC CGTTCCGCGT CTTCACCTCG CTGCTGCGAT TGGAGCTGAT CGAGGACGAA
GGGCTGCGCG GCGAAGCCGA AGCCATTCTG GCAAAGCGGC GCATCTATAC GCCGCGCGCC
GTCGCGCTGA TCGAGCGCCA CGAGCAGAAT GGCGGCCTGA CGGAGGCGGA GGTGACGGAG
TTCGTCGCTG AGTCGCTTGA GACCTTCCGC TGGCATGGCG AGGCGACGGT CAGCGCCGAA
ACCTACAAGC GCCTGCATGA TGCGCACCGG CTGATCGCCG ACGTCGTCAG CTTCAAGGGG
CCGCATATCA ACCATCTGAC GCCGCGCACG CTCGATATCG ACGCGGTCCA GGCCCGCATG
CCGGAACGCG GCATTACGCC GAAGGCCGTC ATCGAAGGCC CGCCGCGCCG CCATTGCGAT
ATCCTGCTGC GGCAGACGAG CTTCAAGGCG CTTGAAGAAA CGATCGTCTT TGCCGGTGAC
GCGGACGCGG TTCAAGGAAC GCATACCGCC CGTTTCGGCG AGATCGAACA GCGCGGCGTG
GCGCTGACGG CCAAGGGCCG GGCGCTCTAT GACCGGCTGC TTGCCTCGGT TCGCGGCGAA
GTGCAGGTCG GCGCCGGCGG CGCCAAGGCC GGCGCCTATG ACCAGGAACT CGCCGAGCGC
TTCAAGGCGC TGCCGGACAG CTGGGACGAG CTGCGCAGGC AAGGTCTCGC CTTCTTCCGC
TATTGCGCGA CGCCTGCGGG TATTGCCGCG GCCGTCGGCG GCACGCTACC CAAGGATCCG
GAAGCGCTGA TCGCCAAGGG TTACCTTGCC TTCTCGCCGA TCGTCTACGA AGACTTCCTG
CCAGTCAGCG CCGCCGGCAT CTTCCAATCG AACCTCGGCA CCGACCAGCA GCAGAATTAT
GCGACGCATT CGAACCGCGA TGCCTTCGAG GCGGCGCTCG GCGCCACCGT TCAGGACGAG
CTGGCGCTTT ATGCCGAGCG CCAGGCTGCC TCGCTGGATG CGGCGATGGA AGCGCTGGGC
CTTGCGGGTC TGCAGCTGAA GACCGTCGCG TAA
 
Protein sequence
MPHAFVSTDR IRSLFTEAMS QMYRAEVPQY GTLIELVADV NAGCLKNNPD LRERLAGAGE 
LERIDVERHG AIRLGTAEEL FTIRRLFAVM GMQSVGYYDL SVAGVPVHST CFRPIDEAAL
NINPFRVFTS LLRLELIEDE GLRGEAEAIL AKRRIYTPRA VALIERHEQN GGLTEAEVTE
FVAESLETFR WHGEATVSAE TYKRLHDAHR LIADVVSFKG PHINHLTPRT LDIDAVQARM
PERGITPKAV IEGPPRRHCD ILLRQTSFKA LEETIVFAGD ADAVQGTHTA RFGEIEQRGV
ALTAKGRALY DRLLASVRGE VQVGAGGAKA GAYDQELAER FKALPDSWDE LRRQGLAFFR
YCATPAGIAA AVGGTLPKDP EALIAKGYLA FSPIVYEDFL PVSAAGIFQS NLGTDQQQNY
ATHSNRDAFE AALGATVQDE LALYAERQAA SLDAAMEALG LAGLQLKTVA