Gene Rleg_4022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4022 
Symbol 
ID8014828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4098741 
End bp4099991 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content65% 
IMG OID644826591 
Producthypothetical protein 
Protein accessionYP_002977802 
Protein GI241206706 
COG category[S] Function unknown 
COG ID[COG4223] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.483809 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.844233 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTATCGG GAAACCCGCC ACGCCATTCG AAGAGCGCCG ACGAACCGGT CACGATCGAC 
CTCGATGCAC AGGAATTCGC CGCTGCGGCC GATACCGAAA AACCGGTGAA CAATGAAACT
GCCGACGCCG ACAGCACCGC TGCCGCCGAT GTCGGCCTGC CGCCCGAAAC CGAGACTGCG
TCGCATGCCG AATATGAAGA GAAGCCTGTG ATGGAGGCCC CGGAGGAGGA ACCGGCAGCC
CCAGAACCGT CCTTTACCCC TCCTCCCGAA CAGCCTGAGC CAAAGAGCGC CGGCACCTCC
GGTCTCATTG CTGCGGGCAT CTTCGGCGGC CTCGTGGCGT TGCTTGGCGC CGGCGCCATC
CAGTATGCCG GTTACCTCCC AGGCTCCTCC GCACCGCAGA CGACCTCGCC GGAGACGGCC
AATCTTGCCG GTGAGATCGA CGGCCTGAAG CAGTCCGTCG CCAACCTTGC CGCCAATCCG
GCGAGCACAG ATAACGGCGA GCTTGCGAAA CGCGTCGCTG CGCTGGAAAC GGCTGCAAAA
GCTCCCGCAG CCGGCGCACC GGCCGATTCG GCAAATGTCG AGGCACTCAA CCAGAAGATT
GCGGAGCTGA CCGGTCAGGT CGACCAACTG CGCTCTACGC TTACCCAGTC ATCCGAGCAG
CAGACGACGA ACGGCGCCGA TATCGCCAAG CGCCTCGAAG AGGCCGAAAA GAAGCTGAAC
GAGCCGCGCG AGGACGTCGC CGTTGCCCGG GCTATCGCGG CTGCCGCCCT GAAGGCGGCG
ATCGATCACG GTGGCCCGTT CCTGGCCGAA CTCGACACTT TCGCCGGTGT CGCACCCGAC
GATCCAGCCG TCGCCGACCT TAGAGCCTTT GCCGAAACCG GCATTCCCTC ACGCACCGAG
CTGGTGGGCG AGGTTCCCGA TGTCGCCACC GCGATCGTCG AAGCCGTCAA CCAGCCGGAT
CCGAATCAAA GCTGGTCGGA CCGGCTGATG TCGAGTGCCA AGTCGCTGGT GAGCGTCCGT
CCCGTCGGCA ATATCGAGGG TGAAAGCGTC GAAGCCATCG CCGCCCGCAT GGAGGAGAAG
GTGAAGAACG GCGACCTGCC CGGCGCTTCC GCCGAATGGA ACAACCTGCC GGCTCTCGGC
AAGCAGGCCT CCGCCGCCTT CAAGCAAACG CTCGAAGCGC GCATCCGCGT CGAGGAACTG
GTCGGCGGGG CGCTGTCGAA AGCGGTCTCC GGCACCGGCA AGGAGGGATG A
 
Protein sequence
MVSGNPPRHS KSADEPVTID LDAQEFAAAA DTEKPVNNET ADADSTAAAD VGLPPETETA 
SHAEYEEKPV MEAPEEEPAA PEPSFTPPPE QPEPKSAGTS GLIAAGIFGG LVALLGAGAI
QYAGYLPGSS APQTTSPETA NLAGEIDGLK QSVANLAANP ASTDNGELAK RVAALETAAK
APAAGAPADS ANVEALNQKI AELTGQVDQL RSTLTQSSEQ QTTNGADIAK RLEEAEKKLN
EPREDVAVAR AIAAAALKAA IDHGGPFLAE LDTFAGVAPD DPAVADLRAF AETGIPSRTE
LVGEVPDVAT AIVEAVNQPD PNQSWSDRLM SSAKSLVSVR PVGNIEGESV EAIAARMEEK
VKNGDLPGAS AEWNNLPALG KQASAAFKQT LEARIRVEEL VGGALSKAVS GTGKEG