Gene Rleg_2036 details


Gene Information

Locus tag: Rleg_2036
Symbol:
ID: 8013067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 2029783
End bp: 2030922
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 62%
IMG OID: 644824622
Product: hypothetical protein
Protein accession: YP_002975853
Protein GI: 241204757
COG category: [S] Function unknown
COG ID: [COG4222] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
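
The coordinates and lengths in this record agree with each other, which can be checked with a few lines of Python. This is a minimal sketch, not part of the original record: the variable names are illustrative, and it assumes that the start and end positions are 1-based and inclusive and that the gene length includes the stop codon.

```python
# Values copied from the record above.
start_bp = 2029783
end_bp = 2030922

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which is not
# translated, so the protein is one residue shorter than the codon count.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1140 bp
print(protein_length, "aa")  # 379 aa
```

Both results match the Gene Length and Protein Length fields.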


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.00898362
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCAAGC GTTTTCTCGC GGCTGCCGCT CTCGCCCTTC TGTCAAGCAC AGTCTCCGCC 
AGCGCCACAG ATATCGGCGC CACCTTCGCG ACCGCCTGCC CCTTCGGCGA TTGCGCCGCT
GGCATCTCGC TCTCCTATCT CGGTGAATTC GTCATCCCCA CAGGCCATAT CGAAAACGGC
GTCGAATTCG GCGGCATTTC TGGCCTCGAT TTCGATGTCG CCACCGGCCA TTATATCGCC
ATCAGCGACG ACCGCTCGGA AAGAGGCCCG GCCCGCTTCT ATGAACTCAA CGTCGATGTC
GACGCGTCGG GCCTTAAGCG CGTTTCGGTC GTCAAGCAGG TGACGCTGAA AGACAAGAAC
GGCGAGCTCT TCGTTGCCCG GACCGTCGAT CCAGAATCGA TCCGCCTTGG CAAGGATGGC
ATCTATTGGG GCAGTGAGGG CGACGGCAAG GCGCTGCTGG CGCCCTTCAT CCGCGTCGCA
TCGCCGGACG GTTCCTTCGT CCGCGAATTC AAGCTGCCGG AGGGCTTTGC ACCGACCGCA
GACAAGTCAA CAGGCATCCG CGACAACCTC GCTTTCGAGG ATCTCGCGGT CGCGCCCTCC
GGCGATGTTT TCGTCGGTGT CGAAGCGGCC CTTTACCAGG ACGGTCCGAA CCCCTCGCTG
ACGTCGGGCA GCCTGTCGCG CATCGTCCGC TACGACGGCG CCACCGGCGC GCCGAAAGCC
GAGTACGTCT ATCCCGTCTC GCCGATCCCG CAGGCCGCCA CCAAGGCCGA CGGCGGTAAT
GACAACGGCA TGTCTGAAAT GCTTGCCCTC GACGATCACC GCCTGCTCGC CGTCGAGCGG
AGTTATGCCC AGGGCTTCGG CAACAGCATC GAGATCATGA TGATGGATCT GACTGATGCC
ACCGATGTAT CCGCCATCGC GTCCCTCGCC AAAAACGACC AGCGCGTCGT CCCTGTCCGC
AAGAGCCAGG TCCTCGATTT GAGGGCGATC GGCCTCGTTC CCGACAATAT CGAGGCCATG
TCGCTCGGCA AGGCCAAGGA CGGCACCGAT CTTCTCATTC TCGGCTCCGA CAATAATTTT
TCGACCAGCC AGAAGACGCA ATTCTATGCC TTCAAGGTTC TCAACCGCCC GCAGCAGTAA
 
Protein sequence
MTKRFLAAAA LALLSSTVSA SATDIGATFA TACPFGDCAA GISLSYLGEF VIPTGHIENG 
VEFGGISGLD FDVATGHYIA ISDDRSERGP ARFYELNVDV DASGLKRVSV VKQVTLKDKN
GELFVARTVD PESIRLGKDG IYWGSEGDGK ALLAPFIRVA SPDGSFVREF KLPEGFAPTA
DKSTGIRDNL AFEDLAVAPS GDVFVGVEAA LYQDGPNPSL TSGSLSRIVR YDGATGAPKA
EYVYPVSPIP QAATKADGGN DNGMSEMLAL DDHRLLAVER SYAQGFGNSI EIMMMDLTDA
TDVSAIASLA KNDQRVVPVR KSQVLDLRAI GLVPDNIEAM SLGKAKDGTD LLILGSDNNF
STSQKTQFYA FKVLNRPQQ
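
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last codons. The following is a minimal sketch, not part of the original record: `translate` is a hypothetical helper, and it assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code, so the compact 64-letter standard table is used. Only the first 30 and last 9 nucleotides are copied from the gene sequence above.

```python
from itertools import product

# Compact form of the standard codon table: codons are enumerated in
# T, C, A, G order for each of the three positions; '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string codon by codon, ignoring spaces."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three 10-base groups of the gene sequence.
print(translate("ATGACCAAGC GTTTTCTCGC GGCTGCCGCT"))  # MTKRFLAAAA

# Last nine bases of the gene sequence, ending in the stop codon.
print(translate("CAGCAGTAA"))  # QQ*
```

The first result matches the opening residues of the protein sequence, and the second matches its final two residues followed by the stop codon.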