Gene Rleg_1247 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_1247 
Symbol 
ID8015541 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp1224840 
End bp1226198 
Gene Length1359 bp 
Protein Length452 aa 
Translation table11 
GC content62% 
IMG OID644823828 
Producthypothetical protein 
Protein accessionYP_002975078 
Protein GI241203982 
COG category[S] Function unknown 
COG ID[COG4222] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.171773 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.0628011 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAACG TCCTTTTTGC TTCCGTATCG CTTTTCATCC TTGTCGCGGG CGCCGCTTCG 
GCCGACCAGC AGCAGTTCCC GGCCAAACTC GCCGGCCAGG CGATCCTGCC CGCCAACACC
ATGGTTCCGG CACCGGCCGA TGCCCCCGAA TTCCTCAAGC ATTCCGGCAA GTTTACGACG
CCGGACCGTA AGCGCGCCGA AGCGCTCGGC ACCGCTCCCG GCAAGGACGG CGCCCGCATC
ATCGATCTGA AGCTTCCCTT CGACGGTCAG CCGATCCAGG GTTTCTCAGG GGTCAAGACG
ATGGCCGACG GCACCTTCTG GACGCTCTCC GACAACGGCT TCGGCTCGAA GTCCAACTCG
TCTGACTCCA TGCTCTTCCT GCACCAGATG AAGTTCGACT GGGCCGGCAA CAAGGCTGAA
GTCGTCAAGA ACCTCTTCCT TTCCGACCCC AACAAGATTG CACCGTTCCC GATCGTGCTT
GAAGGCACCG ACACGCGTTA TCTCACCGGC GCCGACTTTG ACATCGAATC GATCCAGCCG
GTTGTAGACG GCTTCTGGCT CGGCGACGAA TTCGGTCCCT ACATCCTGAA GTTCGACACG
TCAGGCCGCC TCACCGACGT CATCCCGACG ACGCTCGACG GCAAGCCGGT GCTTTCGCCC
GACAATCCAC TTCTCTCGGT TCCGGCCAAC CCGGCCGCCA AGATGCCGGT CTTCAATCTG
AAGCGCTCCG GCGGCTTCGA GGGCCTCGCC ATGTCCAAGG ACGGCGCCAA GCTCTACGGC
CTGCTCGAAG GCGCCATCTA CAAGGATGAC CGCACGGTAG AAACCATCGA CGGCCACACC
GCCATCCGCG TCATCGAGTT CGATGTCGCG TCCAAGAAGT GGACCGGCCG CAGCTGGCTC
TATCCGTTCG AGGACAAGGG GGTATCGATC GGCGACTTCA ACGTGCTCGA CGACACCACC
GCTCTCGTCA TCGAGCGCGA CAACGGCGCC GGCACGACGG ACAGGGCCTG CGCCGACCCG
AAGCAGCCGA AGCCGGATTG TTTCGAAGCT CCGGCCGTGC TGAAGCGCGT CTACAAGATC
GAGTTCAACG ACGCCAATGT CGGCAAGGCG GTCCGCAAGA TCGGCTATAT CGACCTCCTG
AACATTCAGG ACCCCGACAA CAAGAAGAAG GCCGGCAGCA AGGACGGCGT CTACGACATG
CCGTTCGTGA CGATCGAAAA CGTCGATCGC GTCGACGCCA CGCACATCAT CATCGGCAAC
GACAACAACC TGCCCTTCTC GGCCGGCCGC GCCGTCGACA AGGCCGACAA TAACGAGTTC
AGCCTGCTTG AGGTTGGCGA GTTTTTGAAC GCGAAGTAG
 
Protein sequence
MKNVLFASVS LFILVAGAAS ADQQQFPAKL AGQAILPANT MVPAPADAPE FLKHSGKFTT 
PDRKRAEALG TAPGKDGARI IDLKLPFDGQ PIQGFSGVKT MADGTFWTLS DNGFGSKSNS
SDSMLFLHQM KFDWAGNKAE VVKNLFLSDP NKIAPFPIVL EGTDTRYLTG ADFDIESIQP
VVDGFWLGDE FGPYILKFDT SGRLTDVIPT TLDGKPVLSP DNPLLSVPAN PAAKMPVFNL
KRSGGFEGLA MSKDGAKLYG LLEGAIYKDD RTVETIDGHT AIRVIEFDVA SKKWTGRSWL
YPFEDKGVSI GDFNVLDDTT ALVIERDNGA GTTDRACADP KQPKPDCFEA PAVLKRVYKI
EFNDANVGKA VRKIGYIDLL NIQDPDNKKK AGSKDGVYDM PFVTIENVDR VDATHIIIGN
DNNLPFSAGR AVDKADNNEF SLLEVGEFLN AK