Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2400 |
Symbol | |
ID | 8013385 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2401842 |
End bp | 2403257 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644824981 |
Product | protein of unknown function DUF900 hydrolase family protein |
Protein accession | YP_002976211 |
Protein GI | 241205115 |
COG category | [S] Function unknown |
COG ID | [COG4782] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.506221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0050611 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGAGCACAG CAAAATGTCT TGACGGCCGA ATTCAACCAA GCCCTTCCCT AGCGCAATTC CAGCAAAAGT GCGCAGCGGT TTTGCGTCCG GAATTGCGTG AAAACAAAAA GACAGAGCAT TTCCGTGGCT CGGAGAAACT AGGTAATGCT CTGGGTTTGA AATGGATATC GGCTCTGCTG CTGCTCTGCG CATTGGCCGG ATGCGGCGGC CACCCGAAGA ACGTGCTCAT CCCGGTTGCC GATAGCGCGC CGAACGCCAC CAAGGTCGCC ATGCTCGTGA CCACGACGCG CAGCCGCTCC ACGATCCAAG GCGAAATGTT CACGGGCGAA CGCGCACTCG CCCCGGCCTT TGCCGACATC ACCGTATCCA TCCCGCCTGC GAACGTCCGC AAGATCGGCG AAGTCGCCTG GCCGAAAAGG CTTCCATCCA ATCCGGCAAC GGACTTCGCA ACCTTGAAAG CCGAGGAGAT CACGCGCGAC GACGCCAAGA AATGGCTGAG CGCCTCTGTC AGGAAGAGCC GCGACCGCAG CGTGCTGGTG TTCATCCATG GCTTCAACAA CCGTTTCGAA GATTCCGTCT ATCGCTTCGC TCAGATCGTC AAGGATTCTG GCGTCCACAG TGCGCCCGTG CTGGTGACGT GGCCGTCGCG CGGAAGTCTG CTCGCATATG GCTATGACAG GGAGAGCACC AACTACACGC GCAACGCATT GGAGATGCTC TTCCAGTATC TCGCGAAGGA TCCAGAGGTG AAGGAAGTCT CGATCCTGGC GCATTCGATG GGGAACTGGC TGGCGCTGGA AGCGCTTCGG CAGATGGCGA TCCGCAACGG CCGCCTCCCT GCCAAGTTCA AGAATGTCAT GCTGGCCTCG CCCGACGTGG ATGTCGATGT CTTCCGCCAG CAGATCGTCG ACATGGGCAA GCAGCATCCG AATTTCACCC TGTTCGTTTC GCGAGACGAC CGCGCGCTCG CGGTTTCCCG CAGGGTATGG GGCGACGTCG CCAGACTCGG CGCCATCGAT CCCGAGCAGG CCCCTTTCAA GAAAGAGTTG GCCGACAGCC AGATCACCGT GATCGACCTC ACCAAGGTCA AGGCGGGCGA CAGGCTGAAC CATGGAAAAT TCGCGGAATC GCCCGAGGTC GTTCAGCTCA TCGGCGCTCG GATTTCCGAC GGCCAGACGC TGACCGACAG CAAGGTCGGG CTCGGGGACA AGATCCTCGC CGCGACGACG AGCACGGCGG CGGCGGCGGG CAGCGCCGCC GGCCTGATCC TTGCCGCCCC GGTCGCGGTC GTCGATGCCG ACACCAGAGA TAATTACGCC GGCCAGGTCA GCGGCCTCAC GGGTCCCGTG GGTACGCGAC CGAAGGCGTC CGAGTGCACT GCCGCGGGCC GATCGAAGGA GACATGCAGG CAATAG
|
Protein sequence | MSTAKCLDGR IQPSPSLAQF QQKCAAVLRP ELRENKKTEH FRGSEKLGNA LGLKWISALL LLCALAGCGG HPKNVLIPVA DSAPNATKVA MLVTTTRSRS TIQGEMFTGE RALAPAFADI TVSIPPANVR KIGEVAWPKR LPSNPATDFA TLKAEEITRD DAKKWLSASV RKSRDRSVLV FIHGFNNRFE DSVYRFAQIV KDSGVHSAPV LVTWPSRGSL LAYGYDREST NYTRNALEML FQYLAKDPEV KEVSILAHSM GNWLALEALR QMAIRNGRLP AKFKNVMLAS PDVDVDVFRQ QIVDMGKQHP NFTLFVSRDD RALAVSRRVW GDVARLGAID PEQAPFKKEL ADSQITVIDL TKVKAGDRLN HGKFAESPEV VQLIGARISD GQTLTDSKVG LGDKILAATT STAAAAGSAA GLILAAPVAV VDADTRDNYA GQVSGLTGPV GTRPKASECT AAGRSKETCR Q
|
| |