Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_3662 |
Symbol | |
ID | 8014508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 3707991 |
End bp | 3708992 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644826225 |
Product | hypothetical protein |
Protein accession | YP_002977444 |
Protein GI | 241206348 |
COG category | [S] Function unknown |
COG ID | [COG4254] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.286788 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTGCA TCGCGTTAAG CGGCGCATGG CCGGCGCTCG CCGCCGAGCC GGTTGGCCAG GCCGTCGTCA TCAAGACGCA GGTGACGGGA CAGAGCGGGC CGATCGAGGT CGACACCAGC GTCCATCGCA ACGAGCGCAT CAAGACATCA CCGTCGGGGC TTGGCCAATT CGTGTTTCGT GACGGCACGA AGCTCGCGGT GGGCTGGGGT TCGTCGGTCG TGATCGACAA ATATGTCTTC GATGATTCCC AATCGGTCAA GAAACTGACG ATCAGGGCAG CAAAGGGCAC ATTCCGCTGG GTCAGCGGCA ATTCCAACTC CTCGGCCTAC CAGATCCTGA CGCCGGCCGG CACGATCGGC GTGCGCGGCA CCGCTTTCGA TTTCTACGTC GGCCCGGATG GCACGACCGC CGTCGTGCTG TTGAATGGCG CTGCCCGTTT CTGCGGCCCG GGCGGCTGCC GGCAATTGCA GCAGCGCTGC GATTGTGTGG TGGCCAAGCC GAACGGCGAT ATGTCGGCGG CACGCCGGGT CGATCCCAGC ATCCTCGCGA CACTCGGAAA TCAGCACGCC CTGCCCTTCC TCTCCGGCAA TCAGCGGCTT GCAGGCGGCA TCGGCATGCT CGGCGGCTGC AATATGGCGT CAGCCGCGCC GGAAAGAAGG GACAGGAACC GGCCCCCGCC TCCGGCTTCG CCCGATCCAC AGAAACAAGA TCCGCCGCCG AAACAGGCCG AGCCGCAAAA GGAACGGCCA CACAAGCCCG ATAAGCCGCA TCACGACAAA CCGCACCATG ATAGGCCACA TCACGACAGG CCGGATAGGC CCGACAAGCA TGATGGAAAC GACAGGCCGG GAAATCACAG CCAGAATCAC GGGAATGACC GGGATCATCG AGGACACGAC CGCGACCATG GCGGAGATCG TGATCACGAT AAGGATCGTG ATCGAGATCA CGGCAGGGAT CGCAACCACG ACAAGGGCCG GAGTTTCAAT CGGAACCGCT GA
|
Protein sequence | MLCIALSGAW PALAAEPVGQ AVVIKTQVTG QSGPIEVDTS VHRNERIKTS PSGLGQFVFR DGTKLAVGWG SSVVIDKYVF DDSQSVKKLT IRAAKGTFRW VSGNSNSSAY QILTPAGTIG VRGTAFDFYV GPDGTTAVVL LNGAARFCGP GGCRQLQQRC DCVVAKPNGD MSAARRVDPS ILATLGNQHA LPFLSGNQRL AGGIGMLGGC NMASAAPERR DRNRPPPPAS PDPQKQDPPP KQAEPQKERP HKPDKPHHDK PHHDRPHHDR PDRPDKHDGN DRPGNHSQNH GNDRDHRGHD RDHGGDRDHD KDRDRDHGRD RNHDKGRSFN RNR
|
| |