Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_5043 |
Symbol | |
ID | 6978137 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011368 |
Strand | - |
Start bp | 690035 |
End bp | 691126 |
Gene Length | 1092 bp |
Protein Length | 363 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 643394186 |
Product | nitrogenase cofactor biosynthesis protein NifB |
Protein accession | YP_002279004 |
Protein GI | 209547086 |
COG category | [R] General function prediction only |
COG ID | [COG0535] Predicted Fe-S oxidoreductases |
TIGRFAM ID | [TIGR01290] nitrogenase cofactor biosynthesis protein NifB |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0411446 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGAAGG TTGCCAGAAT AGCGCCTGCC GCTGACAAGT CAGAAGCGAC AAATTTTGGT GGCTACGGAT CTTTGTCCGA TGGCAGTTCA GTGTCAACAG CTATAGATCC AGCGACCTAT GAAAATATCA AAGACCACCC GTGCTTCTCG CGGGACGCTC ATCGGAATTA TGCGCGATTG CATCTAGCGG TAGCGCCCGC CTGCAACATC CAGTGCAACT ACTGCAATCG AAAATATGAC TGTGCGAATG AAAGCAGGCC AGGGGTAGCA TCGCATCGGC TAACTCCCGA TCAGGCTCTG CGCAAGACCA TGGCTGTCGC CAGCGAGGTC CCGCAGCTTT CTGTAGTCGG CATCGCCGGG CCAGGCGATG CTTGCTACGA TTGGAATAAG ACAAAAGCAA CACTCATACC TATCGCTCGG GAAATTCCGG ACATCAAGCT GTGCGTCTCG ACCAATGGCC TCGCACTACC TGACCGTGTC GAGGAGCTCG TCGACATGAA TGTCGGACAC GTCACGATCA CCGTCAATAT GGTAGATCCG AATATCGGAA CGAAGATCTA TCCGTGGATA TTCTATCAGG GCCGCCACTA TAACGGCATC GAGGCCGCGA AGATCCTTCA TGAGAGGCAA ATGCTCGGGC TGGAAATGCT TACAGAGCGC GGAATCCTTA CGAAAATCAA TTCGGTAGTG ATCCCGGGGG TGAACGACGA GCATCTCATC GAAGTTAATA AGTGGGTCAA GGACAGAGGC GCATTGGTGC ACAACATCAT GCCCCTGATT TCAAAGGCGT CTCACGGTAC CTTCTATGGT TTGAACGGTC AGCGTAGCGC TGCGCCTTTT GAACTAACCG CGCTTCGGGA CCGTCTCGAA GGCACCACGG AGGTTATGCG CCATTGCCGC CATTGCCGCT CCGATGCAAT TGGACTGCTA AGTTATGATC GCGCGCGCGA GTTCACGCTT GCCCAGCTGC CAGCCGAGCC AACCTACGAC GAGGAAAAAC GGCGCGCTTT TCGTCAGTTG ATCGAGCGCG AGCAAGGCAG TCAAATATTG CAAGCAGGAG ATGCGATCAC AGCAGGTTTC AGCGCGGCCT GA
|
Protein sequence | MRKVARIAPA ADKSEATNFG GYGSLSDGSS VSTAIDPATY ENIKDHPCFS RDAHRNYARL HLAVAPACNI QCNYCNRKYD CANESRPGVA SHRLTPDQAL RKTMAVASEV PQLSVVGIAG PGDACYDWNK TKATLIPIAR EIPDIKLCVS TNGLALPDRV EELVDMNVGH VTITVNMVDP NIGTKIYPWI FYQGRHYNGI EAAKILHERQ MLGLEMLTER GILTKINSVV IPGVNDEHLI EVNKWVKDRG ALVHNIMPLI SKASHGTFYG LNGQRSAAPF ELTALRDRLE GTTEVMRHCR HCRSDAIGLL SYDRAREFTL AQLPAEPTYD EEKRRAFRQL IEREQGSQIL QAGDAITAGF SAA
|
| |