Gene Information |
Locus tag | Rleg_2120 |
Symbol | |
ID | 8013143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2113712 |
End bp | 2115277 |
Gene Length | 1566 bp |
Protein Length | 521 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644824706 |
Product | DEAD/DEAH box helicase domain protein |
Protein accession | YP_002975936 |
Protein GI | 241204840 |
COG category | [J] Translation, ribosomal structure and biogenesis; [K] Transcription; [L] Replication, recombination and repair |
COG ID | [COG0513] Superfamily II DNA and RNA helicases |
TIGRFAM ID | |
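The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (the record does not say so explicitly) and that the protein length excludes the stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table above.
start_bp = 2113712
end_bp = 2115277
gene_length_bp = 1566
protein_length_aa = 521


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = expected_protein_length(computed_length)

print("gene length matches:", computed_length == gene_length_bp)
print("protein length matches:", computed_protein == protein_length_aa)
```

Under these assumptions, both checks agree with the values listed in the record: 1566 bp for the gene and 521 aa for the protein.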
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.308058 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.00839171 |
Fosmid Hitchhiker | No |
Fosmid clonability | decreased coverage |
Sequence |
Gene sequence | ATGGAAACAG TTTTCCCTTT GACGACATTT GCTGACCTTG GCTTAAGCCA AAAAGTGCTA TCCGCTGTTA CGGACGCGGG CTACACGATC CCCACGCCTA TTCAGGCGGG AGCCATCCCG TTTGCGCTTG AGCGCCGCGA TATTTGCGGT ATCGCGCAGA CCGGCACTGG CAAGACGGCA TCCTTCGTGC TGCCGATGCT GTCGCTTCTG GAGAAGGGCC GTGCGCGTGC GCGCATGCCC CGCACGCTGA TTCTCGAACC GACGCGCGAA CTCGCTGCCC AGGTCGCCGA GAATTTCGAG AAATACGGCA AGAACCATCG CCTTAACGTC GCCCTTCTGA TCGGCGGCGT TTCCTTCGAG GACCAGGACC GCAAGCTCGA ACGCGGCGCC GATGTACTGA TCTGCACCCC CGGCCGCCTT CTTGACCATT TCGAGCGCGG CAAGCTGCTG ATGAGCGGCG TCGAGATCCT CGTCATCGAC GAGGCCGACC GCATGCTCGA TATGGGTTTC ATCCCCGATA TCGAGCGCAT CGCCAAGATG ATCCCGTTCA CCCGCCAGAC GCTGTTCTTC TCGGCAACCA TGCCGTCGGA AATCCAGAAG CTCGCCGACC GTTTCCTGCA GAATCCCGAA CGTATCGAGG TCGCAAAGCC GGCTTCCGCG GCCGAGACCG TGACACAGCG TTTCGTCGCC TCCCACGGCA AGGATTACGA GAAGCGAGCG GTTCTGCGAG AACTCGTCCG CGCCCAGACC GAGCTCAAGA ACGCCATCGT CTTCTGCAAC CGCAAGAAGG ATGTCGCCGA TCTCTTCCGG TCGCTCGAGC GTCATGGCTT CTCGGTCGGC GCCCTGCATG GCGACATGGA CCAGCGTTCC CGCACGATGA CGCTGCAAAG CTTCCGTGAC GGCAATCTCC AGCTTCTGGT CGCTTCCGAC GTCGCCGCCC GCGGCCTCGA TATCCCTGAT GTCAGCCACG TCTTCAATTT CGACGTGCCT ATTCATTCCG AGGATTACGT CCACCGCATC GGCCGCACCG GCCGTGCCGG CCGCTCGGGT GCCGCCTTCA CGCTCGTCAC CAAGCGCGAC ACTAAGTTCG TCGATGCCAT CGAGAAGCTG ATCGGCGAAA AGGTCGAATG GCTGAGCGGT GACCTGACGT CGCTGCCGCC GCCGGCAGAA GACAGCCGGG ACAGCGAACG CCCGCGCCGC AACGGCCGCG AGCGTGGCGC CAGAGATGGT GCAGGCCGAG ATCGCGCGCC GAGAGAGAAT GGCGACAAGG ATCGTGGACG CGGACGAGGC AATCGCGCCG CCGCGAGTCA TAAATCTGAT AACGACATAC AGGATAATGG CGTCGACGTG ATTGAAGCAG CACCAGTAAA GGCCGACATC GTGAAGAACG AGCGCAAAGC AGAGCAGAAG CCGCAGAACA ACGCGCGCAA CAGTCGGCCT TACCCGGCAA ACGACGACAG CCGCGACCGC CGCCGTCATC GCGACCACGA CGATGGCCCG ACCCCGGTCG GCTTCGGCGA CGATATCCCC GCCTTCATGC TGATCGCCGG CAGCGCCAAG GTCTAA
Protein sequence | METVFPLTTF ADLGLSQKVL SAVTDAGYTI PTPIQAGAIP FALERRDICG IAQTGTGKTA SFVLPMLSLL EKGRARARMP RTLILEPTRE LAAQVAENFE KYGKNHRLNV ALLIGGVSFE DQDRKLERGA DVLICTPGRL LDHFERGKLL MSGVEILVID EADRMLDMGF IPDIERIAKM IPFTRQTLFF SATMPSEIQK LADRFLQNPE RIEVAKPASA AETVTQRFVA SHGKDYEKRA VLRELVRAQT ELKNAIVFCN RKKDVADLFR SLERHGFSVG ALHGDMDQRS RTMTLQSFRD GNLQLLVASD VAARGLDIPD VSHVFNFDVP IHSEDYVHRI GRTGRAGRSG AAFTLVTKRD TKFVDAIEKL IGEKVEWLSG DLTSLPPPAE DSRDSERPRR NGRERGARDG AGRDRAPREN GDKDRGRGRG NRAAASHKSD NDIQDNGVDV IEAAPVKADI VKNERKAEQK PQNNARNSRP YPANDDSRDR RRHRDHDDGP TPVGFGDDIP AFMLIAGSAK V
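The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before analysis. The following is a minimal sketch, not the database's own tooling. It strips the spacing, computes a GC fraction, and translates DNA using the codon assignments of translation table 11, which match the standard genetic code. The helper names are hypothetical. Alternative start codons, which table 11 permits, are not handled specially here because this gene begins with ATG.

```python
from itertools import product

# Codon assignments shared by the standard code and table 11,
# listed in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace and uppercase the sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate complete codons; unknown codons become 'X'."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from the record above.
fragment = clean_sequence("ATGGAAACAG TTTTCCCTTT")
print(translate(fragment))  # METVFP
```

The translated fragment matches the first six residues of the listed protein sequence (METVFP). Applying `gc_fraction` to the full cleaned gene sequence gives a value that can be compared with the GC content field in the table.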