Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_1864 |
Symbol | |
ID | 6980602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 1911431 |
End bp | 1912420 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 643396586 |
Product | protein of unknown function DUF1624 |
Protein accession | YP_002281375 |
Protein GI | 209549458 |
COG category | [S] Function unknown |
COG ID | [COG3503] Predicted membrane protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.161953 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 0.0500445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTGC CGGCATTCGA AGCCGATGCG GCGGCAAGAC CGCCGCGCAT CGGCCTGTTG GACACGGCGC GCGGCGCCGC GCTGATCGCC ATGGCGAGCT ACCATTTCAG CTGGGACATG GAGTTCATGG GTTACCTCGC GCCGGGCACG GCCGAGACAG GCTGGCTGAA GATCTATGCC CGGGCGATCG CCACGACCTT TCTCTTCATC GTCGGCATCA GCCTGGTGCT TTCGAGCAAG CCGGAGGTCC GCTGGCCCTC CTTCTGGAAG CGTTTCGGCA TGATCGCTGC GGCCGCCGCC GTCATCTCGA TCGCCACGCG CATCGCCATG CCGGAGACGT GGATCTATTT CGGCATCCTG CATTGTATCG CGGTGCTGAC GCTGATCGGT ATCGTTGTCA TCAGGCTGCC GCTCGCCGCC ACGCTGATCG TGACAACAGC GCTCCTTACC GCCTGGCTCA TCGATAATTT CGGCACGCCC GGCCTGCTGC GCTCGTCCTT CTTCGATCCG AAATATCTCG CCTGGATCGG CCTTGCCGTC ATGCCGGATC GGTCGAACGA CTACGTGCCA TTGTTTCCCT GGGCAACGCC GTTCTTCGCT GGATTGAGCA TCGCCTCGAT CGCCATCAGA ACCAGGCTGC CGCATCGGCT TGCCGCCCTT GGCACCGGCT CCTGGTGGCC GGCGCGGCTT GGCCGCCACA GCCTCGCCTT CTACCTCATC CATCAGCCGG TGCTGATCGC CATCGCCTAC GGGCTTTCCC TTCTCGTCCC GCCACCGCAG CCGGATCCGG CCGAAACTTA CCTGAAGCAA TGCAACTCCG CTTGCGTCAT GCAACAGGGC GAAGCGCTCT GCCAAAGCTT CTGCCAATGC ACGCTGAGCA AATTGCAGGC CCAAAGCCTG TTGACTCAGC TGCAGGCAAA CCAGATCGAC GTACAAAACG ATGAACGGGT CCAGACGATC GCCGGCGAAT GCAGCGCCGA GGTGGAATAA
|
Protein sequence | MTLPAFEADA AARPPRIGLL DTARGAALIA MASYHFSWDM EFMGYLAPGT AETGWLKIYA RAIATTFLFI VGISLVLSSK PEVRWPSFWK RFGMIAAAAA VISIATRIAM PETWIYFGIL HCIAVLTLIG IVVIRLPLAA TLIVTTALLT AWLIDNFGTP GLLRSSFFDP KYLAWIGLAV MPDRSNDYVP LFPWATPFFA GLSIASIAIR TRLPHRLAAL GTGSWWPARL GRHSLAFYLI HQPVLIAIAY GLSLLVPPPQ PDPAETYLKQ CNSACVMQQG EALCQSFCQC TLSKLQAQSL LTQLQANQID VQNDERVQTI AGECSAEVE
|
| |