Gene Information |
Locus tag | Rleg2_2180 |
Symbol | |
ID | 6980919 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | - |
Start bp | 2233959 |
End bp | 2235269 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 643396899 |
Product | Extensin family protein |
Protein accession | YP_002281687 |
Protein GI | 209549770 |
COG category | [S] Function unknown |
COG ID | [COG3921] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
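The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are inclusive, 1-based coordinates and that the gene length includes the stop codon. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields; names are illustrative.
start_bp = 2233959
end_bp = 2235269

# Assumption: both coordinates are inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# A CDS that is not spliced should be a whole number of codons.
codons, remainder = divmod(gene_length_bp, 3)

# Assumption: the final codon is a stop codon and is not translated.
protein_length_aa = codons - 1

print(gene_length_bp, remainder, protein_length_aa)
```

Under these assumptions the computed values match the Gene Length (1311 bp) and Protein Length (436 aa) fields.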
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.915036 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 0.64676 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCGTTTG TTTCCTTTCC TCGGCGATCC CTTTTGCCCC TGCTGCTCTC GGCGGCGCTG ACGACCTGTT CGATCAGCGA TGGGCTGGTG CCGCCGGCCA ATGTCGACAG CGGCACCAGG GTCAGCTCGA TCTCGCCGGC CCGGGCGCCG GCGGCGCGCA TGGCGCCCTC GGTGCGAATG ACGCCGGTGG AAAGCCAGGC CTCCTATCCC GTTTCCAATG CGCCGGTCGG CAATAGCCAG GGCTCGGTCG ATTACCTCAA TACGCCGAAC CTTGCCGGCA CCGGCCATGC GGCGCCCTCC CAGCCCGCGG CGCGCGGGGG GCGCCTGCCG ATGATCGACA GCGATGAGGC GCTGGCGGCG GGCCAGCCGA GCGGCAATTG GGGCGGCACG CAGAACCTTG CGATCCCTTC CGGTGGCGTC AATATGGATG ACGAACTTGG AGCAGAGCCG GTCGTCGGGT TGGCGCAGGA ACAACAGCAG CAGATCGCAG AGGGCAATGC GACCGAGCCC GTCGTTGACG GCATCGGCAC CGATAGCCCT ACGCAGGTGA ACCAGCCGCT CCGCCAGCCC GCACTGATGC CGCAGCCGGC AGCACAAGCG CAGATGAGCC GGGCGCCCGC CTGGAACGAC GGCAGCCCTG TCGTGGCGCC GACACGCGTT CCGGAAGAGG ACGAAAGCGA AGAGGTCGCG ATGCTGCGGC CCAACAATCC GATGATGAGC GAGCCGGCGG CACCCGTCGA TCCCAGCATC ATGCCGGCCT CCGAGCTTGC CTGCCGGCGC GAGCTGAAGC GCATGGGCGT GCTCTTCGAC GAGAAGCCGC CGATCTCGAA CGGGCCGGCC TGCCAGGTGC CCTATCCGGT ATCGCTGAAG GGGCTTTCCG GCAGTATCGG CGTCAAGCCG GCGGTAACGC TGAACTGTCA GGTGACGCTC GCCTTCGCCA AATGGGTGAA GAACGAGTTG GCGCCGTCTG CCCGCTACCG CTACTGGAGT GGCATCAGGA CGATCCAGCC GCTCGGCGGC TATTCCTGCC GCCGCATGAA CAACAGCCGG CAGAGATACA ATCCGATGTC TGAACACGCC CGCGGCAATG CCATCGACGT CGGCAAGTTC GTGCTGAAGA ACGGTCATGA GATCGACGTG CGCAAAAAGG GCCTGTTCTC GCTGCGCGAG GGCCGGTTGC TGAAGGCGGT GCGCACCGAC AGCTGCCGCT ATTTCAACAC CGTGCTCGGC CCCGGCAGCA ACCCGGAACA CTGGAACCAC TTCCACTTCG ACCTGCGCTC CCGCAAGAGC GGCAAGGTCT ATTGCGACTG A
Protein sequence | MAFVSFPRRS LLPLLLSAAL TTCSISDGLV PPANVDSGTR VSSISPARAP AARMAPSVRM TPVESQASYP VSNAPVGNSQ GSVDYLNTPN LAGTGHAAPS QPAARGGRLP MIDSDEALAA GQPSGNWGGT QNLAIPSGGV NMDDELGAEP VVGLAQEQQQ QIAEGNATEP VVDGIGTDSP TQVNQPLRQP ALMPQPAAQA QMSRAPAWND GSPVVAPTRV PEEDESEEVA MLRPNNPMMS EPAAPVDPSI MPASELACRR ELKRMGVLFD EKPPISNGPA CQVPYPVSLK GLSGSIGVKP AVTLNCQVTL AFAKWVKNEL APSARYRYWS GIRTIQPLGG YSCRRMNNSR QRYNPMSEHA RGNAIDVGKF VLKNGHEIDV RKKGLFSLRE GRLLKAVRTD SCRYFNTVLG PGSNPEHWNH FHFDLRSRKS GKVYCD
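The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any further use. The following is a minimal sketch of how the GC content field could be recomputed from the gene sequence. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example call uses only the first two blocks of the gene sequence rather than the full entry.

```python
def clean_sequence(raw: str) -> str:
    """Join whitespace-separated blocks into one uppercase string."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence, used only as a short illustration.
first_blocks = "ATGGCGTTTG TTTCCTTTCC"
print(len(clean_sequence(first_blocks)), gc_percent(first_blocks))
```

Passing the complete gene sequence string to `gc_percent` gives a value that can be compared with the GC content field of the record.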