Gene Information |
Locus tag | Rleg2_2814 |
Symbol | |
ID | 6981558 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011369 |
Strand | + |
Start bp | 2861154 |
End bp | 2862170 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 643397526 |
Product | Extensin family protein |
Protein accession | YP_002282310 |
Protein GI | 209550393 |
COG category | [S] Function unknown |
COG ID | [COG3921] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
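The coordinate and length fields above are related by simple arithmetic, which the sketch below checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2861154
END_BP = 2862170


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # gene length in bp
    print(protein_length(length))  # protein length in aa
```

Under these assumptions the results agree with the reported gene length of 1017 bp and protein length of 338 aa.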
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.52695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.0111316 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGTTACC CCCACGCAAT CGACATGACG GAAACGGATG CGTTGAGAAA ACTTTCGATA CTGGCGATCC TTCTCGCCGC AGCCCCCTTT CTTGCCGGTG CCCGCCTGCC CCCGCACGGT CCCCTACCCC AGCCTCGACC GGAAGCCGCC GATACGACCA GCCCTACCCC ACTTCCCGAT AAGGCTGAAC CACCGTCACC GGATGATGTG CCGGCACCTC AGCCAAAGCC TGATATGAAG GAGCCGGAGG CACCGGTACC AGACCAGCCA TCGGGGCCGC CCACCGAACC GAAAAAACCC GAATCGGCAA AACCCGAACC GGCAAAGCCC TCACCGGCGG AGCCGATGCA GGGCCCGCCG CTCCCGCCGG GCCAAGGCCC GCAGACGCCG GAGGAAGACA ACAAGCCACC CGCCGAGCAG ACGCTCGAAG AGCAGCATCT GACGATCGAA CCGGAAAGCG ATGCCGAGCA CGCCGAATGC ACCGCTGCGC TCAAGGCCTT GGGCGTCGTC TTCAAGGATG TCCCGCGCAT CGATGACGGC AACGGCTGCG GTATCGACAA GCCGATCACC GTTTCCGAGG CCCTGCCCGG CATCACGCTG AAGCCGGAGG CTACGGTCCG CTGCCCGGCC GCACTCGCCC TTGCCCGCTG GATGAAGGAG AGCGTTATTC CGGCAGCCTC TGCCGCCCTG CCGGAGCAGG GCCGCCTCAC GACGGTCAAC CAGGCAACCT CCTATATGTG CCGCCTTCGC AACAGCGCAG GCACGGGCAA GATCTCCGAA CATGCCCGCG GCAACGCCAT CGACATTGCA AGCTTCCATT TCGAGAAAGG CGAAGATGTC GCCGTCCGCT CCCGCCGCGA AGACCCGACG TTGACCGGCG CCTTCCAACG CACCGTGAGC GCCGCCGGCT GCCTCTATTT CACCACCGTC CTTGACCCCG AAAGCGACGC CGCCCACGAA ACCCATTTCC ACCTCGACGT GATCGAGAGG AAAGGCGGCT ATCGCTACTG CCACTGA
|
Protein sequence | MRYPHAIDMT ETDALRKLSI LAILLAAAPF LAGARLPPHG PLPQPRPEAA DTTSPTPLPD KAEPPSPDDV PAPQPKPDMK EPEAPVPDQP SGPPTEPKKP ESAKPEPAKP SPAEPMQGPP LPPGQGPQTP EEDNKPPAEQ TLEEQHLTIE PESDAEHAEC TAALKALGVV FKDVPRIDDG NGCGIDKPIT VSEALPGITL KPEATVRCPA ALALARWMKE SVIPAASAAL PEQGRLTTVN QATSYMCRLR NSAGTGKISE HARGNAIDIA SFHFEKGEDV AVRSRREDPT LTGAFQRTVS AAGCLYFTTV LDPESDAAHE THFHLDVIER KGGYRYCH
|
| |
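As a rough consistency check between the two sequences, the gene sequence can be translated codon by codon. The sketch below is a minimal illustration, not the tool used to produce this record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in its set of permitted start codons. That start-codon handling is not implemented here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Standard codon-to-amino-acid assignment, with bases ordered T, C, A, G.
# Translation table 11 uses the same assignment for elongation codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    cleaned = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE[cleaned[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence, spaced as in the record.
    print(translate("ATGCGTTACC CCCACGCAAT CGACATGACG"))
```

The 10-base spacing used in the record is stripped before translation, so the gene sequence can be pasted in as shown. Translating its first 30 bases gives MRYPHAIDMT, the first ten residues of the listed protein sequence.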