Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg2_6054 |
Symbol | |
ID | 6977440 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM2304 |
Kingdom | Bacteria |
Replicon accession | NC_011366 |
Strand | + |
Start bp | 488053 |
End bp | 489237 |
Gene Length | 1185 bp |
Protein Length | 394 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 643393506 |
Product | proteinase inhibitor I4 serpin |
Protein accession | YP_002278324 |
Protein GI | 209546434 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG4826] Serine protease inhibitor |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.985195 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAAAT CCACTCTGCT GGCGGGTCTT GCCGCTTCGC TGATGACGCT TGCCCCAGCC CATGCCGACA ATTCAGGCGA CGGCAAGGCG ATGCTCGCCG CTCAGGCGGG GCTTGCCGCC GAGCTCATCG ACCGCACGCT GGCAAGGGAG GGGGCTGCCA ACATCATGGT GTCGCCGGCA AGCCTTGCCG CAGCCCTCGG CCTTGCCAGC CTCGGCGCCT CCGCCGAAGG CAAGGCCGCG ATCGCCAAGG GCCTCGGCTT CGGCAGCGAG GTGAAGGGAC CGGAGACGGT GCTTGCCGCC ATGACACCGG AGAAGCCGGC AGCAGCGGAT GCGCCTCTGG CGACGGCGGT TGCCATCGTT TTCGACGACA AGCTGGTGCT CTCCCCCGAC GCGCTGTCCA TGCTCGCCAC CCACCGGATC AAACCGTCGA TCGAGGATCT CGACGGACCG GCATCGGTCG AACACATCAA CGGCTGGGTC AAGGAGACGA CGCGGGGCGC CATTCCCGTC ATGCTCGACG CGCCGCCCGG CGGCGGCTTC GTCAGCCTTG GCGCGCTGTC TTTCAAGGCG CGCTGGAAGA CCCCCTTCGA GAAAGAAAGC CCGGCAAGCC CCTTTCAGCG GCCTGATGGT TCGACGATTT CGGTGCCGAT GATGCATCTC ACAGGCGATG GGCAGAAATT CCGCTTCGAC GAGAAATTCA CCGCCGTCGA TCTTGCCTAT GCCGGCGAAA GCTACAGCAT GGTCGTGGTG GCGGCGCGCT CGGGCAAGGG TGTCGGCGGC GCCGACCTGA AGGCGCTCAC TTCCTGGCTG CAGGGGGAAA AATTCGAACC TGCCAAGGGC GAAATCTTCC TGCCCCGCTT TTCCCTGAGC GACGGGCGTG ATTTGATGCC GGTACTCGAT CAGATGGGCC TGGCGGCCGG GAAGGCCAAC GATGCAGCTT TCCCGGGTTT CACCAAGGAA AACATTCGCT TGTCGCGCGT TCTTCAGAAG ACGATGATCA AGCTCGACGA AAACGGCACG GAGGCAGCGG CAGCCACGGC AGCGATCACC GAACGCAGCA TCGATCCCAA GCTCGTTCGC GTCGTCGCCG ATGCCCGTTT TGCTTTTGCG CTTCGCGATA CAAAGAGCGG CCTGCTGCTC GCCGCAGGCC TGATCGGCGA TCCGCTTCTA GAACAGGATG ATTGA
|
Protein sequence | MPKSTLLAGL AASLMTLAPA HADNSGDGKA MLAAQAGLAA ELIDRTLARE GAANIMVSPA SLAAALGLAS LGASAEGKAA IAKGLGFGSE VKGPETVLAA MTPEKPAAAD APLATAVAIV FDDKLVLSPD ALSMLATHRI KPSIEDLDGP ASVEHINGWV KETTRGAIPV MLDAPPGGGF VSLGALSFKA RWKTPFEKES PASPFQRPDG STISVPMMHL TGDGQKFRFD EKFTAVDLAY AGESYSMVVV AARSGKGVGG ADLKALTSWL QGEKFEPAKG EIFLPRFSLS DGRDLMPVLD QMGLAAGKAN DAAFPGFTKE NIRLSRVLQK TMIKLDENGT EAAAATAAIT ERSIDPKLVR VVADARFAFA LRDTKSGLLL AAGLIGDPLL EQDD
|
| |