Gene Rleg2_6054 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6054 
Symbol 
ID6977440 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp488053 
End bp489237 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content64% 
IMG OID643393506 
Productproteinase inhibitor I4 serpin 
Protein accessionYP_002278324 
Protein GI209546434 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4826] Serine protease inhibitor 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.985195 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAAT CCACTCTGCT GGCGGGTCTT GCCGCTTCGC TGATGACGCT TGCCCCAGCC 
CATGCCGACA ATTCAGGCGA CGGCAAGGCG ATGCTCGCCG CTCAGGCGGG GCTTGCCGCC
GAGCTCATCG ACCGCACGCT GGCAAGGGAG GGGGCTGCCA ACATCATGGT GTCGCCGGCA
AGCCTTGCCG CAGCCCTCGG CCTTGCCAGC CTCGGCGCCT CCGCCGAAGG CAAGGCCGCG
ATCGCCAAGG GCCTCGGCTT CGGCAGCGAG GTGAAGGGAC CGGAGACGGT GCTTGCCGCC
ATGACACCGG AGAAGCCGGC AGCAGCGGAT GCGCCTCTGG CGACGGCGGT TGCCATCGTT
TTCGACGACA AGCTGGTGCT CTCCCCCGAC GCGCTGTCCA TGCTCGCCAC CCACCGGATC
AAACCGTCGA TCGAGGATCT CGACGGACCG GCATCGGTCG AACACATCAA CGGCTGGGTC
AAGGAGACGA CGCGGGGCGC CATTCCCGTC ATGCTCGACG CGCCGCCCGG CGGCGGCTTC
GTCAGCCTTG GCGCGCTGTC TTTCAAGGCG CGCTGGAAGA CCCCCTTCGA GAAAGAAAGC
CCGGCAAGCC CCTTTCAGCG GCCTGATGGT TCGACGATTT CGGTGCCGAT GATGCATCTC
ACAGGCGATG GGCAGAAATT CCGCTTCGAC GAGAAATTCA CCGCCGTCGA TCTTGCCTAT
GCCGGCGAAA GCTACAGCAT GGTCGTGGTG GCGGCGCGCT CGGGCAAGGG TGTCGGCGGC
GCCGACCTGA AGGCGCTCAC TTCCTGGCTG CAGGGGGAAA AATTCGAACC TGCCAAGGGC
GAAATCTTCC TGCCCCGCTT TTCCCTGAGC GACGGGCGTG ATTTGATGCC GGTACTCGAT
CAGATGGGCC TGGCGGCCGG GAAGGCCAAC GATGCAGCTT TCCCGGGTTT CACCAAGGAA
AACATTCGCT TGTCGCGCGT TCTTCAGAAG ACGATGATCA AGCTCGACGA AAACGGCACG
GAGGCAGCGG CAGCCACGGC AGCGATCACC GAACGCAGCA TCGATCCCAA GCTCGTTCGC
GTCGTCGCCG ATGCCCGTTT TGCTTTTGCG CTTCGCGATA CAAAGAGCGG CCTGCTGCTC
GCCGCAGGCC TGATCGGCGA TCCGCTTCTA GAACAGGATG ATTGA
 
Protein sequence
MPKSTLLAGL AASLMTLAPA HADNSGDGKA MLAAQAGLAA ELIDRTLARE GAANIMVSPA 
SLAAALGLAS LGASAEGKAA IAKGLGFGSE VKGPETVLAA MTPEKPAAAD APLATAVAIV
FDDKLVLSPD ALSMLATHRI KPSIEDLDGP ASVEHINGWV KETTRGAIPV MLDAPPGGGF
VSLGALSFKA RWKTPFEKES PASPFQRPDG STISVPMMHL TGDGQKFRFD EKFTAVDLAY
AGESYSMVVV AARSGKGVGG ADLKALTSWL QGEKFEPAKG EIFLPRFSLS DGRDLMPVLD
QMGLAAGKAN DAAFPGFTKE NIRLSRVLQK TMIKLDENGT EAAAATAAIT ERSIDPKLVR
VVADARFAFA LRDTKSGLLL AAGLIGDPLL EQDD