Gene Rleg2_1143 details

Gene Information

Locus tag: Rleg2_1143
Symbol:
ID: 6979863
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM2304
Kingdom: Bacteria
Replicon accession: NC_011369
Strand:
Start bp: 1154297
End bp: 1155571
Gene Length: 1275 bp
Protein Length: 424 aa
Translation table: 11
GC content: 55%
IMG OID: 643395856
Product: phage terminase, large subunit, PBSX family
Protein accession: YP_002280663
Protein GI: 209548746
COG category: [R] General function prediction only
COG ID: [COG1783] Phage terminase large subunit
TIGRFAM ID: [TIGR01547] phage terminase, large subunit, PBSX family
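
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1154297
end_bp = 1155571

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the remainder should be 0.
codons, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and encodes no amino acid.
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1275 0 424
```

Under these assumptions the result agrees with the reported gene length of 1275 bp and protein length of 424 aa.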


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCAAA TCCAGATACC GGAGAAGTTC GCGCCATTTC TGGAGCCTCA CCGCTATAAG 
ACCGCATACG GGGGGCGTGG CGGTGCCAAG AGCCATACCA TTGGCGGCTT ACTCGTCCAC
ATCGCTGCAC GTCAACCTGA GTTCATCGTG TGCGGGCGCG AGTTTCAAAA CAGTATAGAG
GACAGCGTTT ACCGTCTGCT GGAAAATACC ATCAAGAAGG ATGGTTTGAC GGACTACAAG
TTTACGAAGA CCAGCATCGA AAACACACGC ACCGGTTCGG AGTTCGTGTT TAAAGGGCTG
TCGAAGCTGG ATGGCGTCAG CATCAAGTCC CTTGAAGGAG CGACGAAGCT CTGGATTGAG
GAAGGCCAGA ACATCACCAA CGGCACCCTC AATAACGTGA TCCCCACGAT TCGTACTGAA
GGCTCTGAAA TCTGGACTAG CTTCAACCCC TCGGTCATTG AAGCGCCCGT GTGGCAACGC
CTGGTCGTCA AGCCGCCACC GGATAGCGTG GTCGTCAAGG TAAATCATAA CGATAACCCG
TGGTTTCCAA AGGTGCTGGA AGAAGAACGA CGCCACCATT TCGTCACGAT GGATAGGGCG
ACCTATGACA ACATCTGGGA AGGCGTACCA ATGGCCTTCA CGGCGGGCGC CATCTTCCGC
GAAGAGATGG CCGCGATGGA AAAGCAGGGG CGTATTGGCA AGGTACCGCA TGATCCAACT
AAGCCCGTCA TAGCCGTGTT TGACATGGGT CACGCTGCCA GCGGAAAAGG CGACCCACAT
GCTATAACGT TCGTGCAGCA GGGCAACGGC ACTGCGGTCA ACGCGATTGG CTATTGGGAG
GGCAACAACA AGCCTCTTCC CACTGTAGTT CAGGAAGAGC TGATTGGGCG TCCGTACACC
ATCAGCAAGG TGGTGATGCC GCACGATGCG AACCGCACGA ACTCTCACAC CGGCAAGACG
GACGTAGAAA TAGTGGAGGG CTTCGGATTC ACGGTGGAGA TGCTTGAACG CACCAACGAC
CTAGACCGGG ATGTAAACAA CCTACGCACC GTCCTGCCGC TCATGTTTAT TGATGAAGAA
AATTGCCAAG GGCTGCTAGC TTGCTTGCGT AATCACCGCA GGGAGAAGGA CGAGAAGACG
GGGTTATGGC GCTTCAAACA TGACTGGACG AGTCATGGCG TATCCTCGGC ACGGTACTTG
GCGGTCTATT ACAATATCCA TGGGGGACTT GGATATTACG AGTCGGTTGA TGATGAGCGC
CGCTCCGCGT GGTAG
 
Protein sequence
MNQIQIPEKF APFLEPHRYK TAYGGRGGAK SHTIGGLLVH IAARQPEFIV CGREFQNSIE 
DSVYRLLENT IKKDGLTDYK FTKTSIENTR TGSEFVFKGL SKLDGVSIKS LEGATKLWIE
EGQNITNGTL NNVIPTIRTE GSEIWTSFNP SVIEAPVWQR LVVKPPPDSV VVKVNHNDNP
WFPKVLEEER RHHFVTMDRA TYDNIWEGVP MAFTAGAIFR EEMAAMEKQG RIGKVPHDPT
KPVIAVFDMG HAASGKGDPH AITFVQQGNG TAVNAIGYWE GNNKPLPTVV QEELIGRPYT
ISKVVMPHDA NRTNSHTGKT DVEIVEGFGF TVEMLERTND LDRDVNNLRT VLPLMFIDEE
NCQGLLACLR NHRREKDEKT GLWRFKHDWT SHGVSSARYL AVYYNIHGGL GYYESVDDER
RSAW
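
The relationship between the gene sequence, the protein sequence, and the GC content field can be illustrated with a short script. This is a sketch, not the method the database itself uses: the helper names (`clean`, `gc_content`, `translate`) are hypothetical, and the translation simply assumes that the first codon is a valid start codon and is read as methionine, which is how the GTG start of this gene appears in the protein sequence. The amino acid assignments are the standard ones shared by translation tables 1 and 11.

```python
# Standard amino acid assignments shared by NCBI translation tables 1 and 11,
# listed in TCAG order for the first, second, and third codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks that group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate a CDS from its first base up to the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        # Assumption: the first codon is a start codon and is read as Met.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGAACCAAA TCCAGATACC GGAGAAGTTC GCGCCATTTC TGGAGCCTCA CCGCTATAAG"
print(translate(first_line))  # MNQIQIPEKFAPFLEPHRYK
print(round(gc_content(first_line), 1))
```

The translated fragment matches the first 20 residues of the protein sequence. Passing the full gene sequence to the same functions should give the complete protein and a GC percentage that can be compared with the GC content field.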