Gene Rleg_0119 details


Gene Information

Locus tag: Rleg_0119
Symbol:
ID: 8011357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 111968
End bp: 113086
Gene Length: 1119 bp
Protein Length: 372 aa
Translation table: 11
GC content: 64%
IMG OID: 644822710
Product: transposase IS4 family protein
Protein accession: YP_002973969
Protein GI: 241202873
COG category:
COG ID:
TIGRFAM ID:
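
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon while the protein length does not. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Amino acid count for a CDS whose length includes one stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record: Start bp 111968, End bp 113086
length_bp = gene_length(111968, 113086)
print(length_bp)                  # 1119
print(protein_length(length_bp))  # 372
```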


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.373163
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGATTC GTCCTGAGGT TTTGGATCAT TGGCCGGAAG TGCGCGAGCG GCTTCCGGCG 
GGTTTTGACT TGGAAGCAAC GGCGCGGTTG CGCGGTGCTT TTACGCGGGT GCGGGAAATC
AAGAATGCCG AGACGCTGTT GCGGCTGGCA CTTGCCTATG GCGGCCTTGG CATGTCGCTA
CGCGAGACCT GTGCATGGGC CGAAGCGGGC GGGATCGCCC GTTTGTCAGA CCCATCGCTG
CTCGAGCGGC TGTGCAAAGC GGCGCCTTGG CTTGGCGACA TCGTGGCCGC GCTGATTGCC
GAACAGGCCA AAGTGCCGAC GGGGCGCTTT GCGGGATATC GCTTGCGTGT GCTCGATGGA
ACGTCGATCT GCCATCCGGG CGCTGACCGC ACGACATGGC GGTTGCATGT CGGCTACGAT
CTGGCAACGG CTCAGGTCGA TCAGCTTGAG TTGACCGACA TCCATGGTGC CGAGAACCTT
CAGCGCCTTA CCTACGCACC CGGCGATATC GTGCTGGCCG ATCGCTACTA TGCAAGACCG
CGCGACCTGC GGCCGGTGAT CGACGCCGGT GCAGACTTCA TCGTGCGGAC CGGCTGGAAC
TCGTTGCGCC TGTTGCAGAC GAATGGCGAG CCCTTTGATC TGTTTGCCGC ACTCGCCGCT
CAGCAAGAGC AGGAAGGCGA GGTGCAGGTT CGTGTCCACG AAGGCATGAC GGGGACGCCG
CCACCACCGC CGCTGGCCCT GCGCCTCATT GTCCGACGCA AGGATCCGCA ACAGGCCCAA
GCCGAGCAGG AGCGTCTGCT CAAAGCCGCC CGCAAGCACG GCAAAAAACC CGATCCGCGC
AGTCTCGAGG CGGCGAAGTA CATTCTGCTG CTGACCTCGC TGCCGGCCAC CACCTTCCCG
CCGGCCGATA TCCTCACCCT CTATCGCTTC CGCTGGCAAA TCGAGCTGGC GTTCAAACGG
TTCAAGAGCC TGGCCGGCCT CGACAGCTTG CCGGCCAAGA AGCCGGAACT GGCCCGGGCA
TGGCTCTACG CCAGACTGAT CGTCGCCATC ATCGCCGAAC AGATTGCCGG GCAAGTCCCG
GACTCTCCCC CCTCTGGATG TGGCAACCCC ACTGGCTAG
 
Protein sequence
MKIRPEVLDH WPEVRERLPA GFDLEATARL RGAFTRVREI KNAETLLRLA LAYGGLGMSL 
RETCAWAEAG GIARLSDPSL LERLCKAAPW LGDIVAALIA EQAKVPTGRF AGYRLRVLDG
TSICHPGADR TTWRLHVGYD LATAQVDQLE LTDIHGAENL QRLTYAPGDI VLADRYYARP
RDLRPVIDAG ADFIVRTGWN SLRLLQTNGE PFDLFAALAA QQEQEGEVQV RVHEGMTGTP
PPPPLALRLI VRRKDPQQAQ AEQERLLKAA RKHGKKPDPR SLEAAKYILL LTSLPATTFP
PADILTLYRF RWQIELAFKR FKSLAGLDSL PAKKPELARA WLYARLIVAI IAEQIAGQVP
DSPPSGCGNP TG
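
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks need to be stripped before the sequences can be used. The following is a minimal sketch of that cleanup, together with a GC fraction and a simple translation. It assumes the amino acid assignments of the standard genetic code, which translation table 11 shares, and it does not handle the alternative start codons that table 11 allows (this gene starts with ATG). All function names are illustrative.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to display the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence as displayed in the record
first_blocks = clean_sequence("ATGAAGATTC GTCCTGAGGT TTTGGATCAT")
print(translate(first_blocks))  # MKIRPEVLDH
```

Applying `clean_sequence` to the full gene sequence and passing the result to `gc_fraction` and `translate` gives values that can be compared with the GC content and protein sequence listed in the record.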