Gene Rleg2_1476 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1476 
Symbol 
ID6980206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1505920 
End bp1507185 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content59% 
IMG OID643396197 
Productaromatic hydrocarbon degradation membrane protein 
Protein accessionYP_002280994 
Protein GI209549077 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.378935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.369147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCGC GTAGGTTTTC CAAGGGCGTA ACGTCGCTCG TTCTTCTTTC GTCGCTGGCA 
TCGCCGGCCT TCGCCGGCGG CCTGGAGCGC GGCGGCTACA ATATCGATCA GCTGTTCGAT
ACCTCGCCTT TCTCGTTTCA GTCGGGCGTT ACCTATGTCA CGCCGCAGCG CAAGCTGAAG
GACGTCAGTG ACACGAACAC CTCTGTCTTA ACAGGGGGTG GCAATCTCAA CACCCGCCCG
AACAGTGCCG ACGAGTCCTC GAATTACACC ATCCCTTACA TCGGCTTTAA GGCCGGTTTC
GCCGATGCAG TCGACTGCCT TGTCGACTAT TCCGAACCCT TCGGCGGGCA TACCGATCCG
GGTTCAAACT GGGCTGGCGC CAACAACAAC ATCGAGACTG AAATCAAAAC CCGCAACTAT
GGCGGCACTT GCTCCTATCG TTTTGATATG GGCCCTGGAC AGCTTCGTTT CATTGGCGGC
GCTTTCTATC AGGAAGTCGA AGGTTTCAAA GAGCGTCTGG TTTCAACGCT TCCGCTTTTG
CTCGGCACCG GCACCGGCGT TGGCCGCCTC GATCTCGAAG ACAGCGGCTG GGGCTGGCGA
GCCGGCGTGG CCTACGAGAT TCCGGAATAT GCGATGCGTG CGAGCCTCGT CTATAACAGC
CGTGTCAAGT ACGACAACCT GACCGGGACT GTGGATCTCC GTCAGGTTCC AATCGTGCCG
ATATACGGCG GCAAAATCAC CAACGTCTTC GGCTCCGCCG AAGCGCCGGA TTCGCTGGAG
ATGAAGCTGC AAAGCGGCAT CGCTCCGGAT TGGCTTGCTT TCGGATCGGT CAAGTGGACG
AACTGGAGTG TTCTGCAGTC CGTGCCTTTC TGCCCGACGT CGACGAAGGG CGTGGCCGCC
TGCACAGCGG GCGGCGCTAC GGAACTCACT TCGCTTGACC TTCTCTATCG TGACGGCTGG
ACCATCTCCG GCGGCGTGGG CCACAAGTTC AACGATCAGT GGGCCGGCGC GGTCAGCGTC
ACGTGGGACC GTGGCACCAG TCAGGGTTAT GGCGCACAGA CCGACAGCTG GACGCTCGGT
CTCGGCGCCG CCTACACGCC GACCGAACAT ATCGAATGGC GTTTTGCCGG GGCCGTTGGC
GTATTGACGA GCGGTTCGTC CGGCACTTTC GAGTATAATG GCCAGACCTA TGGCGACGAT
GTCTCCTATT CCTTTGGCAA CGATCTGGTC GCGGCACTGT CGACGAGCCT AAAGATCAAG
TTCTAA
 
Protein sequence
MASRRFSKGV TSLVLLSSLA SPAFAGGLER GGYNIDQLFD TSPFSFQSGV TYVTPQRKLK 
DVSDTNTSVL TGGGNLNTRP NSADESSNYT IPYIGFKAGF ADAVDCLVDY SEPFGGHTDP
GSNWAGANNN IETEIKTRNY GGTCSYRFDM GPGQLRFIGG AFYQEVEGFK ERLVSTLPLL
LGTGTGVGRL DLEDSGWGWR AGVAYEIPEY AMRASLVYNS RVKYDNLTGT VDLRQVPIVP
IYGGKITNVF GSAEAPDSLE MKLQSGIAPD WLAFGSVKWT NWSVLQSVPF CPTSTKGVAA
CTAGGATELT SLDLLYRDGW TISGGVGHKF NDQWAGAVSV TWDRGTSQGY GAQTDSWTLG
LGAAYTPTEH IEWRFAGAVG VLTSGSSGTF EYNGQTYGDD VSYSFGNDLV AALSTSLKIK
F