Gene Rleg_4449 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4449 
Symbol 
ID8015215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4583314 
End bp4584951 
Gene Length1638 bp 
Protein Length545 aa 
Translation table11 
GC content63% 
IMG OID644827024 
ProductABC transporter related 
Protein accessionYP_002978226 
Protein GI241207130 
COG category[R] General function prediction only 
COG ID[COG4172] ABC-type uncharacterized transport system, duplicated ATPase component 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.875798 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGACA TGACAGAACC GCTGCTTTCC GTCCGCGATC TCTCGGTTGC CTTTCATCAG 
GGCGGCGAGA CCTCGCTCGC CGTCGATCAC ATCTCCTTCG ATATCGCCAA AGGCGAGGTC
GTGGCGCTCG TCGGCGAATC CGGCTCCGGC AAGTCGGTCT CGGCCAACTC GATCCTGCGA
CTTTTGCCTT ATCCCTCGGC AAGCCATCCC TCCGGCGAAA TCCTCTTCAA GGGTAAGGAC
CTCTTGAAGG CGTCGGAGCG GGAACTGCGT GAAGTGCGCG GCAACGACAT CACCATGATC
TTCCAGGAGC CGATGACCTC GCTCAATCCG CTTCATACGA TCGAGAAGCA GATCGCCGAG
ATCCTCGCCC TGCATCAGGG CCTTACCGGC CAGCCGGCGC GTGAGCGCGT GCTGGAATTG
CTGAACCAGG TCGGCATCCG CGAGCCGGAG AGGCGGTTGA AGGCCTATCC GCATGAACTG
TCAGGCGGCC AGCGCCAGCG CGTCATGATC GCCATGGCGC TCGCCAACCG GCCGGAATTG
CTGATCGCCG ACGAGCCGAC CACCGCCCTC GACGTCACCG TGCAGGCGCA GATCCTCGAA
TTGCTCCGGC AGTTAAAAGC CGTCCACGGC ATGTCGATGC TGTTCATCAC TCATGATCTC
GGCATCGTCC GCAAATTCGC TGATCGCGTC TGCGTCATGA CCAAGGGCAG GATCGTCGAA
ACCGGCACCG TCGAGGACGT CTTCGCCAAT CCGAAGCATG AATATACCCG GCATCTCCTT
GCCTCCGAAC CGCGCGGCGA GCCGCCGCTC GCCGATCCGT CGAAACCGAT GGTGATGGAA
GGTTCGGATA TCCGTGTCTG GTTCCCGATC AAGGCTGGGC TGATGCGCCG TGTCGTCGAT
CACGTCAAGG CGGTCGACGG CATCGATCTT TCGCTGCGCG CCGGCCAGAC GCTCGGCGTC
GTCGGCGAAT CCGGCTCGGG CAAGACCACG CTCGGCCTGG CGCTCACCCG GCTGATTTCC
TCGCAAGGGC GGATCGCCTT CGTCGGCAAG GACATAGCTG GCTATTCATT CAATGAGATG
CGGCCGTTGC GCAACCAGCT GCAGGTCGTC TTCCAGGATC CCTACGGATC GCTGAGCCCG
CGCATGTCCG TCGGCGATAT CGTCGCCGAA GGGCTGAAGG TGCATGAGCG GTCGCTGACC
GCAGAAGAAC GCGACCAGCG CGTCTGCTGG GCGCTGGAGG AGGTGGGTCT CGATCCGCTG
ACCCGCTGGC GTTATCCGCA CGAATTCTCG GGCGGCCAGC GCCAGCGCAT CGCCATCGCC
CGGGCCATGG TGCTGAAGCC GCGCTTCGTC ATGCTCGACG AGCCGACCTC TGCGCTCGAC
ATGAGCGTGC AGGCGCAGGT GGTCGATTTG CTGCGCGATC TGCAGAAGAA GCACGATCTC
GCCTATCTCT TCATCAGCCA CGACCTGAAA GTGGTGAAGG CGCTCGCCAA CGACGTCATC
GTCATGCGTT TCGGCAAGGT GGTGGAGCAG GGACCGTCCT CGGAAATCTT CCGCGCGCCG
AAGGATGATT ACACCAGGGC GCTGATGGCC GCCGCTTTCA ACATCGAGGC GGTGCCGACG
CCCGCCGTGC AGCAGTAA
 
Protein sequence
MSDMTEPLLS VRDLSVAFHQ GGETSLAVDH ISFDIAKGEV VALVGESGSG KSVSANSILR 
LLPYPSASHP SGEILFKGKD LLKASERELR EVRGNDITMI FQEPMTSLNP LHTIEKQIAE
ILALHQGLTG QPARERVLEL LNQVGIREPE RRLKAYPHEL SGGQRQRVMI AMALANRPEL
LIADEPTTAL DVTVQAQILE LLRQLKAVHG MSMLFITHDL GIVRKFADRV CVMTKGRIVE
TGTVEDVFAN PKHEYTRHLL ASEPRGEPPL ADPSKPMVME GSDIRVWFPI KAGLMRRVVD
HVKAVDGIDL SLRAGQTLGV VGESGSGKTT LGLALTRLIS SQGRIAFVGK DIAGYSFNEM
RPLRNQLQVV FQDPYGSLSP RMSVGDIVAE GLKVHERSLT AEERDQRVCW ALEEVGLDPL
TRWRYPHEFS GGQRQRIAIA RAMVLKPRFV MLDEPTSALD MSVQAQVVDL LRDLQKKHDL
AYLFISHDLK VVKALANDVI VMRFGKVVEQ GPSSEIFRAP KDDYTRALMA AAFNIEAVPT
PAVQQ