Gene Rleg_3521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3521 
Symbol 
ID8014387 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3554806 
End bp3557409 
Gene Length2604 bp 
Protein Length867 aa 
Translation table11 
GC content64% 
IMG OID644826086 
Productprotein of unknown function DUF470 
Protein accessionYP_002977306 
Protein GI241206210 
COG category[S] Function unknown 
COG ID[COG2898] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0528845 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGGGGC ACGGCAATTT GGAAGAGATT GAAGAAGCGG ACGGGTTTTC TTTTGGCGGT 
CTATTCAGAC GGTACAGAAC GCCGCTGCTG GCAGCCGCAT CACTGGTGGT CTTCTGCCTC
GTCGGTTACG CCATCATGCA GCTCACCAAC GAGGTGCGCT ATGACGACGT CGTCGGTGCG
CTCGCCGCCA CCAGGCCGAG CTCGATCCTG CTTGCGTTGT TCTTCACGGC GCTCAGTTTC
CTCGCACTGA TCTTCTACGA TCTCAACGCC ATCGAATATA TCGGCAAGAA GCTGCCCTTT
CCGCATGTCG CGCTGACGGC GTTCAGTGCC TATGCCGTCG GCAATACCGC CGGTTTCGGC
GCCCTGTCTG GGGGGGCGAT TCGCTACCGC GCCTATACGC GCCTCGGGCT CTCGCCGGAG
GATATCGGAC GCATCATCGC CTTCGTCACG CTGTCCTTCG GATTGGGGCT TGCGGGCGTC
GCGGCAATCG CCCTCATCGT CATCGCCGAC GAGATCGGCC CGCTCGTCGG CATCAGCCCC
TTCCTTCTTC GGCTGATCGC CGGTTCGATC GTCGCCATTC TGGGCGCGGT GATGATCATC
GGCCGTGACG GGCGCGTGCT CGATCTTGGC CCTGTGGCAA TCCGCCTGCC GGATTCGCGT
ACCTGGTCGC GGCAGTTCCT CGTCACAGCC TTCGATATCG CCGCCTCGGC GTCGGTGCTC
TATGTGCTGC TGCCGCAGAC AGCCATCGGC TGGCCGGTCT TCCTCGCCGT CTATGCGATC
GCCGTCGGTC TCGGCGTGCT CAGCCATGTT CCGGCCGGGC TCGGCGTGTT CGAGACCGTG
ATCATCGCCT CGCTCGGCAG CGCGGTGAAC ATCGATGCCG TGCTCGGATC GCTGGTGCTC
TACCGGCTGG TCTACCATGT GCTGCCGCTG CTGATCGCCG TGCTCGCGGT CTCGGCGGCG
GAGCTGCGTC GTTTCGTCGA CCATCCGGCG GCCTCCAGCA TGCGGCGCAT CGGCGGACGG
CTGATGCCGC AGCTGCTGTC GGCTCTCGCG CTGCTGCTCG GCGTCATGCT GGTGTTTTCG
AGCGTCACGC CGACGCCGGA CCAGAACCTC GAATTCCTCT CCAACTATCT GCCACTGCCG
ATGGTCGAAG GGGCGCATTT CCTCTCCAGC CTGTTGGGGC TGGCACTCGT CGTCGCCGCG
CGCGGCCTCG GCCAGCGGCT CGACGGCGCC TGGTGGGTCG CCGTGTTCTC GGCGCTTGCC
GCGCTGACCT TGTCGCTGCT GAAGGCGATC GCGCTTGTCG AAGCGGCTTT TCTTGCCTTC
CTCATCTTCG GGCTCTTCGT CAGCCGCCGG CTCTTTACCC GCCAGGCCTC GCTGCTCAAC
CAGGCGCTGA CGGCGTCCTG GCTGATGGCG ATCGCCGTGA TCGTCGTCGG GGCTGTCGTC
ATCCTGCTCT TCGTCTATCG CGACGTCGAA TACAGCAACC AGCTCTGGTG GCAGTTCGAA
TTCACCGCCG AGGCGCCGCG CGGGCTGCGC GCCGTGCTCG GCATCACCAT CATTTCGTCG
GCGATCGCCG TCTTCAGCCT GCTCCGACCG GCGAGCTTCC GGCCGGAGCA GGCGACGGAC
GAGGCGCTGG CGCGTGCCGT CGAGATCGTC ATGAAGCAGG GCAATGCCGA TGCCAATCTT
GTGCGCATGG GCGACAAGAG CATCATGTTC TCGGAAAACG GCGATGCCTT CATCATGTAT
GGCCGGCAGG GCCGCTCCTG GATCGCGCTG TTCGATCCGG TCGGCGACCA TCGCGCCGTG
CAGGAACTCG TCTGGCGTTT CGTCGAAGCG GCGCGCGCCG CCGGCTGCCG GGCCGTGTTC
TATCAGATAT CGCCGGCGCT GCTGTCCCAT TGCGCCGATG CCGGCCTGCG CGCCTTCAAG
CTCGGCGAAC TGGCGGTGGC CGATCTCAGG ACCTTCGAGA TGAAGGGCGG CAAATGGGCA
AACCTTCGCC AGACGGCAAG CCGCGCCCAG CGCGACGGGC TGGAATTCGC CGTCGTCGAA
CCGCAGGACT TGTCTTCGGT CATCGATGAT CTTGCCGCGG TTTCGACGGC CTGGCTCGAG
CATCACAATG CCAAGGAAAA GGGCTTCTCG CTCGGCGCGT TCGATTTCGA TTATGTCTCC
TCGCAGCCGG TCGGCATCCT GAAAAAGGAC GGCAGGATCG TCGCCTTCGC CAATATCCTC
GTGACCGAAT CCAGACAGGA GGGCACGATC GATCTCATGC GCTTCTCGCC GGACGCGCCG
AAGGGCTCGA TGGACTTTCT CTTCGTGCAG ATCATGGAAT ATCTGCGAGG GCAGGGTTTC
ACCCACTTCA ATCTCGGCAT GGCGCCTCTC TCCGGCATGT CGAAACGCGA GGCGGCGCCC
GTCTGGGACC GCATCGGCAG CACCGTCTTC GAACACGGCG AGCGCTTCTA TAACTTCAAA
GGCCTTCGGG CATTCAAATC CAAGTTTCAT CCGCACTGGC AACCGCGCTA TCTTGCGGTC
TCAGGAGGGG GCAATCCGAT GATCGCGTTG ATGGACGCGA CATTTCTGAT TGGGGGCGGA
TTGAAAGGGG TAGTGAGAAA ATGA
 
Protein sequence
MSGHGNLEEI EEADGFSFGG LFRRYRTPLL AAASLVVFCL VGYAIMQLTN EVRYDDVVGA 
LAATRPSSIL LALFFTALSF LALIFYDLNA IEYIGKKLPF PHVALTAFSA YAVGNTAGFG
ALSGGAIRYR AYTRLGLSPE DIGRIIAFVT LSFGLGLAGV AAIALIVIAD EIGPLVGISP
FLLRLIAGSI VAILGAVMII GRDGRVLDLG PVAIRLPDSR TWSRQFLVTA FDIAASASVL
YVLLPQTAIG WPVFLAVYAI AVGLGVLSHV PAGLGVFETV IIASLGSAVN IDAVLGSLVL
YRLVYHVLPL LIAVLAVSAA ELRRFVDHPA ASSMRRIGGR LMPQLLSALA LLLGVMLVFS
SVTPTPDQNL EFLSNYLPLP MVEGAHFLSS LLGLALVVAA RGLGQRLDGA WWVAVFSALA
ALTLSLLKAI ALVEAAFLAF LIFGLFVSRR LFTRQASLLN QALTASWLMA IAVIVVGAVV
ILLFVYRDVE YSNQLWWQFE FTAEAPRGLR AVLGITIISS AIAVFSLLRP ASFRPEQATD
EALARAVEIV MKQGNADANL VRMGDKSIMF SENGDAFIMY GRQGRSWIAL FDPVGDHRAV
QELVWRFVEA ARAAGCRAVF YQISPALLSH CADAGLRAFK LGELAVADLR TFEMKGGKWA
NLRQTASRAQ RDGLEFAVVE PQDLSSVIDD LAAVSTAWLE HHNAKEKGFS LGAFDFDYVS
SQPVGILKKD GRIVAFANIL VTESRQEGTI DLMRFSPDAP KGSMDFLFVQ IMEYLRGQGF
THFNLGMAPL SGMSKREAAP VWDRIGSTVF EHGERFYNFK GLRAFKSKFH PHWQPRYLAV
SGGGNPMIAL MDATFLIGGG LKGVVRK