Gene Rleg2_6395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6395 
Symbol 
ID6983467 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011371 
Strand
Start bp42222 
End bp44048 
Gene Length1827 bp 
Protein Length608 aa 
Translation table11 
GC content63% 
IMG OID643399393 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002284149 
Protein GI209552234 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0500237 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATTG CCGTAGAGAA TCTCACCCCG ACCGTCACCA CCGGCCCGCT TCCGGCATCG 
CGGAAGATCC ATATTCCGGG AGATATCCAT GGCGATATCC GCGTGCCGAT GCGCGAAATC
AGCGTCCATC CGACCGCCGG CGAACCGCCT GTCGTCGTCT ATGATTCCTC CGGCCCTTAC
ACGATCGAAG GTGCGGAGAT CCGCATCGAG CAGGGCCTAC CCCAACTGCG CCGCGACTGG
GTGCTGGCCC GCGGCGATGT CGAAGCCTAT GACGGCCGCC ATGTCCGCCC CGAAGATAAC
GGCTTTGTCA GCGGCGACCG GCTGACGCCG GAATTTCCGG GGCGGCGCCG GCCGCTGCGC
GCCAAGGACG GCAAAGCCGT CACCCAGCTC GCCTATGCAC GGGCCGGCGT CATCACGCCG
GAGATGGAGT TCGTGGCCAT TCGCGAGAAT CTCGGCCGCA AGGCGAAGGC CGAAGCCCTG
GTGCGTGACG GCGAGAGCTT CGGCGCCGAT ATTCCCGATC ACGTGACGCC GGAATTCGTC
CGCCGGGAAG TCGCCGCCGG CCGGGCGATC ATCCCGGCCA ACATCAACCA TCCCGAAAGC
GAACCGATGA TCATCGGCCG GAACTTTCTG GTGAAGATCA ACGCCAATAT CGGCAACTCC
GCCGTCACCT CATCGATGGC TGAGGAAGTC GAGAAGATGG TCTGGGCTGC CCGCTGGGGC
GCCGATACGG TCATGGATCT GTCGACCGGC CGCAACATCC ACAATATCCG CGAATGGATC
ATCCGCAATT CGCCGCTGCC GATCGGCACG GTGCCGCTCT ACCAGGCGCT GGAAAAGGTT
GGCGGCATCG CCGAAGACCT CACCTGGGAG ATCTATCGCG ATACGCTGAT CGAGCAGGCC
GAACAGGGCG TCGACTATTT CACCATCCAT GCCGGTGTCC GGCTGCATTA CATCCCACTC
ACCGTCAATC GCGTCACCGG CATCGTCTCG CGCGGCGGTT CGATCATGGC CAAGTGGTGT
CTGCATCATC ACCGCGAAAG CTTCCTCTAC GAGCATTTCG AGGAAATCTG CGATATCTGC
CGGGCCTATG ACGTCTCCTT CTCGCTCGGC GACGGCCTTC GCCCCGGCTC GATCGCCGAT
GCCAACGATG CGGCGCAGTT TGCCGAACTC GAAACGCTGG GCGAGTTGAC GAAAATCGCC
TGGGCCAGGG ATTGCCAGGT GATGATCGAG GGACCTGGCC ATGTGCCGAT GCACAAGATC
AAGGAAAACA TGGACAAGCA GCTCGCCGTC TGCGGCGAGG CGCCCTTCTA CACGCTGGGG
CCGCTGACCA CCGATATCGC GCCCGGCTAC GACCATATCA CCTCCGGCAT CGGCGCGGCG
ATGATCGGCT GGTTCGGCAC GGCCATGCTC TGCTACGTCA CCCCGAAAGA ACATCTTGGC
CTGCCCGATC GCAACGACGT CAAGACCGGC GTCATCACCT ACAAGATCGC CGCCCATGCC
GCCGACCTTG CCAAGGGGCA CCCGGCCGCC CGCCTGCGCG ACGACGCCCT GTCGCGGGCG
CGCTTCGAAT TCCGTTGGGA AGACCAGTTC AACCTGTCGC TCGATCCCGA CACCGCCCGC
AGCTTCCATG ACGAGACCCT GCCGAAGGAA GCGCACAAAG TGGCGCATTT CTGCTCGATG
TGCGGCCCGA AATTCTGCTC GATGCGCATA TCGCACGACA TCCGCGCCGA AGCACAAAAG
GAAGGGCTGG ATGCGATGGC GGCGAAATAC CGGGAGGGCG GCGATCTCTA TATGCCGATC
GATACCCACG CCAACGGGGC GGAATGA
 
Protein sequence
MTIAVENLTP TVTTGPLPAS RKIHIPGDIH GDIRVPMREI SVHPTAGEPP VVVYDSSGPY 
TIEGAEIRIE QGLPQLRRDW VLARGDVEAY DGRHVRPEDN GFVSGDRLTP EFPGRRRPLR
AKDGKAVTQL AYARAGVITP EMEFVAIREN LGRKAKAEAL VRDGESFGAD IPDHVTPEFV
RREVAAGRAI IPANINHPES EPMIIGRNFL VKINANIGNS AVTSSMAEEV EKMVWAARWG
ADTVMDLSTG RNIHNIREWI IRNSPLPIGT VPLYQALEKV GGIAEDLTWE IYRDTLIEQA
EQGVDYFTIH AGVRLHYIPL TVNRVTGIVS RGGSIMAKWC LHHHRESFLY EHFEEICDIC
RAYDVSFSLG DGLRPGSIAD ANDAAQFAEL ETLGELTKIA WARDCQVMIE GPGHVPMHKI
KENMDKQLAV CGEAPFYTLG PLTTDIAPGY DHITSGIGAA MIGWFGTAML CYVTPKEHLG
LPDRNDVKTG VITYKIAAHA ADLAKGHPAA RLRDDALSRA RFEFRWEDQF NLSLDPDTAR
SFHDETLPKE AHKVAHFCSM CGPKFCSMRI SHDIRAEAQK EGLDAMAAKY REGGDLYMPI
DTHANGAE