Gene Rleg_2604 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_2604 
Symbol 
ID8013565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp2600559 
End bp2601839 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content65% 
IMG OID644825180 
Productprotein of unknown function DUF442 
Protein accessionYP_002976410 
Protein GI241205314 
COG category[S] Function unknown 
COG ID[COG3453] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR01244] conserved hypothetical protein TIGR01244 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.148627 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATCCG TGAAGGTCAA TGAGCTGATA TCGGTGGCGG GCCAGCCCGA CGCCGCAGGT 
TTTGCCGCCT TCGCGGCTGA TGGCTTTGCT GCCGTCATCA ATGCCCGGCC GGATGGCGAG
GAGCCGGGAC AGCCGGGCAA TACGGCGGAA AAGGCTTCCG CCGCTGCCGC CGGGCTCGCC
TACAGCTTCG TGCCGGTGAA GGGGACCGAA ATCACCGAGG CCGATATCTG CGCCTTCCAG
ACGGCGATGG CCGAGGCCAA GGGACCGGTC GTCGCCCATT GCAAGAGCGG CACGCGGGCG
TTGACGCTTT ATGCGCTGGG CGAGGTGCTC GACGGGCGGA TGAAGCCCGG AGATGTCGAG
GCCTTCGGTC AAAACCTCGG TTTTGATCTT GCCGGCGCGC GACGCTGGCT GGAAAAGCGG
TCAGGGCAGG TGGCTGATGT GAAGGCCTTC TTCGAGCCCC GCACCTGCAG TGTGCAATAT
GTCGTTTCCG ACCCGGCAAC GAAACGCTGC GCCATCATCG ACCCGGTGCT CGATTTCGAC
GAGATGTCGG GGGCGACGGG AACGGCCAAT GCAGATGCCA TCCTCGCTCA TATCGAAAGC
GAAGGGCTGA CGGTCGAGTG GATCCTCGAC ACGCATCCGC ATGCCGATCA TTTCTCCGCC
GCGCATTATC TGCATGAGAA GACCGGCGCG CCGACGGCGA TCGGCGCCCA TGTCACCGAC
GTGCAGACGC TCTGGAAGGA GATCTACAAC TGGCCGGGGC TCGCGACCGA CGGCTCGCAA
TGGGACCGGC TGTTTGCCGA TGGCGACACG TTCGAGATCG GTGCGCTTAA AGCCCGCGTG
ATTTTTTCGC CCGGGCACAC ACTCGCCTCG ATCACCTATG TGATCGGTGA CGCCGCCTTT
GTGCACGACA CGGTGTTCAC GCCGGATTCC GGCACGGCGC GCACGGATTT CCCGGGCGGC
AGCGCTGCCG CCCTCTGGCA CTCGATCCAG GCCATCCTGT CGCTGCCCGA GGAGACCCGT
CTCTTTTCCG GCCACGATTA CCAGCCCGGC GGCCGGCACC CGCGCTGGGA AAGCACGGTG
GAGGCACAGA AGCGCGCCAA TCCGCATATT GCAGGCATCG ACGAGGCCGG CTTCGTGGCG
CTGCGCCAGG CGCGCGATCG CACGCTGCCC AAGCCCAAGC TGATGCTGCA CGCGCTGCAG
GTGAATATCC GCGGCGGGCG GCTGCCCGAG CCGGAGGGGA ATGGCAGGCG GTATCTGAAG
ATACCGCTGG ATGCATTGTA G
 
Protein sequence
MTSVKVNELI SVAGQPDAAG FAAFAADGFA AVINARPDGE EPGQPGNTAE KASAAAAGLA 
YSFVPVKGTE ITEADICAFQ TAMAEAKGPV VAHCKSGTRA LTLYALGEVL DGRMKPGDVE
AFGQNLGFDL AGARRWLEKR SGQVADVKAF FEPRTCSVQY VVSDPATKRC AIIDPVLDFD
EMSGATGTAN ADAILAHIES EGLTVEWILD THPHADHFSA AHYLHEKTGA PTAIGAHVTD
VQTLWKEIYN WPGLATDGSQ WDRLFADGDT FEIGALKARV IFSPGHTLAS ITYVIGDAAF
VHDTVFTPDS GTARTDFPGG SAAALWHSIQ AILSLPEETR LFSGHDYQPG GRHPRWESTV
EAQKRANPHI AGIDEAGFVA LRQARDRTLP KPKLMLHALQ VNIRGGRLPE PEGNGRRYLK
IPLDAL