Gene Rleg2_6037 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_6037 
Symbol 
ID6977423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011366 
Strand
Start bp467073 
End bp468335 
Gene Length1263 bp 
Protein Length420 aa 
Translation table11 
GC content67% 
IMG OID643393489 
Productprotein of unknown function DUF181 
Protein accessionYP_002278307 
Protein GI209546417 
COG category[S] Function unknown 
COG ID[COG1944] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00702] uncharacterized domain
[TIGR03604] bacteriocin biosynthesis docking scaffold, SagD family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.463338 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.501056 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTTT TCACCGATCT TGAAACGCTT GCCGGTGACC TCAAACGATC ATCGGCCGGC 
GTCAGCGATT ATCACGACCG CGCCGTCACG CCGGCTCAGA CCTTGGCGGC GATCAGGCCG
CATCTGCGCG AATTCGGCAT CACGCGCGTC GGCCTGCTGA CCGCACTCGA CGTCTTGAAC
ATCCCCGTCG CCTTCGCGAC GCGGCCGAAC AGCCATACGC TCTCGGTCTT CCAGGGCAAA
GGCATCGACA ATGATGCCGC CATGACCTCG GCCGCCATGG AAGCGATCGA GACGCGGATC
GCCGAAATCC CGCCCGCCGA CCTGACGGAG GCGACCGTCG CCGGCATGCG GGCGGAAAAT
GCGGCCATGA TCGATCTCGA CAATGTCGCC CGCTGCGCTC CCGACGAGAT CGGCAGCGGA
CCCATTCCCT GGTGCTCCGG GCTCGACATC CTTTCCGGCA GCAGCGCCTT CGTGCCGTGG
TGGCTTGTCG GCCTCGACCA TCGCGGCGAA AGACCACCGG GTTTCGAGCA GTCGAGCGAT
GGGCTGGCCT CCGGCAACAC GCCATCCGAA GCCGTTCTGC ACGGGCTCTG CGAACTGGTG
GAGCGCGACG CCTGGGCCTT GACCCAGCTG AAATCGCCCG AGCGGCTGAA GGAGAGCCGC
ATCGATCCCG CCTCCTTCGG CGACGCAGTC ATCGATGTCA TGACCGACCG GATCGCGCGC
GCCGGCATGC GGCTGCTGCT CCTCGACATG ACCACCGATA TCGGCGTTCC CGCCTTTCTC
GCGGTCATCA TGCCCGGCAA CCTTTCCGAC CGTGTCGATG CACGCTGGGC CCATGTCTGC
GGCGGCTGCG GCTGCCATCC CGATCCCGTG CGCGCCGCGC TGCGCGCCAT CACCGAAGCG
GCGCAGAGCC GGCTGACCGC AATTGCCGGC AGCCGCGACG ATTTTTCGCC GCGCGTCTAT
CAGCGGCTCG ACCAGAGCGC GGCGATGCAG CAGGTGGTCG AACTTTGTGA GGGCGGCGGC
CGCATGCGCG CCTTCCAGCC GCGTCAGAGC CGCCCGGCGA CAATCCAGGA AACCATCGGC
CATATCGCCG ACCGGCTGGC TGCGACCGGC ATCGAGCAGA TCGTCGTCGT GCCGTTTGCG
CACCGGGCTC TGCCGGTCTC CGTCGTCAGG GTCATCGTGC CGGGCCTGGA GGTCGATATC
TCCGGCCAGT ACATCCAGCT CGGCATGCGG GCGGTCAACA CCATGAGGGG AGCCCAGTCA
TGA
 
Protein sequence
MSVFTDLETL AGDLKRSSAG VSDYHDRAVT PAQTLAAIRP HLREFGITRV GLLTALDVLN 
IPVAFATRPN SHTLSVFQGK GIDNDAAMTS AAMEAIETRI AEIPPADLTE ATVAGMRAEN
AAMIDLDNVA RCAPDEIGSG PIPWCSGLDI LSGSSAFVPW WLVGLDHRGE RPPGFEQSSD
GLASGNTPSE AVLHGLCELV ERDAWALTQL KSPERLKESR IDPASFGDAV IDVMTDRIAR
AGMRLLLLDM TTDIGVPAFL AVIMPGNLSD RVDARWAHVC GGCGCHPDPV RAALRAITEA
AQSRLTAIAG SRDDFSPRVY QRLDQSAAMQ QVVELCEGGG RMRAFQPRQS RPATIQETIG
HIADRLAATG IEQIVVVPFA HRALPVSVVR VIVPGLEVDI SGQYIQLGMR AVNTMRGAQS