Gene Rleg_0942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_0942 
Symbol 
ID8012087 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp928172 
End bp929446 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content56% 
IMG OID644823526 
Productprotein of unknown function DUF21 
Protein accessionYP_002974777 
Protein GI241203681 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.429612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTTCTGG AAATTGGAAT TGTGGCGTTT CTCACCATCC TGAATGGTGT GCTCGCCATG 
TCGGAGCTGG CCGTCGTGTA TTCTCGAACA GCTCGCCTAA AGGTCCTCTC CGACAATGGA
AGCAAGGGTG CAGCTCAAGC GATCAAACTT GCTGAAAACC CTGGTCGTTT TCTCTCAACG
GTGCAGATCG GCATCACGCT GGTCGGCGTT CTATCCGGCG CTTTCTCGGG GGCCACGCTC
GGCGGCCGCC TGAGCGGATG GCTAGAAGCC CAGGGAATGT CATCGACGGC CGCTGATGCC
ATTGGCGTAG GTTCAGTCGT CGTGGCAATC ACATATCTTT CGTTGATCGT CGGCGAACTT
GTTCCAAAGC AGATCGCATT GCGGGAACCC GAAGCGGTTG CGGCCAGGGT CGCACCCGCT
ATGGCGGTCC TTTCAAAAAT TGCGCTGCCA CTCGTGTGGC TTCTAAACGC CTCCGGAAAC
CTTGTGCTCA AACTCTTGGG CCAAACAGGA AAAGCTGGCG AAAATGTCTC TGACGCAGAA
ATCAAAACTG TTCTGGCCGA GGCGCAGTCG GCTGGAGTGA TCGAAAGCGA AGAGTCCGCG
ATGATATCAG GTGTCATGCG GCTGGCGGAT CGCACTGCCC GAGCGCTTAT GACGCCCCGA
CGGGACGTCG AAATTATTGA TATCGACGAC AGCCTTGATG AAATTCGGAC CCAGTTGCAC
AGGACGAAGC GGTCGCGGTT GCCCGTTCGA AAAGGCAGTT CGGACGAGGT GATCGGCATC
CTTCCGGTCA AGGACTTCTA CGACGCGATG TCCGAACACG GCAGCGCCGA CATCAAGGCT
CTGACGCAAG ACGTCCCGGT GGTTTCAGAC CTTTCAACTG CCATCAATGT TATTGAAGCC
ATCAGGAAAT CGCCCGTTCA CATGGTGCTG GTTTTCGACG AGTACGGCCA CTTCGAGGGG
GTTGTCTCGT CAGGTGACAT TTTGGAAGCA ATCATGGGGG CTCTGCAGGA GGGACCGGTC
GATGAACAGG CCATCGCTCG GCGAGACGAC GGCTCTTATC TCGTGTCGGG CTGGACGCCA
ATTGACGAGT TCGCTGAATT CTTAAACCTC AAGCTCGATG GCGACTTGGA ATATCAGACT
GTCGCCGGCC TGGTGTTGGA AGAGTTGAAA CATCTGCCGG AATTGGGCGA GAGCTTCACG
AGAGATGGAT GGCGCTTCGA AGTCGTCGAT CTCGACGGGA GGCGCGTGGA CAAAATACTT
GTGTCGGCTG AGTGA
 
Protein sequence
MFLEIGIVAF LTILNGVLAM SELAVVYSRT ARLKVLSDNG SKGAAQAIKL AENPGRFLST 
VQIGITLVGV LSGAFSGATL GGRLSGWLEA QGMSSTAADA IGVGSVVVAI TYLSLIVGEL
VPKQIALREP EAVAARVAPA MAVLSKIALP LVWLLNASGN LVLKLLGQTG KAGENVSDAE
IKTVLAEAQS AGVIESEESA MISGVMRLAD RTARALMTPR RDVEIIDIDD SLDEIRTQLH
RTKRSRLPVR KGSSDEVIGI LPVKDFYDAM SEHGSADIKA LTQDVPVVSD LSTAINVIEA
IRKSPVHMVL VFDEYGHFEG VVSSGDILEA IMGALQEGPV DEQAIARRDD GSYLVSGWTP
IDEFAEFLNL KLDGDLEYQT VAGLVLEELK HLPELGESFT RDGWRFEVVD LDGRRVDKIL
VSAE