Gene Rleg_3887 details

Gene Information

Locus tag: Rleg_3887
Symbol:
ID: 8014707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Rhizobium leguminosarum bv. trifolii WSM1325
Kingdom: Bacteria
Replicon accession: NC_012850
Strand:
Start bp: 3955888
End bp: 3957198
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 63%
IMG OID: 644826457
Product: protein of unknown function DUF21
Protein accession: YP_002977669
Protein GI: 241206573
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4536] Putative Mg2+ and Co2+ transporter CorB
TIGRFAM ID:
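
The coordinates and lengths listed here are mutually consistent if the start and end positions are read as inclusive: End - Start + 1 = 1311 bp, and 1311 / 3 = 437 codons, one of which is the stop codon, leaving 436 aa. A minimal Python sketch of that check follows; the function names are hypothetical and not part of any database API, and the inclusive-coordinate reading is an assumption inferred from the numbers.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: total codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(3955888, 3957198)
print(length_bp, protein_length(length_bp))
```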


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0724448
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.00619914
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCGGTCG AAGGCGCTCT GGCATTTCTT TCGACATATT GGCCGGAGAT CCTTTCGATC 
ACGGCGCTCG TGCTCATGTC CGCCTTCTTT TCCGGCTCCG AGACCGCGCT GACCGCCGTT
TCGCGCAGCC GTATCCATAC GCTCGAGGTC AACGGCGACG AACGCGCCGG CCTCGTCCGG
CAGTTGATCG AACGGCGCGA CCGGCTGATC GGTGCGCTGC TCATCGGCAA CAATCTCGCC
AATATCCTGT CCTCCTCGAT CGCCACCAGC CTCTTCCTCG GGCTGTTCGG CAGTTCCGGC
GTGGCGCTGG CGACGCTCGC GATGACCGTC ATCCTGGTGA TCTTCGCGGA AGTGCTTCCG
AAGAGCTGGG CGATTTCGGC GCCTGAGCGC TTCGCACTCG CCATCGCGCT GCCGGCCAGG
CTGTTCGTTG CCGTCGTCGG CCCGGTTTCC TCCTTCGTCA ATGCGATCGT GCGGCAGATT
CTTTCGCTGT TCGGCATCAA TCTCTCACGA GAGACATCGA TGCTGACGGC GCATGAGGAA
CTGCGCGGTG CCGTCGATCT GCTGCACCGC GAGGGATCGG TGGTGAAGGC CGACCGCGAC
CGCCTCGGCG GCGTGCTCGA TCTTAGCGAG CTCGAACTGT CCGACATCAT GGTCCACCGC
ACCGCGATGC GGGCGATCAA CGCCGACGAT GCGCCGGAAG CGGTGGTGCG GGTTATCCTC
GAAAGCCCCT ATACGCGCAT GCCGCTGTGG CGTGGCACGA TCGACAACAT CATCGGCGTC
GTCCATGCCA AGGATCTGCT GCGGGCGCTT GCCGAGCCGA ACATGGAGCC GCAGAACCTC
GATATCGTGA AGATCGCGCA GAAGCCGTGG TTCGTGCCCG ACAGCACCAA CCTCGAGGAC
CAGCTCAACG CCTTCCTGCG GCGCAAGCAG CATTTCGCCG TCGTCGTCGA CGAATATGGC
GAGGTGCAGG GCATCGTCAC GCTGGAAGAT ATTCTCGAGG AAATCGTCGG CGACATTTCC
GACGAACACG ATATCGAAAT ACAGGGCGTG CGTCAGGAGG CTGACGGCTC CGTCGTCGTC
GACGGCGGCG TTCCGATCCG CGACCTGAAC CGCGCGCTCG ACTGGAACCT GCCCGATGAG
GAGGCGACGA CGATCGCCGG CCTCGTTATC CACGAATCGA TGACCATCCC GGAAGAGCGC
CAAGCCTTCA CCTTCTACGG CAAGCGTTTC GTCGTCATGA AGCGGGAGAA GAACCGCATC
ACCAAGCTGC GCATCCGCCC GGCCGGAGAA GACGGCGCAA AGCCAGCCTG A
 
Protein sequence
MSVEGALAFL STYWPEILSI TALVLMSAFF SGSETALTAV SRSRIHTLEV NGDERAGLVR 
QLIERRDRLI GALLIGNNLA NILSSSIATS LFLGLFGSSG VALATLAMTV ILVIFAEVLP
KSWAISAPER FALAIALPAR LFVAVVGPVS SFVNAIVRQI LSLFGINLSR ETSMLTAHEE
LRGAVDLLHR EGSVVKADRD RLGGVLDLSE LELSDIMVHR TAMRAINADD APEAVVRVIL
ESPYTRMPLW RGTIDNIIGV VHAKDLLRAL AEPNMEPQNL DIVKIAQKPW FVPDSTNLED
QLNAFLRRKQ HFAVVVDEYG EVQGIVTLED ILEEIVGDIS DEHDIEIQGV RQEADGSVVV
DGGVPIRDLN RALDWNLPDE EATTIAGLVI HESMTIPEER QAFTFYGKRF VVMKREKNRI
TKLRIRPAGE DGAKPA
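
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The Python sketch below is one way to do that, under two assumptions: the codon-to-amino-acid assignments of translation table 11 match the standard code (the tables differ in which codons may act as start codons, which does not matter here because the gene begins with ATG), and the sequence is in frame from its first base. The names CODON_TABLE and translate are hypothetical helpers, not taken from the source database.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids
# ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the listing) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGTCGGTCG AAGGCGCTCT GGCATTTCTT"))  # MSVEGALAFL
```

Passing the full gene sequence to translate should give the 436-residue protein listed above, which makes for a quick consistency check between the two sequences.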