Gene Rleg_3423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_3423 
Symbol 
ID8014296 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp3445152 
End bp3446138 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content58% 
IMG OID644825981 
Productputative sugar ABC transporter, substrate-binding protein 
Protein accessionYP_002977208 
Protein GI241206112 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1879] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.490297 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGAAGG CATTACTACT TACAGCCGCA GTTTTAGCAC TGACCGCCGG GCAGGCCTTG 
GCCAAGAAGC AACTCGTCAT CGTCGTCAAA GGCCTCGACA ATCCGTTCTT CGAAGCAATC
AATCAAGGTT GCCAGAAATG GAACAAGGAG AACCCCACGG CGGAATACGA ATGCTTCTAC
ACCGGTCCGG CATCGACATC CGATGAAGCC GGCGAAGCGC AGATCGTTCA GGATATGTTG
GGCAAGGCGG ACACGGCTGC AATCGCAATC TCGCCGTCAA ATGCAAAGCT GATCGCCCAG
ACCCTGAAGA CGGCCAATCC GACGGTTCCG GTGATGACGC TCGATGCCGA TCTCGCCGCC
GAGGATGCAG CGTTGCGCAA GACCTATCTC GGCACCGACA ACTATCTGAT GGGCGCCCGC
ATCGGCGAAT ACATCAAGAA GGGCAAGCCG AAGGGCGGCA AGATCTGCAC CATCGAAGGC
AATCCGGGAG CGGACAATAT TCTACGCCGC GCACAAGGAA TGCGCGACAC GCTTACTGGC
CAGAAAGGCT TGACCGAGCT GAAAGGCGAG GGTGGCTGGA CCGAGGTCGC CGGTTGCCCG
GTGTTCACCA ATGACGACGG TGCCAAGGGC GTTCAGGCAA TGACCGATAT CCTCGCGGCC
AATCCCGACT TGGACGCATT CGGCATTATG GGTGGCTGGC CATTGTTCGG TGCACCGCAA
CCCTATCGCG ACCTGTTCAA GCCGCTGGCC GACAAGATCG CCAGCAACGA TTTCGTCATT
GGTGCCGCCG ATACGATCGG CGACGAGGTC GCCATTGCGA AGGAAGGGCT GGTGACAGCG
CTCGTCGGCC AGCGGCCATT CGAGATGGGC TACAAGGCAC CGTCGGTGAT GATGGATCTG
ATTGCCGGCA AGCCGGTTGA AGATCCGGTC TTCACCGGGC TCGATGAGTG CACGAAGGAT
ACAGTCGATA CCTGCATTCA AAAGTAG
 
Protein sequence
MRKALLLTAA VLALTAGQAL AKKQLVIVVK GLDNPFFEAI NQGCQKWNKE NPTAEYECFY 
TGPASTSDEA GEAQIVQDML GKADTAAIAI SPSNAKLIAQ TLKTANPTVP VMTLDADLAA
EDAALRKTYL GTDNYLMGAR IGEYIKKGKP KGGKICTIEG NPGADNILRR AQGMRDTLTG
QKGLTELKGE GGWTEVAGCP VFTNDDGAKG VQAMTDILAA NPDLDAFGIM GGWPLFGAPQ
PYRDLFKPLA DKIASNDFVI GAADTIGDEV AIAKEGLVTA LVGQRPFEMG YKAPSVMMDL
IAGKPVEDPV FTGLDECTKD TVDTCIQK