Gene Rleg_6407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_6407 
Symbol 
ID8016906 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012854 
Strand
Start bp119806 
End bp120831 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content62% 
IMG OID644828202 
Producthypothetical protein 
Protein accessionYP_002979402 
Protein GI241554189 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00664448 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.306006 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACATTGG CAAAGACATT CAAAATTGCG TTGTTCAGTC TGCTTGGTCT CAGTGCCATT 
CACGGTCCGG CGGCAGCCGA GGCAGACGCT CCCCTGCTTC CCAAAGTGCC GCCGGGCACG
GTGCTGACGA TCGGCGATCC GGTCACGCAG AAAGCGCTCG AAGTGTCGGG CCTCGGCAAG
GAGCTCAGTT TCGAGGTCAA ATGGGCCAAT ATCAGCGGCG GGCCGCAGAC GTCGGAAGCT
TTCCGCGCCC ATGCACTCGA TGTCGGGTCG GTGGCCGAGA TCCCTTCGAT CTTCGCCAAC
TGGAACAATC TGCCCGTCCG CAACATCGCC TATCGCGAAC GCCGCGACCC GATCGCCAAT
CCCATCTACC GCTTCGGCAT CGCGCCAGGT GCGGCCGTGA AGACGCTCGC GGACTTCCGC
GGCAAGCGCA TCGCCTTCAG CCCGGGTCAG GCACAGGGCA CGCTCGTTCT CCGCGCCCTC
CGTGCCGCAG GCTTGAAGAG CGGCGACGTC ACTCTGGTGG AACTGCCGAG TACCAGTGAC
GTCTACCCGA AGGCACTGGC GAGCAAGCAG GTCGACATCG CACCGCTCGG CGGCGTCTAC
ATCAGGCGCT ACATCACCCA ATACGGACCC GATGGCGCGA CCCTCGTGGA ACATGGCCTG
CGGGATGATC CGAGCCATCT TTATGCGCCG CAATGGGTTC TGGATGATCC CGCCAAGGCG
GCAGCTCTCG CCGAATACGT CGGCCTGTGG GCACGCGCCA CCGAATGGGT GAACCGGAAC
CCGGACCTCT GGATCAAGGA ATATTACGTC GGTCAGCAGG GGTTGAGCCA GGAGGACGGG
GAATATCTCG TCAAGTTGAC CGGCGAACAG GTCGTTCCCA GCGATTGGAG CGAAGTGAAG
AAGCGCCACC AGGAAACCAT CGATCTACTG GCAGACGAGC TCGGCAACAA GCCCCTCAAT
GTGGAGCAGA TTTTCGACAA TCGCTTCGAA AAGCTCGGCG CCGCGGCCTT GGCCAAGAGC
CAGTAA
 
Protein sequence
MTLAKTFKIA LFSLLGLSAI HGPAAAEADA PLLPKVPPGT VLTIGDPVTQ KALEVSGLGK 
ELSFEVKWAN ISGGPQTSEA FRAHALDVGS VAEIPSIFAN WNNLPVRNIA YRERRDPIAN
PIYRFGIAPG AAVKTLADFR GKRIAFSPGQ AQGTLVLRAL RAAGLKSGDV TLVELPSTSD
VYPKALASKQ VDIAPLGGVY IRRYITQYGP DGATLVEHGL RDDPSHLYAP QWVLDDPAKA
AALAEYVGLW ARATEWVNRN PDLWIKEYYV GQQGLSQEDG EYLVKLTGEQ VVPSDWSEVK
KRHQETIDLL ADELGNKPLN VEQIFDNRFE KLGAAALAKS Q