Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_6407 |
Symbol | |
ID | 8016906 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012854 |
Strand | - |
Start bp | 119806 |
End bp | 120831 |
Gene Length | 1026 bp |
Protein Length | 341 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644828202 |
Product | hypothetical protein |
Protein accession | YP_002979402 |
Protein GI | 241554189 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.00664448 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.306006 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTGG CAAAGACATT CAAAATTGCG TTGTTCAGTC TGCTTGGTCT CAGTGCCATT CACGGTCCGG CGGCAGCCGA GGCAGACGCT CCCCTGCTTC CCAAAGTGCC GCCGGGCACG GTGCTGACGA TCGGCGATCC GGTCACGCAG AAAGCGCTCG AAGTGTCGGG CCTCGGCAAG GAGCTCAGTT TCGAGGTCAA ATGGGCCAAT ATCAGCGGCG GGCCGCAGAC GTCGGAAGCT TTCCGCGCCC ATGCACTCGA TGTCGGGTCG GTGGCCGAGA TCCCTTCGAT CTTCGCCAAC TGGAACAATC TGCCCGTCCG CAACATCGCC TATCGCGAAC GCCGCGACCC GATCGCCAAT CCCATCTACC GCTTCGGCAT CGCGCCAGGT GCGGCCGTGA AGACGCTCGC GGACTTCCGC GGCAAGCGCA TCGCCTTCAG CCCGGGTCAG GCACAGGGCA CGCTCGTTCT CCGCGCCCTC CGTGCCGCAG GCTTGAAGAG CGGCGACGTC ACTCTGGTGG AACTGCCGAG TACCAGTGAC GTCTACCCGA AGGCACTGGC GAGCAAGCAG GTCGACATCG CACCGCTCGG CGGCGTCTAC ATCAGGCGCT ACATCACCCA ATACGGACCC GATGGCGCGA CCCTCGTGGA ACATGGCCTG CGGGATGATC CGAGCCATCT TTATGCGCCG CAATGGGTTC TGGATGATCC CGCCAAGGCG GCAGCTCTCG CCGAATACGT CGGCCTGTGG GCACGCGCCA CCGAATGGGT GAACCGGAAC CCGGACCTCT GGATCAAGGA ATATTACGTC GGTCAGCAGG GGTTGAGCCA GGAGGACGGG GAATATCTCG TCAAGTTGAC CGGCGAACAG GTCGTTCCCA GCGATTGGAG CGAAGTGAAG AAGCGCCACC AGGAAACCAT CGATCTACTG GCAGACGAGC TCGGCAACAA GCCCCTCAAT GTGGAGCAGA TTTTCGACAA TCGCTTCGAA AAGCTCGGCG CCGCGGCCTT GGCCAAGAGC CAGTAA
|
Protein sequence | MTLAKTFKIA LFSLLGLSAI HGPAAAEADA PLLPKVPPGT VLTIGDPVTQ KALEVSGLGK ELSFEVKWAN ISGGPQTSEA FRAHALDVGS VAEIPSIFAN WNNLPVRNIA YRERRDPIAN PIYRFGIAPG AAVKTLADFR GKRIAFSPGQ AQGTLVLRAL RAAGLKSGDV TLVELPSTSD VYPKALASKQ VDIAPLGGVY IRRYITQYGP DGATLVEHGL RDDPSHLYAP QWVLDDPAKA AALAEYVGLW ARATEWVNRN PDLWIKEYYV GQQGLSQEDG EYLVKLTGEQ VVPSDWSEVK KRHQETIDLL ADELGNKPLN VEQIFDNRFE KLGAAALAKS Q
|
| |