Gene Information |
Locus tag | Rleg_4001 |
Symbol | |
ID | 8014810 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 4078974 |
End bp | 4080041 |
Gene Length | 1068 bp |
Protein Length | 355 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 644826570 |
Product | oligopeptide/dipeptide ABC transporter, ATPase subunit |
Protein accession | YP_002977781 |
Protein GI | 241206685 |
COG category | [E] Amino acid transport and metabolism; [P] Inorganic ion transport and metabolism |
COG ID | [COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component |
TIGRFAM ID | |
|
|
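The coordinate and length fields above are consistent with each other, which a few lines of Python can confirm. This is a minimal sketch: the variable names are illustrative, and it assumes the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon.

```python
# Values copied from the Gene Information fields above.
start_bp = 4078974
end_bp = 4080041

# Assumption: 1-based inclusive coordinates, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as a residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1068 355
```

Both results match the Gene Length (1068 bp) and Protein Length (355 aa) fields.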
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.345464 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 0.222478 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATGACG AGGCTTCCGG CCAAGCCGTC GATACAGGTG CCGCAACGCA TGCCTTCGCT GCTTTCCTGC GCGACGACGC GCTTTTGAAG GTACGCGATC TCTCAGTCAG ATACCGGCGC GGCGGCAAGA TCTTTGTTGC GGTCGACGGC GTCTCCTTCG ACGTGGCTCC TGGGGAAACA CTCGGTCTCG TCGGAGAGTC GGGCAGCGGC AAGACGACCA TCGGCCGTGC CCTTCTGAAG CTTCTGCCGA AGGCGGACAC GCGTGTCGAC GGCCACGTCG AATATGACGG CCTGAATGTT GCCGATCTGT CTGCTTCCGA TCTTCGCGCC ATCCGCAGCA AGCTGCAGAT GATCTTCCAG GATCCGATTT CTTCCTTCAA TCCGCGCCGC AAGGTGCAGG ATATCGTCGG AGAAGGCCTC GAGATCCAGG GCATCCACAA AGCGGAAAGA CTGGAAAGGG TCGATCGTGC GCTCAACGAT GTCGGCATGA GCCGGACGAT GGTGGAGGGC CGCAGGCCGC ATCAGTTTTC CGGCGGCCAG TGCCAGCGTA TTGCGATCGC GCGGGCGCTT GCCGTTGGGC CGGAGCTGAT CGTCTGCGAC GAGCCGGTTG CATCTCTCGA CGTCTCGGTG CAGGCGCATG TGATCAATCT TCTGCAGGAT ATCCGCCAGA AGCGAAACCT GGCGCTGATC TTCATTTCCC ACGATCTCGC CGTCGTGCGC AATGTCAGCG ACAGGGTCGC GGTCCTCTAC ATGGGCAGGA TCGTCGAGAT CGGAACCGGC GATGCCATCT ATCAGCGTCC GGCGCATCCC TATACCCGCA TGCTGCTGGA AGCAGTCCCG GTTCCCGATG CCAGCCGGAA GATCGTGCCG AGCACAACGC CGACGCAGGC CTTGTCGCGG AGCGCGCCGC CGTCCGGATG CCGGTTCCGC CTGCGTTGCC CACGCGCTCA GGCGGTTTGT GCGGAACAGG AACCGAAGCT TGCATCGATG CCCCACGGCC AATTCGCGGC CTGTCATTTC CCTCACGACG AACCGGCGCC CGGAATGAAG ACGGCGGAAC AAGCGTGA
|
Protein sequence | MNDEASGQAV DTGAATHAFA AFLRDDALLK VRDLSVRYRR GGKIFVAVDG VSFDVAPGET LGLVGESGSG KTTIGRALLK LLPKADTRVD GHVEYDGLNV ADLSASDLRA IRSKLQMIFQ DPISSFNPRR KVQDIVGEGL EIQGIHKAER LERVDRALND VGMSRTMVEG RRPHQFSGGQ CQRIAIARAL AVGPELIVCD EPVASLDVSV QAHVINLLQD IRQKRNLALI FISHDLAVVR NVSDRVAVLY MGRIVEIGTG DAIYQRPAHP YTRMLLEAVP VPDASRKIVP STTPTQALSR SAPPSGCRFR LRCPRAQAVC AEQEPKLASM PHGQFAACHF PHDEPAPGMK TAEQA
|
| |
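The protein sequence can be derived from the gene sequence with translation table 11. The sketch below is illustrative only, not the database's own pipeline: the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. It relies on the fact that table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons. Alternative start codons are not handled here, which does not matter for this gene because it begins with ATG.

```python
# Codon table in NCBI order (bases T, C, A, G); amino acid assignments
# are identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-character blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGAATGACG AGGCTTCCGG CCAAGCCGTC"))  # MNDEASGQAV
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field, and `gc_fraction` can be compared against the reported GC content.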