Gene Rleg_4001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4001 
Symbol 
ID8014810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4078974 
End bp4080041 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content62% 
IMG OID644826570 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_002977781 
Protein GI241206685 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.345464 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.222478 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATGACG AGGCTTCCGG CCAAGCCGTC GATACAGGTG CCGCAACGCA TGCCTTCGCT 
GCTTTCCTGC GCGACGACGC GCTTTTGAAG GTACGCGATC TCTCAGTCAG ATACCGGCGC
GGCGGCAAGA TCTTTGTTGC GGTCGACGGC GTCTCCTTCG ACGTGGCTCC TGGGGAAACA
CTCGGTCTCG TCGGAGAGTC GGGCAGCGGC AAGACGACCA TCGGCCGTGC CCTTCTGAAG
CTTCTGCCGA AGGCGGACAC GCGTGTCGAC GGCCACGTCG AATATGACGG CCTGAATGTT
GCCGATCTGT CTGCTTCCGA TCTTCGCGCC ATCCGCAGCA AGCTGCAGAT GATCTTCCAG
GATCCGATTT CTTCCTTCAA TCCGCGCCGC AAGGTGCAGG ATATCGTCGG AGAAGGCCTC
GAGATCCAGG GCATCCACAA AGCGGAAAGA CTGGAAAGGG TCGATCGTGC GCTCAACGAT
GTCGGCATGA GCCGGACGAT GGTGGAGGGC CGCAGGCCGC ATCAGTTTTC CGGCGGCCAG
TGCCAGCGTA TTGCGATCGC GCGGGCGCTT GCCGTTGGGC CGGAGCTGAT CGTCTGCGAC
GAGCCGGTTG CATCTCTCGA CGTCTCGGTG CAGGCGCATG TGATCAATCT TCTGCAGGAT
ATCCGCCAGA AGCGAAACCT GGCGCTGATC TTCATTTCCC ACGATCTCGC CGTCGTGCGC
AATGTCAGCG ACAGGGTCGC GGTCCTCTAC ATGGGCAGGA TCGTCGAGAT CGGAACCGGC
GATGCCATCT ATCAGCGTCC GGCGCATCCC TATACCCGCA TGCTGCTGGA AGCAGTCCCG
GTTCCCGATG CCAGCCGGAA GATCGTGCCG AGCACAACGC CGACGCAGGC CTTGTCGCGG
AGCGCGCCGC CGTCCGGATG CCGGTTCCGC CTGCGTTGCC CACGCGCTCA GGCGGTTTGT
GCGGAACAGG AACCGAAGCT TGCATCGATG CCCCACGGCC AATTCGCGGC CTGTCATTTC
CCTCACGACG AACCGGCGCC CGGAATGAAG ACGGCGGAAC AAGCGTGA
 
Protein sequence
MNDEASGQAV DTGAATHAFA AFLRDDALLK VRDLSVRYRR GGKIFVAVDG VSFDVAPGET 
LGLVGESGSG KTTIGRALLK LLPKADTRVD GHVEYDGLNV ADLSASDLRA IRSKLQMIFQ
DPISSFNPRR KVQDIVGEGL EIQGIHKAER LERVDRALND VGMSRTMVEG RRPHQFSGGQ
CQRIAIARAL AVGPELIVCD EPVASLDVSV QAHVINLLQD IRQKRNLALI FISHDLAVVR
NVSDRVAVLY MGRIVEIGTG DAIYQRPAHP YTRMLLEAVP VPDASRKIVP STTPTQALSR
SAPPSGCRFR LRCPRAQAVC AEQEPKLASM PHGQFAACHF PHDEPAPGMK TAEQA