Gene Rleg_4420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4420 
Symbol 
ID8015190 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4548967 
End bp4550355 
Gene Length1389 bp 
Protein Length462 aa 
Translation table11 
GC content63% 
IMG OID644826995 
Producthypothetical protein 
Protein accessionYP_002978197 
Protein GI241207101 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0624] Acetylornithine deacetylase/Succinyl-diaminopimelate desuccinylase and related deacylases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.33388 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGATG TACGTCAGGT GCTTGCGCGC GCGGATCAGA ACCTTCCATC GAGCCTCGAC 
AAGCTGTTCG AGCTTCTGCG GATCCAGTCG ATATCCACCG ATCCCGCTTT TAAAGCCGAA
TGCCGAAAGG CTGCGGAATG GCTCGTCGCT TATCTCGGGA CGCTCGGCTT TATGGCCTCG
GTGCGCGATA CGCCAGGCCA TCCGATGGTC GTCGCCCATC ATGCTGGCGC GTCCGCCGAT
GCGCCGCATG TTCTCTTCTA CGGCCATTAC GACGTTCAGC CGGTCGACCC AATCGAACTC
TGGGAAAACG ACCCCTTCGA GCCATCGATC AAGGATGTCG GCGAGGGCCG CAAGATCCTG
ACCGGCCGCG GCACCTCCGA CGACAAGGGC CAGTTGATGA CCTTCGTCGA GGCTTGCCGG
GCCTATAAGG AGATCAACGG CGCGCTTCCC TGCCGCGTCA CCATCCTCTT CGAGGGCGAG
GAGGAGTCCG GCTCGCCGTC CCTGAAGCCC TTCCTCGAAG CCAATGCCGC CGAGCTCAAG
GCCGATTATG CGCTGGTCTG CGATACCGGC ATGTGGGACC GCGACACGCC GGCGATCGCC
GCAGCGCTGC GCGGCCTCGT CGGCGAGGAA GTGGTCGTGA CGGCCGCCGA CCGCGACCTG
CATTCCGGTC TCTTCGGCGG CGCGGCCGCC AATCCGATCC ATATTCTCGT CGAGGCTCTT
GCCGGCCTGC ATGACGAGAC GGGCCGCATT ACGCTTGACG GTTTCTATGA AGGCGTCGAG
GAAACGCCTG AAAACATCAA GGCGTCATGG GAGACGCTCG GCAAGACCGC GGAGAGCTTC
CTCGGCGAAG TCGGCCTCTC CATTCCATCC GGCGAAAAGG GCCGTTCGGT GCTGGAACTC
ACCTGGGCGC GGCCGACCGC CGAAATCAAC GGCATCTGGG GCGGTTATAC CGGCGAAGGC
TTCAAGACGG TGATCGCCGC CAAGGCTTCG GCCAAGGTTT CGTTCCGTCT CGTCGGCACG
CAGGATCCGG CCGCCATTCG CGAGGCCTTC CGCAGCTATA TCAGCGCGAA GATTCCCGCC
GATTGCTCGG TCGAGTTCCA TCCGCATGGC GGCTCGCCGG CGATCCACCT CTCCTATGAT
TCACCTGTTC TGACCAAGGC CAAGAACGCG CTTTCCGACG AGTGGCCGAA ACCCGCCATT
GTCATCGGCA TGGGTGGGTC GATCCCGATC GTCGGAGATT TCCAGAAGAT GCTCGGCATG
GAATCGCTGC TCGTCGGCTT CGGCCTCAGC GATGATCGCA TCCACTCGCC GAACGAGAAA
TACGAACTCG TCTCCTACCA CAAGGGCATC CGCTCCTGGG TGCGGATTCT TCAGGCGCTC
GCCGCCTGA
 
Protein sequence
MTDVRQVLAR ADQNLPSSLD KLFELLRIQS ISTDPAFKAE CRKAAEWLVA YLGTLGFMAS 
VRDTPGHPMV VAHHAGASAD APHVLFYGHY DVQPVDPIEL WENDPFEPSI KDVGEGRKIL
TGRGTSDDKG QLMTFVEACR AYKEINGALP CRVTILFEGE EESGSPSLKP FLEANAAELK
ADYALVCDTG MWDRDTPAIA AALRGLVGEE VVVTAADRDL HSGLFGGAAA NPIHILVEAL
AGLHDETGRI TLDGFYEGVE ETPENIKASW ETLGKTAESF LGEVGLSIPS GEKGRSVLEL
TWARPTAEIN GIWGGYTGEG FKTVIAAKAS AKVSFRLVGT QDPAAIREAF RSYISAKIPA
DCSVEFHPHG GSPAIHLSYD SPVLTKAKNA LSDEWPKPAI VIGMGGSIPI VGDFQKMLGM
ESLLVGFGLS DDRIHSPNEK YELVSYHKGI RSWVRILQAL AA