Gene Rleg_4005 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4005 
Symbol 
ID8014814 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4082512 
End bp4083702 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content63% 
IMG OID644826574 
Productaromatic amino acid aminotransferase 
Protein accessionYP_002977785 
Protein GI241206689 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1448] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.143022 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.2013 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGATG ATCTGATAAT GCCGCCAGCC GACAAGATCC TGTCGTTGAT GCCGATTTTC 
CGGCAGGACA GTCGTTCGAA CAAGATCGAT CTCGGCGTCG GAGTCTACCG GGACGCCTCC
GGTACGACGC CGATCCCGCG GGCGGTGAGG GAGGCAGAAA AGCGAATCCA TACCGCGCAG
ACGACCAAAG CCTATGTCGG CCCGGCCGGA GATCCTGTTT TCTGCGATCT CATCGGCAGG
CTTGTCTTCG GCGAAGCCGC GCCGTGGGAG CGAATTCGCG GCATCCAGAC GCCGGGCGGA
GCAGGCGCCT TGACGGTGCT CGCCGGCCTG ATCTCCCTGG CGCGCCCGGG TGCTGCGGTC
CATGTGCCCG ACCCGACCTG GGTGAACCAT GTGTCGATCC TCGAAGACAA CCGGCTTCGG
GTCGTCACTT ACCCTTACCT CGATCGCCGA ACAGGCGAGG TGGATTTCGA CGCCCTGCTC
GATCATTTCT CACGGTCGGA GCGGGGCGAC ATCGTGTTGC TGCACGGCTG CTGCCACAAT
CCGACCGGCG CCGACCCGAG CCGTTCGCAA TGGCAGGCGC TGGCAGAGAT CATCGCCGAG
CGCGGGCTCG TTCCGCTGGT CGATATCGCT TATCAGGGGT TTGGCGAGGG TCTCGAGGAC
GATGCCTTCG TGGTACGGCT GCTCACCGGC ATGGTTCCGG AAATGCTCGT CTCCTCGTCA
TGCTCGAAGA ATTTCGGAAT CTATCGCGAG CGTACGGGTG CCGCATTCAT TCTCGCCGCG
AACGCGGATC GGGCGGATGC AGCCAAGGCG CAACTCACAG TGCGAGCCCG TCTCGTCTAT
TCGATGCCGC CGGATCATGG CGCGGCTATC GTTCGCACCG TCCTGGAAGA CCCGGCGCTT
TCGGCCGACT GGCGCGCCGA ACTGGACGAT ATGCGCTCCA GCATTCTGTC GCTGCGCCAG
GGGCTTGCTG CCTCGTTCCG GCGTTTCACC AATGGCAGCG ACTACGATTT CCTCGCCAAG
AACAAAGGCA TGATTTCGCT GATCGGCCTG ACACCCGGAG AAGCCGTGAT GCTGCGCGAG
CAGCACGCGA TCTACATCGT CGAGGACGGA CGCATCAATG TCGCCGGGCT GCAGGCCAGC
CAGATCGACA CCTTTGCGGA AGCCGTTCTG GCAGTTCGCG GGAAACGCTG A
 
Protein sequence
MFDDLIMPPA DKILSLMPIF RQDSRSNKID LGVGVYRDAS GTTPIPRAVR EAEKRIHTAQ 
TTKAYVGPAG DPVFCDLIGR LVFGEAAPWE RIRGIQTPGG AGALTVLAGL ISLARPGAAV
HVPDPTWVNH VSILEDNRLR VVTYPYLDRR TGEVDFDALL DHFSRSERGD IVLLHGCCHN
PTGADPSRSQ WQALAEIIAE RGLVPLVDIA YQGFGEGLED DAFVVRLLTG MVPEMLVSSS
CSKNFGIYRE RTGAAFILAA NADRADAAKA QLTVRARLVY SMPPDHGAAI VRTVLEDPAL
SADWRAELDD MRSSILSLRQ GLAASFRRFT NGSDYDFLAK NKGMISLIGL TPGEAVMLRE
QHAIYIVEDG RINVAGLQAS QIDTFAEAVL AVRGKR