Gene Rleg2_1901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg2_1901 
Symbol 
ID6980640 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM2304 
KingdomBacteria 
Replicon accessionNC_011369 
Strand
Start bp1945208 
End bp1946374 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content66% 
IMG OID643396624 
Productaminotransferase class V 
Protein accessionYP_002281412 
Protein GI209549495 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.00391908 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGCCGC CACGCCTTTA TCTCGACTGG AATGCCACAG CGCCGCTGCA CCCTGCAGCA 
CGCGAGGCGA TCATGCGCGC CATCGACATA TTCGGCAATC CGAATTCCGT TCACGGCGAA
GGCCGTGCCG CCCGCGCCGC AATCGAATGT GCACGGCGCA AAGTGGCGGC GCTGGTCGGC
ACCGACGCCG GCAATGTGAT CTTTACCAGC GGCGCCACCG AGGCCGCCAA TCTGGTGCTG
ACACCGGATT TCCGCATGGG CCGCACGCCG CTTCAGCTCG GCCGCCTCTA CTTCTCGGCA
ATCGAGCATC CGGCGGTGCG CGAAGGCGGC CGCTTCGCCA GAGAGAAGAT GACCGAGATC
CCGGTCACGT CAGACGGCAT CGTCGATCTC GATGCGCTTG GTCTGCTGCT TGATGCACAT
GACAAGGCCG CCGGCCTGCC GATGGTCGCC ATCATGCTCG TCAACAACGA GACCGGCATC
GTCCAGCCTG TGGAGGCGGC GGCAAAGATC GTCCACGCTC ATGGCGGGCT CTTCGTCGTC
GATGCCGTTC AGGCGGCCGG CCGCATAGGG CTCGACATCG GCAGGATCGG CGCCGATTTC
ATGATCGTCT CCTCGCACAA GATCGGCGGG CCGAAGGGTG CCGGCGCGTT GATTGCCCGC
GGCGAGGCGC TGATGCCGCG GCCACTGATC CAGGGTGGCG GCCAGGAGCG GGGTCACCGG
TCGGGGACAC AGAATTCACT GGCGCTGATC GGCTTCGGCG CGGCGACGGA AGCTGCATCC
GACGAGCTCG AGGCACGCAA TGCGGCAATC GGCGCGTTGC GCGAGCGGCT GGAAGCCGGC
ATGCGTCAGG CGGCAACCGA TGTGGTGATC CATGGCGAAG GCGGCGAACG TGTCGCCAAC
ACGATCTTCT TCACTTTGCC TGGGTTGAAG GCCGAGACTG GGCAGATCGC ATTCGATCTC
GAAGGTGTAG CGCTTTCGGC GGGCTCAGCC TGCTCATCCG GCCGGCTCGG CGAAAGCCAT
GTGCTGACGG CGATGGGGCG CGACGCCAAG CTCGGGGGCT TGCGTATCTC GCTCGGCTTT
TCGACGACGG AAGAGGATAT CGACCGGGCG ATTGCCGCTT TTGCGAAGAT CGCCTGCCGG
CGCAGGTCGG CGGGCGAGGC GGCCTGA
 
Protein sequence
MAPPRLYLDW NATAPLHPAA REAIMRAIDI FGNPNSVHGE GRAARAAIEC ARRKVAALVG 
TDAGNVIFTS GATEAANLVL TPDFRMGRTP LQLGRLYFSA IEHPAVREGG RFAREKMTEI
PVTSDGIVDL DALGLLLDAH DKAAGLPMVA IMLVNNETGI VQPVEAAAKI VHAHGGLFVV
DAVQAAGRIG LDIGRIGADF MIVSSHKIGG PKGAGALIAR GEALMPRPLI QGGGQERGHR
SGTQNSLALI GFGAATEAAS DELEARNAAI GALRERLEAG MRQAATDVVI HGEGGERVAN
TIFFTLPGLK AETGQIAFDL EGVALSAGSA CSSGRLGESH VLTAMGRDAK LGGLRISLGF
STTEEDIDRA IAAFAKIACR RRSAGEAA