Gene Rleg_4040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRleg_4040 
Symbol 
ID8014845 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhizobium leguminosarum bv. trifolii WSM1325 
KingdomBacteria 
Replicon accessionNC_012850 
Strand
Start bp4118856 
End bp4120079 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content60% 
IMG OID644826609 
Productargininosuccinate synthase 
Protein accessionYP_002977820 
Protein GI241206724 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCAT ACAAAGACGT GAAGAAAGTC GTTCTCGCCT ATTCCGGCGG CCTCGACACC 
TCGATCATCC TGAAGTGGCT GCAGACGGAA CTCGGCGCCG AAGTCGTCAC CTTCACCGCC
GATCTCGGCC AGGGCGAAGA GCTGGAGCCG GCGCGCAAGA AGGCCGAAAT GCTCGGCATC
AAGGAGATCT ATATCGAGGA TGTGCGCGAG GAATTCGTAC GCGATTTCGT CTTCCCGATG
TTCCGCGCCA ATGCCGTCTA CGAAGGCGTC TACCTGCTCG GCACCTCGAT CGCCCGTCCG
TTGATTTCCA AGCATCTGAT CGATATCGCC AAGAAGACCG GCGCCGATGC GATCGCCCAC
GGCGCGACCG GCAAGGGCAA CGACCAGGTC CGGTTCGAGC TCTCCGCCTA TGCCCTGAAC
CCCGACATCA AGATCATCGC GCCGTGGCGC GACTGGGCGT TCAAGAGCCG CACCGACCTG
CTGGCTTTCG CCGAGCAGCA TCAGATCCCT GTTGCCAAGG ACAAAAAGGG CGAGGCGCCA
TTCTCCGTCG ACGCCAACCT TCTGCATTCC TCTTCCGAGG GCAAGGTTCT CGAGGACCCC
TCCAAGGAGG CGCCTGAATA TGTGCACATG CGCACCATTT CGCCTGAGGC TGCACCCGAC
AAGGCAACGA CCATCAAGGT CGGCTTCGAA AAGGGTGATG CGGTTTCGAT CAACGGCGTG
CGCATGAGCC CGGCGACGCT CTTGGCTGCG CTCAACAATT ACGGACGAGA CAACGGCATC
GGTCGTCTCG ACCTCGTCGA GAACCGTTTT GTCGGCATGA AGTCGCGCGG CGTCTACGAG
ACCCCAGGCG GCACCATCCT GCTTTCGGCG CACCGCGCCA TTGAATCGAT CACGCTCGAC
CGCGGTGCCG CCCATCTCAA GGACGACATC ATGCCGCGTT ACGCCGAGCT GATCTATTAC
GGCTTCTGGT TTTCGCCGGA GCGCGAGATG CTGCAGGCGC TGATCGACAA GAGCCAGGAG
CATGTCGAAG GCGAAGTGAC GCTGAAGCTC TACAAGGGCA ATGTCATGGT CATCGGCCGT
GAAAGCGACA AGTCGCTCTA TTCCGACAAG CTCGTCACTT TCGAGGATGA CCAGGGCGCC
TACGACCAGA AGGATGCGGC CGGCTTCATC AAGCTCAACG CGCTGCGCCT GCGCACGCTC
GCCAAGCGCA ATCTCGTGAA GTAA
 
Protein sequence
MASYKDVKKV VLAYSGGLDT SIILKWLQTE LGAEVVTFTA DLGQGEELEP ARKKAEMLGI 
KEIYIEDVRE EFVRDFVFPM FRANAVYEGV YLLGTSIARP LISKHLIDIA KKTGADAIAH
GATGKGNDQV RFELSAYALN PDIKIIAPWR DWAFKSRTDL LAFAEQHQIP VAKDKKGEAP
FSVDANLLHS SSEGKVLEDP SKEAPEYVHM RTISPEAAPD KATTIKVGFE KGDAVSINGV
RMSPATLLAA LNNYGRDNGI GRLDLVENRF VGMKSRGVYE TPGGTILLSA HRAIESITLD
RGAAHLKDDI MPRYAELIYY GFWFSPEREM LQALIDKSQE HVEGEVTLKL YKGNVMVIGR
ESDKSLYSDK LVTFEDDQGA YDQKDAAGFI KLNALRLRTL AKRNLVK