Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Rleg_2965 |
Symbol | |
ID | 8015745 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Rhizobium leguminosarum bv. trifolii WSM1325 |
Kingdom | Bacteria |
Replicon accession | NC_012850 |
Strand | + |
Start bp | 2956944 |
End bp | 2958155 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644825535 |
Product | carbamoyl phosphate synthase small subunit |
Protein accession | YP_002976763 |
Protein GI | 241205667 |
COG category | [E] Amino acid transport and metabolism [F] Nucleotide transport and metabolism |
COG ID | [COG0505] Carbamoylphosphate synthase small subunit |
TIGRFAM ID | [TIGR01368] carbamoyl-phosphate synthase, small subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0448128 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGATGA CCGCGACAGC ACCCTGGACA ATCGAAAAGC CGACCGCCCT GCTCGTTCTT GCCGACGGCA CGGTCATCGA AGGCAAGGGC ATCGGCGCCA CCGGCAAGGT GCCGGCCGAG GTGGTCTTCA ACACGGCGCT TACCGGCTAT GAGGAGATCC TGACGGACCC CTCCTATCTC GGCCAGATCG TCACCTTCAC CTTCCCGCAT ATCGGCAATA TCGGCACCAA TGATGAAGAC ATCGAGGACC TGACGCCTGC CGCCCGCCAC GGCGCCGTCG GCGTCATCTT CAAGGCCGAC ATCACTGAGC CCTCGAACTA CCGCGCCGCC AAGCACCTCG ACCAATGGCT GAAGGCCCGC GGCGTCATCG GTCTCTGCGG CATCGACACG CGCGCGCTGA CCGCCTGGAT CCGTGAGAAC GGCGCTCCGA ACGCGGTCAT CGCCCACGAT CCGAACGGTG TCTTCGACAT CGAGACGCTG AAGGCCGAGG CTAAAGCCTG GAGCGGCCTT GAAGGTCTCG ACCTCGCCAA GATCGCCTCG TCGGGCCAGT CCTCGCAATG GACCGAAACG CCGTGGGTGT GGAATGAAGG TTACGGTGAA CTCAAGGCGA CAGACGCGAA ATACCACGTC GTCTGCCTCG ATTACGGCGT CAAGCGCAAC ATCCTGCGCC TGTTTGCCGG CCTCGACTGC AAAGTGACTG TCGTGCCGGC CGCAACGAGC GCCGAAGACG TGCTCGCCAT GCAGCCGGAC GGCATCTTCC TGTCGAACGG CCCGGGCGAT CCGGCGGCAA CCGGCGACTA TGCCGTGCCT GTGATCAAGA CACTGATCAA GACCGATATC CCGGTCTTCG GCATCTGCCT CGGCCACCAG ATGCTGGGCC TGGCACTCGG CGCGAAGACC GAGAAGATGC ATCAGGGCCA TCACGGCGCC AACCACCCCG TCAAGGACCA TACGACCGGC AAGGTCGAGA TCGTCTCGAT GAACCACGGC TTCGCGGTCG ACTCGAAGTC GCTGCCGGAT GGCGTTGAAG AGACTCATAT TTCGCTTTTC GACGGCACCA ATTGCGGCCT GCGCGTCCTC GGCAAGCAGG TCTTCTCCGT CCAGCACCAT CCGGAAGCCT CTCCCGGTCC GCAGGACAGC CACTACCTCT TCCGCCGCTT CATCAACATG GTGCGCGAGA AGAAGGGCGA ACCGGCGCTC GCCGAACGCT GA
|
Protein sequence | MKMTATAPWT IEKPTALLVL ADGTVIEGKG IGATGKVPAE VVFNTALTGY EEILTDPSYL GQIVTFTFPH IGNIGTNDED IEDLTPAARH GAVGVIFKAD ITEPSNYRAA KHLDQWLKAR GVIGLCGIDT RALTAWIREN GAPNAVIAHD PNGVFDIETL KAEAKAWSGL EGLDLAKIAS SGQSSQWTET PWVWNEGYGE LKATDAKYHV VCLDYGVKRN ILRLFAGLDC KVTVVPAATS AEDVLAMQPD GIFLSNGPGD PAATGDYAVP VIKTLIKTDI PVFGICLGHQ MLGLALGAKT EKMHQGHHGA NHPVKDHTTG KVEIVSMNHG FAVDSKSLPD GVEETHISLF DGTNCGLRVL GKQVFSVQHH PEASPGPQDS HYLFRRFINM VREKKGEPAL AER
|
| |