Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_1121 |
Symbol | |
ID | 3916417 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 1165884 |
End bp | 1166882 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640443856 |
Product | cysteine synthase A |
Protein accession | YP_496400 |
Protein GI | 87199143 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGGCTC CTGCGATTGC TCCCGATACC ATTTCCCTCA TCGGCAACAC GCCTCTCGTC CGGCTCAAGG GGCCGAGCGA GGAGACCGGT TGCGAGATCT ACGGCAAGTG CGAATTCACC AACCCCGGCG CTTCGGTGAA GGATCGCGCG GCCCTGTGGA TCGTCCGCGA CGCCGAAGAG CGCGGCATTC TCAAGCCCGG CGGCACCATT GTCGAGGGCA CGGCGGGTAA CACCGGGATC GGCCTGGCAC TCGTCGCCAA TGCGCGCGGC TACAAGTCGG TCATCGTCAT GCCCGAGACG CAATCGCGCG AGAAGATGGA CACCTTGCGG GCACTGCGTT CGGAGCTGGT GCTGGTTCCG GCCGCCCCCT TCTCGAACCC CGGCCACTTC GTGCACACTT CGCGCCGCAT TGCCGAGGAG ACCGAAGGCG CGGTCTGGGC GAACCAGTTC GACAACATCG CCAACCGCCG AGCGCACATC GAAAGCACCG CGCCCGAAAT CTGGGAGCAG ATGGAGCATC GCATCGATGG CTTCACCTGC GCTGCGGGTA CGGGTGGCAC CATCGCGGGC GTGGGCATGG GCCTCAAGGC CTTCGACGAG AACATCACCA TTGCCCTCAC CGATCCGCAT GGCGCCGCGC TGTACAATTA CTATGCCCAC GGCGAACTGA AGGCGGAAGG CTCTTCGGTT GCCGAGGGGA TCGGTCAGGG GCGCATCACG GCGAACCTCG ACGGTGCGCC CATCGACACC CAGTTCCGCA TTTCGGACGA GGAAGGTCTG CACTGGGTCG AACGCCTGCT GGCCGAGGAA GGCCTCTGTC TTGGCCTGTC GAGCGGCATC AACGTGGCGG GCGCGGTTGC GCTGGCAAGG CAACTGGGCA AGGGCAGCCG CGTGGCGACG ATCCTGTGCG ACACGGGCTT CCGCTATCTC TCCTCGCTCT ACAATCCGGA ATGGCTCAAG ACCAAGGGCC TGCGAGTGTT CCCTTGGCTG GAGCAATGA
|
Protein sequence | MMAPAIAPDT ISLIGNTPLV RLKGPSEETG CEIYGKCEFT NPGASVKDRA ALWIVRDAEE RGILKPGGTI VEGTAGNTGI GLALVANARG YKSVIVMPET QSREKMDTLR ALRSELVLVP AAPFSNPGHF VHTSRRIAEE TEGAVWANQF DNIANRRAHI ESTAPEIWEQ MEHRIDGFTC AAGTGGTIAG VGMGLKAFDE NITIALTDPH GAALYNYYAH GELKAEGSSV AEGIGQGRIT ANLDGAPIDT QFRISDEEGL HWVERLLAEE GLCLGLSSGI NVAGAVALAR QLGKGSRVAT ILCDTGFRYL SSLYNPEWLK TKGLRVFPWL EQ
|
| |