Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Saro_2103 |
Symbol | |
ID | 3917751 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | + |
Start bp | 2239574 |
End bp | 2240491 |
Gene Length | 918 bp |
Protein Length | 305 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 640444856 |
Product | cysteine synthase |
Protein accession | YP_497376 |
Protein GI | 87200119 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0031] Cysteine synthase |
TIGRFAM ID | [TIGR01136] cysteine synthases [TIGR01139] cysteine synthase A |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.0401259 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGAGCGG AATCCATCCT CGCCACGATC GGCAACACCC CGCACATTCG CCTGTCCCGC CTGTTTCCCG ACCACGAGGT ATGGGTGAAG TGCGAGCGCG CCAATCCGGG CGGCTCCATC AAGGACAGGA TCGGCCTCGC CATGATCGAG GCCGCAGAGG CCGACGGCAG CCTGAAGCCG GGCGGCACCA TCGTCGAGCC CACTTCCGGC AATACCGGCA TCGGTCTTGC CATGGCGGCG GCGGTAAAGG GCTACAAGCT CATCCTCGTC ATGCCTGAAT CGATGTCGCT TGAACGCCGC CGCCTGATGC TGGCCTATGG CGCGACCTTC GACCTGACCC CGCGCGAAAA GGGCATGAAG GGCGCGATCG AGCGCGCGAA GGAAATCGTC GATCAGACCG ACGGCGCATG GATGCCGCAG CAGTTCGACA ATCCGGCCAA CGTATCGGTT CACGTCCGCA CGACCGCCCA GGAAATCCTC AAGGACTTCG GCAGCGAACC GATCGACGTG CTCATCACCG GCGTCGGCAC CGGCGGTCAC CTGACCGGTT GCGCCGAGGA ACTGAAGAAG CACTGGCCCT CGCTCAAGGC CTATGCCGTG GAACCCACCC TTTCCCCGGT CATCTCGGGC GGCCAACCCG GCCCGCACCC GATCCAGGGC ATCGGCGCGG GCTTCATCCC CGGCAACCTC CACACCCAGT CGATCGACGG CGCAATCCAG GTCGACCCGG CGGACGCCAA GGAAATGGCG CGCCTTTGCG CATCGAAGGA AGGCATGCTG GTCGGCATCT CGTCCGGCGC AACGCTGGCG GCCATCGCGC AGAAGCTGCC GAGCCTGCCC GCCGGCAGCC GCGTGCTGGG CTTCAACTAC GACACCGGCG AGCGCTATCT CTCGGTGCCG GACTTCCTGC CGGAGTAA
|
Protein sequence | MRAESILATI GNTPHIRLSR LFPDHEVWVK CERANPGGSI KDRIGLAMIE AAEADGSLKP GGTIVEPTSG NTGIGLAMAA AVKGYKLILV MPESMSLERR RLMLAYGATF DLTPREKGMK GAIERAKEIV DQTDGAWMPQ QFDNPANVSV HVRTTAQEIL KDFGSEPIDV LITGVGTGGH LTGCAEELKK HWPSLKAYAV EPTLSPVISG GQPGPHPIQG IGAGFIPGNL HTQSIDGAIQ VDPADAKEMA RLCASKEGML VGISSGATLA AIAQKLPSLP AGSRVLGFNY DTGERYLSVP DFLPE
|
| |