Gene Saro_1121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1121 
Symbol 
ID3916417 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1165884 
End bp1166882 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content65% 
IMG OID640443856 
Productcysteine synthase A 
Protein accessionYP_496400 
Protein GI87199143 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGGCTC CTGCGATTGC TCCCGATACC ATTTCCCTCA TCGGCAACAC GCCTCTCGTC 
CGGCTCAAGG GGCCGAGCGA GGAGACCGGT TGCGAGATCT ACGGCAAGTG CGAATTCACC
AACCCCGGCG CTTCGGTGAA GGATCGCGCG GCCCTGTGGA TCGTCCGCGA CGCCGAAGAG
CGCGGCATTC TCAAGCCCGG CGGCACCATT GTCGAGGGCA CGGCGGGTAA CACCGGGATC
GGCCTGGCAC TCGTCGCCAA TGCGCGCGGC TACAAGTCGG TCATCGTCAT GCCCGAGACG
CAATCGCGCG AGAAGATGGA CACCTTGCGG GCACTGCGTT CGGAGCTGGT GCTGGTTCCG
GCCGCCCCCT TCTCGAACCC CGGCCACTTC GTGCACACTT CGCGCCGCAT TGCCGAGGAG
ACCGAAGGCG CGGTCTGGGC GAACCAGTTC GACAACATCG CCAACCGCCG AGCGCACATC
GAAAGCACCG CGCCCGAAAT CTGGGAGCAG ATGGAGCATC GCATCGATGG CTTCACCTGC
GCTGCGGGTA CGGGTGGCAC CATCGCGGGC GTGGGCATGG GCCTCAAGGC CTTCGACGAG
AACATCACCA TTGCCCTCAC CGATCCGCAT GGCGCCGCGC TGTACAATTA CTATGCCCAC
GGCGAACTGA AGGCGGAAGG CTCTTCGGTT GCCGAGGGGA TCGGTCAGGG GCGCATCACG
GCGAACCTCG ACGGTGCGCC CATCGACACC CAGTTCCGCA TTTCGGACGA GGAAGGTCTG
CACTGGGTCG AACGCCTGCT GGCCGAGGAA GGCCTCTGTC TTGGCCTGTC GAGCGGCATC
AACGTGGCGG GCGCGGTTGC GCTGGCAAGG CAACTGGGCA AGGGCAGCCG CGTGGCGACG
ATCCTGTGCG ACACGGGCTT CCGCTATCTC TCCTCGCTCT ACAATCCGGA ATGGCTCAAG
ACCAAGGGCC TGCGAGTGTT CCCTTGGCTG GAGCAATGA
 
Protein sequence
MMAPAIAPDT ISLIGNTPLV RLKGPSEETG CEIYGKCEFT NPGASVKDRA ALWIVRDAEE 
RGILKPGGTI VEGTAGNTGI GLALVANARG YKSVIVMPET QSREKMDTLR ALRSELVLVP
AAPFSNPGHF VHTSRRIAEE TEGAVWANQF DNIANRRAHI ESTAPEIWEQ MEHRIDGFTC
AAGTGGTIAG VGMGLKAFDE NITIALTDPH GAALYNYYAH GELKAEGSSV AEGIGQGRIT
ANLDGAPIDT QFRISDEEGL HWVERLLAEE GLCLGLSSGI NVAGAVALAR QLGKGSRVAT
ILCDTGFRYL SSLYNPEWLK TKGLRVFPWL EQ