Gene Clim_0668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0668 
Symbol 
ID6354282 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp740570 
End bp741859 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content57% 
IMG OID642668295 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_001942730 
Protein GI189346201 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones45 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTCAA ACCACTACCG TTTCGAAACG CTTGCCCTGC ATGCAGGGCA GCCTGTCGAT 
CAGACACAGT CCCGCGGCAT CCCGGTGTAC CGCACCAGTT CCTACATCTT CAAAAACACG
AAACATGCGG CGAACCTGTT TGCCTTGAAG GAACTGGGCA ACATCTATAC CCGACTGATG
AACCCGACCA CAGATATCCT GGAACAGCGC ATAACGGAAC TTGAAGGGGG AGCCGCATCC
GTTGCGCTCG CATCGGGCAC GGCGGCAATC TTCAATGCCG TCATCACGCT GGCGGAAGCC
GGGGATGGCA TCATTGCCGC CAACAATCTC TACGGCGGCA CCTATACCCA GTTCGACGCC
ATTCTGCCTA AACTCGGCAT CGATGTCACC TTCGTCGATC CGCATAAACC GGAAAACTTC
GAGCGGGCGA TAACGGAAAA AACAAGGGCG ATCTTCATCG AAACGATCGG CAACCCCGCT
CTGGATTACA CCGGGGTAAA AGCCGTCGCC GATGTCGCCC ACCGTAACGG GCTCCCCCTG
ATCGTCGACG CAACGTTCAC GACCCCATAC CTTTTGAGAA CAATAGAACT TGGAGCCGAC
ATCGTGGTCA ACTCCCTGAC AAAATGGATA GGAGGACACG GCGCAGCGGT GGGAGGAAGT
ATTACCGATG CCGGACGTTT TGACTGGAAA AAAGGTCGCC ATCCTCTCTT TACCGAACCG
GACGACAACT ACCACGGACT CCGCTGGGCG CTCGACCTGC CCGAGCCGCT CGCTGCGATA
GCCTTCGCCC TCAGGGTACG CACCGTACCG TTAAGAAACC TTGGATCGTG CATTTCGCCC
GACAATTCAT GGATATTCCT CCAGGGTCTC GAAACCTTGC CGGTGCGCAT GGCGCGGCAT
TGCGAAAACG CACTTTACGT GGCAGAATAT CTCGAACACC ATCCCAACGT GGCATGGATT
CGCTATCCAG GCCTGAAAAA CGACACGTCC CATGCCGCAG CTTCGAAAGA CCTGAAAAAA
GGGTTCGGAG GCATGGTGGT GTTCGGCGTA AAAGGAGGAT ACGATGCCGC CGTCCGGCTT
ATCGATTCCA TCGGCCTCTT CTCGCACCTT GCCAACGTCG GTGACGCAAA AAGCCTCATC
CTGCATCCGG CAAGCACCTC CCACAGCCAG TTGTCCGAAG AACAGCAGCG GCAGGGCGGA
CTCTCTCCGG AACTGATACG CCTCTCCATA GGGCTCGAAC ATCCCGACGA CCTGATAGAG
GCACTCGATA ACGCGCTTCA ACCCTTGTAA
 
Protein sequence
MSSNHYRFET LALHAGQPVD QTQSRGIPVY RTSSYIFKNT KHAANLFALK ELGNIYTRLM 
NPTTDILEQR ITELEGGAAS VALASGTAAI FNAVITLAEA GDGIIAANNL YGGTYTQFDA
ILPKLGIDVT FVDPHKPENF ERAITEKTRA IFIETIGNPA LDYTGVKAVA DVAHRNGLPL
IVDATFTTPY LLRTIELGAD IVVNSLTKWI GGHGAAVGGS ITDAGRFDWK KGRHPLFTEP
DDNYHGLRWA LDLPEPLAAI AFALRVRTVP LRNLGSCISP DNSWIFLQGL ETLPVRMARH
CENALYVAEY LEHHPNVAWI RYPGLKNDTS HAAASKDLKK GFGGMVVFGV KGGYDAAVRL
IDSIGLFSHL ANVGDAKSLI LHPASTSHSQ LSEEQQRQGG LSPELIRLSI GLEHPDDLIE
ALDNALQPL