Gene Huta_2621 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHuta_2621 
Symbol 
ID8384926 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhabdus utahensis DSM 12940 
KingdomArchaea 
Replicon accessionNC_013158 
Strand
Start bp2683325 
End bp2684623 
Gene Length1299 bp 
Protein Length432 aa 
Translation table11 
GC content66% 
IMG OID644973696 
ProductO-acetylhomoserine/O-acetylserine sulfhydrylase 
Protein accessionYP_003131516 
Protein GI257053683 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2873] O-acetylhomoserine sulfhydrylase 
TIGRFAM ID[TIGR01326] OAH/OAS sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGGTC AGGAATCCCC ATCACGACAG TTCGGGACGC GGTGTCTGCA CGCGGGCCAG 
TCGCCGGATC CCGAGACCGG CGCAGTCGCA CCGCCGATCT ACCAGACGAC CTCCTACGCC
TTCGAGGACG CCGATCGGGC GGCAGATCTC TACGCCCTGG AGGCTGAGGG CAACGTTTAC
AGTCGCTTCG ACAACCCGAC GGTCCGATCG CTGGAACGTC GACTGGCGTC GCTCGAGAGC
GGCGTCGACG CCGTCGCGAC TGCGTCAGGA ATGGCCGCTC TGGACGCGAC GACGACAGCC
CTCGCCAGTG CGGGCGAGAA TATCGTCTCA GCCGCCTCGA TCTACGGCGG AACCCATTCC
TATCTGACGA CGACCGCCCG CGAGCGAGGG ATCGAAGCGC GGTTCGTCGA CACCCTCGAT
TATGGGGCCT ACGCGGCGAC CATCGACGAC GAGACGGCCT ACGTCCACCT CGAAACCATC
GGCAACCCCT CGCTCGTGAC GCCGGACATC GAGCGGATCG CCGACATCGC CCACGATCAC
GGTGCGCCGC TCGTCGTGGA CAACACCTTT GCGACGCCCT ATCTCTGCAA TCCGATCGAA
CACGGCGCGG ACGTCGTCTG GGAGTCGACG ACGAAGTGGA TTCACGGCTC CGGGACCACT
GTTGGCGGCG TCGTGATCGA CGGCGGAAGC TTCCCCTGGG CGGACCATCC CGAAAAGTAT
CCCGCGCTGG GCGAGAGTGC CGGCGCGCTG GACGACGAGA GCTTCGTCGA TCGCTTCGGC
GAGCGGGCCT TCGCCGTGGC AACGCGCCAG CGGGCCGTCC GAAACGTCGG CGACGGCCAG
AAACCCTTCG ATGCCTGGGC GACGCTCCAA GGGGCCGAAA CCCTGGCTGT CCGGATGGAA
CGTCACTGCG AGAACGCCAC ACGTGTCGCC GAGTTCCTCG CCGACCATCC TGCCGTGGAG
TGGGTGTCCT ATCCGGGTCT GGAATCCCAC GAGACCAACG AGCAAGCCAG CGAGTATCTG
CAGGGTGGAT TCGGCGGCAT GGTCACGTTC GGCCTCTCCG GGGGGTACGA GGGGGCCAAA
CGGCTCTGTG AGGAGACCGA CCTCGCGCAG TTCCTCGCGA ACGTCGGCGA CGCCAAGACG
CTGATCATCC ACCCGGCCTC GACGACCCAC GCCAAACTCT CGCCCGAGGA ACAGCGGGCG
AGCGGCGTCG CGCCCGACAT GATCCGGTTG TCGGTCGGGA TCGAAGACGT CAGAGACATC
ATTGGGGACC TCAAAGAGGG AATCGAGACA GCTACATGA
 
Protein sequence
MSGQESPSRQ FGTRCLHAGQ SPDPETGAVA PPIYQTTSYA FEDADRAADL YALEAEGNVY 
SRFDNPTVRS LERRLASLES GVDAVATASG MAALDATTTA LASAGENIVS AASIYGGTHS
YLTTTARERG IEARFVDTLD YGAYAATIDD ETAYVHLETI GNPSLVTPDI ERIADIAHDH
GAPLVVDNTF ATPYLCNPIE HGADVVWEST TKWIHGSGTT VGGVVIDGGS FPWADHPEKY
PALGESAGAL DDESFVDRFG ERAFAVATRQ RAVRNVGDGQ KPFDAWATLQ GAETLAVRME
RHCENATRVA EFLADHPAVE WVSYPGLESH ETNEQASEYL QGGFGGMVTF GLSGGYEGAK
RLCEETDLAQ FLANVGDAKT LIIHPASTTH AKLSPEEQRA SGVAPDMIRL SVGIEDVRDI
IGDLKEGIET AT