Gene Hhal_1797 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1797 
Symbol 
ID4711000 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1968924 
End bp1970117 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content69% 
IMG OID639856267 
ProductO-succinylhomoserine sulfhydrylase 
Protein accessionYP_001003363 
Protein GI121998576 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases 
TIGRFAM ID[TIGR01325] O-succinylhomoserine sulfhydrylase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.697099 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGCACAA CCAGGCCTGA TGACGATCCG CTCGCCGGGT TCGAATCCCG GGCGGTGCGA 
GCCGGTCAGG TGCGCACCGA TGCCCAGGAG CAGTCGGAGC CCATCTACCC GACCTCCAGC
TTTACCTTCG AGAGTGCCGC ACAGGCGGCC GCGCGTTTCT CCGGTGAGGA CCCCGGGAAC
GTCTACTCGC GTTTCACCAA CCCGACGGTG CGGACCTTCT GCGACCGCCT GGCCGCCCTG
GAGGGCGGGC AGGCCTGCGT CGGGACGGCT TCAGGCATGT CGGCGGTGCT GGCGACGTGC
CTGGGGCTGC TGCAGGCCGG CGACCACGTG GTCGCCTCGC GGACCCTCTT CGGGACCACC
CTGTCGCTGC TGACCAAGTA CCTGCCGCGC TGGGGCATCG AGGTCAGCTG GGTGCCGTTG
AGCGACGAGC GGGCCTGGGC TGATGCGGTG CAGCCGAACA CGCGCCTGCT CTTTGCTGAG
ACGCCCTCCA ACCCCCTCAA CGAGGTGGTG GACATCCGCC GCCTCGCGGA GGTGGCCCAT
GCCCACGAAG CCCTGTTGGC GATCGACAAC TGCTTCTGTA CCCCCGCCCT GCAGCGCCCG
CTGGAGATGG GGGCCGACCT GGTGATCCAC TCGGCCACCA AGTACCTGGA CGGTCAAGGT
CGCTGTGTCG GTGGCGCGGT GGTTGGCGAC GCCCAGCGCG TGGGGGAAGA GATCCACGGT
TTCATCCGCA CCGCCGGGCC GTGCATGAGC CCGTTCAACG CCTGGGTGTT CCTCAAGGGA
TTGGAGACGC TGTCCCTGCG CATGCATGCG CACAGCCGGA ATGCCCAGCA GGTGGCGGAG
TGGCTGCAGG GCCATCCCGG CGTCGAGCGG GTCCACTACG CCGGGCTGCC GGACCATCCC
CACCACCGCC TGGCCGCGGC GCAGCAGAGC GGGTTCGGCG GGATTGTGGC CTTCGAGCTC
CCCGGAGGCC GGGAGGCAGC CTGGCGTCTG ATCGACAGCA CGCGCATGCT GTCGATCACC
GGCAACCTGG GGGACACCAA GTCCACCATC ACCCATCCGG CGACCACCAC CCACGGCACC
ATCTCCGATG AGTTGCGCGC GGCCGCCGGC ATCCGCGAGG GGCTGGTGCG GGTTTCCGTT
GGGTTGGAGG ATCCGGCGGA TATCATCCGC GACCTGGAGC GCGGCCTGGG GTGA
 
Protein sequence
MCTTRPDDDP LAGFESRAVR AGQVRTDAQE QSEPIYPTSS FTFESAAQAA ARFSGEDPGN 
VYSRFTNPTV RTFCDRLAAL EGGQACVGTA SGMSAVLATC LGLLQAGDHV VASRTLFGTT
LSLLTKYLPR WGIEVSWVPL SDERAWADAV QPNTRLLFAE TPSNPLNEVV DIRRLAEVAH
AHEALLAIDN CFCTPALQRP LEMGADLVIH SATKYLDGQG RCVGGAVVGD AQRVGEEIHG
FIRTAGPCMS PFNAWVFLKG LETLSLRMHA HSRNAQQVAE WLQGHPGVER VHYAGLPDHP
HHRLAAAQQS GFGGIVAFEL PGGREAAWRL IDSTRMLSIT GNLGDTKSTI THPATTTHGT
ISDELRAAAG IREGLVRVSV GLEDPADIIR DLERGLG