Gene Hhal_1970 details


Gene Information

Locus tag: Hhal_1970
Symbol:
ID: 4710337
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 2171291
End bp: 2172850
Gene Length: 1560 bp
Protein Length: 519 aa
Translation table: 11
GC content: 64%
IMG OID: 639856443
Product: Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase
Protein accession: YP_001003536
Protein GI: 121998749
COG category: [R] General function prediction only
COG ID: [COG0388] Predicted amidohydrolase
TIGRFAM ID:
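
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 2171291
end_bp = 2172850
gene_length_bp = 1560
protein_length_aa = 519

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: an unspliced CDS is a whole number of codons and the last one is a stop codon.
codons, remainder = divmod(gene_length_bp, 3)

print(span)        # 1560
print(remainder)   # 0
print(codons - 1)  # 519
```

Under these assumptions the span matches the reported gene length, and 520 codons minus one stop codon gives the reported 519 aa.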


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGAAG AGTCGGAACC GGAACGGACA GAACAGGACG AGCACCACCT GCGCCTGCGC 
AACATGGTCC TTGAGGACTA CGACGATGTC GCGCACATCA TGGACCAAGT GTATCCCGAT
ATGGACGGGG CGTGGTCGCG GGAACAGTTC GCGGCGCAGA TCAATCGCTT CCCGGAGGGG
CAGATCTGCA TCGAGGACAA CGGCCGCGTG GTTGCCGGCG CCATCACGCT CATCGTGGAC
TACGGTCGCT TCGGCGACAA GCACACCTAC GAGGAGATCA CCGGCGGCGG CTACCTGACC
ACCCACGACC CCAACGGCGA CGTCCTTTAC GGCGTGGATA TCTTCGTCGA CCCGGAGTAC
CGCTCGATGC GCCTGGGGAG GCGACTCTAC GACGCCCGCA AGGAGCTGTG CCGGCGTCTG
AACCTGCGCG GCATCGTCGC CGGCGGGCGC ATCCCCGGCT ACACCCGGCA CGCGGACGAG
ATGTCGCCGG AGGCCTACAT CGAGCTGGTC AAGCGCCGCG AGATCCATGA CCCGATCCTC
TCGTTCCAGC TCGCCAACGA CTTCCACGTG CGGCGGATCA TCACCGATTA CCTGCCCTCG
GACCGGGACT CGTACGCCTA CGCGACCCTG GTGCAGTGGA ACAACATCTT CTACGAACCC
AAGGAACAGC CGCTGATCAC CCGGCGCAAG GCCGTGGTCC GGGTCGGCAC TGTGCAGTGG
CAGATGCGCC CGATGAGCTC GGTGGACGAG CTGATCGATC AGGTGGAGTT CTTCGTCGAC
GCCCTGGCCG GCTACAACGC CGACTTTGCA CTGTTCCCGG AGTTCTTCAA CGGACCACTG
CTTTCGCTGT TCAACCAGGA GAACCCGGCC GAGGCCATCC GCGGCCTGGC GCAGTACACC
GAGGAGATCG TCGACGAGAT GTCGCGCCTG GCGGTCTCCT ACAACATCAA TATCATCGCC
GGCTCGATGC CCGTCTACCA GGAGCAGTCG CTCTACAACG TCTCCTACCT CTGCCGGCGC
GACGGCACGA CGGATGAGCA GCACAAGCTG CACATCACCC CCGACGAGCG CTCCTACTGG
GGCGTGCGGG GCGGTGACGA ACTCAAGGTC TTCGAGACCG ACATCGGCAA GATCGGCATC
CTGGTCTGCT ACGACTCCGA GTTCCCAGAG CTGCCGCGGA TCCTCAATCA GAAGGGGGCG
CGCATCCTCT TCGTGCCCTA CTGGGTCGAC ACCCAGACCG GCTATCTGCG CGTCCGCCGC
TGCGCCCAGG CACGGGCCAT CGAGAACGAG TGCTATGTGG CGATCACCGG CTCCGTGGGC
AACCTGCCCA AGGTAGAGAA CATCGACATC CAGTACTCGC AGTCGGCGGT GTTCTCGCCG
GCGGACTTCG CCTTCCCCCA CGACGCCATC GTCGCGGAGA CCACCCCGAA CACCGAGATG
ACGCTGATCG TCGACCTGGA TCTGGACAAG CTCACCGAGA TCCGCAACGA GGGCTCGGTC
CGCAATTACA AGGACCGTCG CCTGGATCTC TACCGCATCC AGTGGCTGGG CCGGGACTGA
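
The gene sequence above is printed in blocks of 10 bases separated by spaces, so it needs light cleanup before any computation. The sketch below shows one way to strip the formatting and compute a GC percentage like the one reported in the record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as an excerpt, so the printed figure is for that excerpt and not for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short excerpt.
excerpt = "ATGCAAGAAG AGTCGGAACC GGAACGGACA GAACAGGACG AGCACCACCT GCGCCTGCGC"
seq = clean_sequence(excerpt)

# Length of the excerpt and its GC content as a whole-number percentage.
print(len(seq), round(100 * gc_fraction(seq)))
```

Passing the full 26-line sequence to `clean_sequence` in place of the excerpt would give the value for the complete gene.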
 
Protein sequence
MQEESEPERT EQDEHHLRLR NMVLEDYDDV AHIMDQVYPD MDGAWSREQF AAQINRFPEG 
QICIEDNGRV VAGAITLIVD YGRFGDKHTY EEITGGGYLT THDPNGDVLY GVDIFVDPEY
RSMRLGRRLY DARKELCRRL NLRGIVAGGR IPGYTRHADE MSPEAYIELV KRREIHDPIL
SFQLANDFHV RRIITDYLPS DRDSYAYATL VQWNNIFYEP KEQPLITRRK AVVRVGTVQW
QMRPMSSVDE LIDQVEFFVD ALAGYNADFA LFPEFFNGPL LSLFNQENPA EAIRGLAQYT
EEIVDEMSRL AVSYNINIIA GSMPVYQEQS LYNVSYLCRR DGTTDEQHKL HITPDERSYW
GVRGGDELKV FETDIGKIGI LVCYDSEFPE LPRILNQKGA RILFVPYWVD TQTGYLRVRR
CAQARAIENE CYVAITGSVG NLPKVENIDI QYSQSAVFSP ADFAFPHDAI VAETTPNTEM
TLIVDLDLDK LTEIRNEGSV RNYKDRRLDL YRIQWLGRD
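
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a minimal sketch, not the tool used to produce the record. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the `translate` helper is a hypothetical name introduced here.

```python
# Codon table built in TCAG order; amino-acid assignments of the standard code,
# which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(raw: str) -> str:
    """Translate a block-formatted DNA string, stopping at the first stop codon.

    Raises KeyError on characters other than A, C, G, T.
    """
    seq = "".join(raw.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGCAAGAAG AGTCGGAACC GGAACGGACA"))  # MQEESEPERT
# Last 30 bases of the gene sequence above (in frame, ending in the TGA stop).
print(translate("TACCGCATCC AGTGGCTGGG CCGGGACTGA"))  # YRIQWLGRD
```

Both outputs match the first and last residues of the protein sequence listed above.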