Gene Hhal_0421 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_0421 
Symbol 
ID4711453 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp488709 
End bp490190 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content65% 
IMG OID639854879 
Producthypothetical protein 
Protein accessionYP_001002012 
Protein GI121997225 
COG category[R] General function prediction only 
COG ID[COG2251] Predicted nuclease (RecB family) 
TIGRFAM ID[TIGR03491] RecB family nuclease, putative, TM0106 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.805168 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCGCC TGTCCAAATC CCGCTTCATA GCCGGCTGGC AGTGCCCGCT TCGGCTCTGG 
TACACGGTCC ACGAGCCCTG GCGAGCCACG CCGCCTGATC CCCAGTTGCA AGCGATCTTC
GACCAGGGCC ACCGACTCGG GCAATTGGCT CAGCAGCGCT ATCCATCCGG GGTGCTCGTC
GAAGCGGATT ACCGACACAC GCGCCAGGCC CTCGAGCAAA CGCAGGTGCT GATGGCCGAC
CCGGGCGTGC CGGCGATCTT TGAAGCCGCT CTGGAGCACG ACCAGGTCCT GACCCGAGTG
GACGCGCTCG TTCGAAACGG GGACGGGTGG GACCTGGTCG AGGTCAAGGG CGCCACGCGT
GCCAAGGAGG TTTTCCAGCT CGACGTGGCG ATCCAGTATT GGATCCTCAC CGGTGCCGGC
ATCCCGGTAC GCGATGCCGG GCTGCTGTTG ATCGACCGTG ACTACGTCTA TCCGGGCGGC
GCGCACGACC CGCAGGAGTT CTTCCGGTTC GAACCGCTGA CCAGAACCTG CGAGCACTGG
CTTGAGTGGA TCGAGGCGCA GGCCGTCTCC TTCCAGGAGG TTGCCGCCCG TACAACACCA
CCCCAGATCG AAATCGGGCA GCAGTGCTTC AGCCCTTACC CCTGCCCGTT CTACAACCAC
TGCTCTGCTG ACAGGGAGTG GCCCGAGCAC CCGATCGAGG AGCTACCGTA TCTGGCCGGC
GAGCGTTACC AGGGCCTCCT CGAGCGGGGT GTCACAACCA TCGACGCCAT CCCCGACGAT
TACCCGCTTA CCGTCGCTCA GAAGCGGATC CGAGACGCCG TTGCCAAGGG GCATCCCTGG
ACCGATCCCG ACCTGGGTGA AGCCGTTCAG GGTGTGGAGT GGCCTCTCTT CTTCCTGGAT
TTCGAGACCG CACAGCCGGC CTTACCCCGT TACCCCGGAA CCTCCCCGTT TCAGGCCTTG
CCGTTCCAGT TTTCCTGCCA TATCCAACGA GCCCCCGGGG TCGCCCCGGA GCACACGGAC
TTTCTGGCCA CCGCCGATAC CGACCCCCGC CGCCCACTTG CCGAGGCACT GCTCAACGCA
CTCGGTGACC GCGGATCGAT CGTCGTTTAC TCCAGCTTTG AGCGGCGGGT CCTCGGCGAA
CTCGCCGATG CGCTGCCGGA TCTCGCTGAG CCCTTAGCCG CCCTTCAGGA GCGGCTGTGG
GATCTGCTAC GGGTCCTGCG GAACCACTAC TACCACCCGG CTTTCAAGGG CTCCTACTCC
ATCAAGCGGG TCCTGCCCGT ACTGGTTCAA GACCTCGACT ACACCGCCCT GGAAGTGGCC
GATGGCCAGG CCGCCGCGCA GGCCTGGATC CAGATGCTCG ACACGGATTG CGACGAGGAA
CGCGATCGGC TCGCCCAAGC CCTCCGCGAA TATTGCTTCA CCGACACGCT GGCCATGGTG
CGACTGCGCG AGGCGTTACT GCGTGCCGCC GGAGAGCTGT GA
 
Protein sequence
MPRLSKSRFI AGWQCPLRLW YTVHEPWRAT PPDPQLQAIF DQGHRLGQLA QQRYPSGVLV 
EADYRHTRQA LEQTQVLMAD PGVPAIFEAA LEHDQVLTRV DALVRNGDGW DLVEVKGATR
AKEVFQLDVA IQYWILTGAG IPVRDAGLLL IDRDYVYPGG AHDPQEFFRF EPLTRTCEHW
LEWIEAQAVS FQEVAARTTP PQIEIGQQCF SPYPCPFYNH CSADREWPEH PIEELPYLAG
ERYQGLLERG VTTIDAIPDD YPLTVAQKRI RDAVAKGHPW TDPDLGEAVQ GVEWPLFFLD
FETAQPALPR YPGTSPFQAL PFQFSCHIQR APGVAPEHTD FLATADTDPR RPLAEALLNA
LGDRGSIVVY SSFERRVLGE LADALPDLAE PLAALQERLW DLLRVLRNHY YHPAFKGSYS
IKRVLPVLVQ DLDYTALEVA DGQAAAQAWI QMLDTDCDEE RDRLAQALRE YCFTDTLAMV
RLREALLRAA GEL