Gene Hhal_2065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_2065 
Symbol 
ID4709997 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp2269167 
End bp2270441 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content70% 
IMG OID639856538 
Producthypothetical protein 
Protein accessionYP_001003631 
Protein GI121998844 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGGGAC TGTCGGTATC GATGGAAGGA TGGGAACGAG AGCCACGCCG AGTGGGCTGG 
CAGGCCGGTC TGGTCGGGGT CTTGCTGGCG TTGCCGGTGG CGACGGCCGC TTTGGAGTAT
CGGAGTGGCG GAGTGCTGGA GGACGGCGCC ACCGAGCACC GCTTCGGGGT CGAGCGGTTC
AGCTGGGCAG AGCCGCCTGG CAACGACGCG GAGTCGCGGG ACGACGTTCT GCGTCTGTCG
CCGTCAGCCC GCTTCGGGCT TGCCGCGGGT TACGATCTTC GCGTGGGGCT CCCTGTCCAG
CAGGAGGATG ATCGGCGCGA TCTGCACGGT GTGGAGCTGG AACTCGGCCT GCCGCTGCGT
GAGGGTGACG CAGGCCCCGA CGTGACCCTG GCGGTCCACG GTCGGTTGCT CCCGGCGGAT
CCACCCCTCG GCAGTGGCAG CGATGGGCTG GGCGTGGCCG TTCACCTGAG TGACCGACTG
GGTGAACGGG GCATCCGGCT GGACGGCTAC CTCGGCCTGG AGCGGGGCGA CGCGGCCCTG
CGCGATGGCC CCGGCTACGA GGCCGTCAAC CGCCTGCATT ACGCCAATCG CATCGAATAC
CCCTTAGGCG CAGGCTGGGG CGTCGGTGCC GATGTGCGCA CCGTGATCGG CCTCAGTGGC
GAAGAGGTGC AGAACCAATT CGCCTTCGTG ATCCGTCCCG GTCTCAGTTA TCGACCGACC
GCGAACACCA CCCTGCGCGC CGCTGCAGGG CGCGAGCTGG CCGACCGTGG CGTCGAGCCG
GAGTCCACAG TACAGCTCTC ATTGACCCAT CGGCCGCAGG CCCCGGCGCC GCGCCGTGAG
CTGCAGGCGC GCCTGGCCGA GCTAGAGGAT CGCCACGAGC GGATGACCCA GGAGCAGACG
GGGATCGCCC AGCGTCAGGC CCGGCAGGCG GGACGGCTCT CCGAGCACGG CGAGGTGATC
GACCTGGTCA AGCGCCGCGC CGGAACCCTG GAGGTCGAGG TAGTGAACCG CTCCGGTGAA
CGCCAGCACG CCAGTGAGGC AGTGGCCCGC CTGGAGCGCC TTGGGCACCA CGTGGTCCGG
CGCATGGAGC GTCCGGAGGC ATCGATGCGC GACGCCAGCG TCGTCCAGTA CCGCGAGGCC
TACGAAGAGG CCGCGGTGGA ACTCGGTGAG GCACTGCCGG GTGTCCAGGA GGTGTACCGG
GCCGATCCGC CGATCGGGCC CGGGGCCGAC GTGCGCTTGA TTGTTGGCGC CGACTTCGGC
AGCGATGGGG AGTAA
 
Protein sequence
MLGLSVSMEG WEREPRRVGW QAGLVGVLLA LPVATAALEY RSGGVLEDGA TEHRFGVERF 
SWAEPPGNDA ESRDDVLRLS PSARFGLAAG YDLRVGLPVQ QEDDRRDLHG VELELGLPLR
EGDAGPDVTL AVHGRLLPAD PPLGSGSDGL GVAVHLSDRL GERGIRLDGY LGLERGDAAL
RDGPGYEAVN RLHYANRIEY PLGAGWGVGA DVRTVIGLSG EEVQNQFAFV IRPGLSYRPT
ANTTLRAAAG RELADRGVEP ESTVQLSLTH RPQAPAPRRE LQARLAELED RHERMTQEQT
GIAQRQARQA GRLSEHGEVI DLVKRRAGTL EVEVVNRSGE RQHASEAVAR LERLGHHVVR
RMERPEASMR DASVVQYREA YEEAAVELGE ALPGVQEVYR ADPPIGPGAD VRLIVGADFG
SDGE