Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2065 |
Symbol | |
ID | 4709997 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2269167 |
End bp | 2270441 |
Gene Length | 1275 bp |
Protein Length | 424 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 639856538 |
Product | hypothetical protein |
Protein accession | YP_001003631 |
Protein GI | 121998844 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTGGGAC TGTCGGTATC GATGGAAGGA TGGGAACGAG AGCCACGCCG AGTGGGCTGG CAGGCCGGTC TGGTCGGGGT CTTGCTGGCG TTGCCGGTGG CGACGGCCGC TTTGGAGTAT CGGAGTGGCG GAGTGCTGGA GGACGGCGCC ACCGAGCACC GCTTCGGGGT CGAGCGGTTC AGCTGGGCAG AGCCGCCTGG CAACGACGCG GAGTCGCGGG ACGACGTTCT GCGTCTGTCG CCGTCAGCCC GCTTCGGGCT TGCCGCGGGT TACGATCTTC GCGTGGGGCT CCCTGTCCAG CAGGAGGATG ATCGGCGCGA TCTGCACGGT GTGGAGCTGG AACTCGGCCT GCCGCTGCGT GAGGGTGACG CAGGCCCCGA CGTGACCCTG GCGGTCCACG GTCGGTTGCT CCCGGCGGAT CCACCCCTCG GCAGTGGCAG CGATGGGCTG GGCGTGGCCG TTCACCTGAG TGACCGACTG GGTGAACGGG GCATCCGGCT GGACGGCTAC CTCGGCCTGG AGCGGGGCGA CGCGGCCCTG CGCGATGGCC CCGGCTACGA GGCCGTCAAC CGCCTGCATT ACGCCAATCG CATCGAATAC CCCTTAGGCG CAGGCTGGGG CGTCGGTGCC GATGTGCGCA CCGTGATCGG CCTCAGTGGC GAAGAGGTGC AGAACCAATT CGCCTTCGTG ATCCGTCCCG GTCTCAGTTA TCGACCGACC GCGAACACCA CCCTGCGCGC CGCTGCAGGG CGCGAGCTGG CCGACCGTGG CGTCGAGCCG GAGTCCACAG TACAGCTCTC ATTGACCCAT CGGCCGCAGG CCCCGGCGCC GCGCCGTGAG CTGCAGGCGC GCCTGGCCGA GCTAGAGGAT CGCCACGAGC GGATGACCCA GGAGCAGACG GGGATCGCCC AGCGTCAGGC CCGGCAGGCG GGACGGCTCT CCGAGCACGG CGAGGTGATC GACCTGGTCA AGCGCCGCGC CGGAACCCTG GAGGTCGAGG TAGTGAACCG CTCCGGTGAA CGCCAGCACG CCAGTGAGGC AGTGGCCCGC CTGGAGCGCC TTGGGCACCA CGTGGTCCGG CGCATGGAGC GTCCGGAGGC ATCGATGCGC GACGCCAGCG TCGTCCAGTA CCGCGAGGCC TACGAAGAGG CCGCGGTGGA ACTCGGTGAG GCACTGCCGG GTGTCCAGGA GGTGTACCGG GCCGATCCGC CGATCGGGCC CGGGGCCGAC GTGCGCTTGA TTGTTGGCGC CGACTTCGGC AGCGATGGGG AGTAA
|
Protein sequence | MLGLSVSMEG WEREPRRVGW QAGLVGVLLA LPVATAALEY RSGGVLEDGA TEHRFGVERF SWAEPPGNDA ESRDDVLRLS PSARFGLAAG YDLRVGLPVQ QEDDRRDLHG VELELGLPLR EGDAGPDVTL AVHGRLLPAD PPLGSGSDGL GVAVHLSDRL GERGIRLDGY LGLERGDAAL RDGPGYEAVN RLHYANRIEY PLGAGWGVGA DVRTVIGLSG EEVQNQFAFV IRPGLSYRPT ANTTLRAAAG RELADRGVEP ESTVQLSLTH RPQAPAPRRE LQARLAELED RHERMTQEQT GIAQRQARQA GRLSEHGEVI DLVKRRAGTL EVEVVNRSGE RQHASEAVAR LERLGHHVVR RMERPEASMR DASVVQYREA YEEAAVELGE ALPGVQEVYR ADPPIGPGAD VRLIVGADFG SDGE
|
| |