Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_2355 |
Symbol | |
ID | 4709078 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2581473 |
End bp | 2582933 |
Gene Length | 1461 bp |
Protein Length | 486 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 639856830 |
Product | hypothetical protein |
Protein accession | YP_001003920 |
Protein GI | 121999133 |
COG category | [S] Function unknown |
COG ID | [COG1690] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.314227 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTCAAGC GTATCGAGGC CGCTGAAGGT CTGGAGTTGG AACGGCTCGA CAGTTGCCGC TGGCGGTTGC CGCGGCAGGG GCGGATGCAG GTAGACGGGT TGATCTTCGC CAACGACGCG TTGATTGAGG ACATCCGCGA TACCGAGGCC GTGCGCCAGG TGGCCAATGT GGCCTGCCTC CCCGGGGTGG TCGGGCGGTC CATCGGCATG CCGGATATCC ATTGGGGGTT CGGCTTCCCC ATCGGAGGCG TGGCCGCCTT CGATCCGGAC CAGGGGGGCG TGATCTCGCC CGGAGGGGTG GGCTACGACA TCAACTGCGG AGTCCGGCTG CTGCGGACGC CGCTACAGGC CGAGGACCTG GGCGCCCACC TGCCGCGCCT GATGGATCGA CTCTTCGAAC GCATCCCGGC CGGCATGGGC CGTGGGTACG GCGACACCCT GCTGCGCAAC CGGGATATGC GCCGGTTGCT GCGCGAGGGG GCGGCGTGGG CCGTGGAGGT GGGGCTGGGC GAGCCCGAGG ATCTGGCCCG GATCGAGGAC CGTGGGTGCC TGCCCGGCGC CGACCCCGAG GCGGTCAGCG ATCGGGCCAT CCAGCGCGGA CGGGATCAGG TCGGTACGGT GGGATCCGGC AACCACTTCA TCGAGATCGG CTGTGTGGAC GATGTCTACG ACGAAGCCGC TGCCCGCCGC CTGGGGCTCG AGGCGGGGAC GCTGACCGTG ATGATCCACT CCGGGTCACG CGGGCTCGGT CACCAGGTCT GCGATGACTT TCTGGTGACC ATGGAGCGGA TCACCGGGCG CAACGGCATC GAGCTGCCCG ACCGTCAGCT GGCCTGCGCG CCGCTGAGCT GCTCCGCCGC CCGGGACTAC CTGGGGGCCA TGCAGGCCGC CGCCAACTTC GCCTACGTCA ACCGCCAGGC GATGACCCAG CAGGTGCGCC GGGTCTTCGC CGAGGTGCTG GGGGAGGAGG CGCACCTGGA GCTGGTCTAC GACGTCTCCC ACAACATCGC CAAGTTCGAG CGCCATCGGG TCGACGGTGA GGAGCGCGAG GTCTGCGTCC ACCGCAAGGG CGCCACCCGC GCCTTCCCGC CCGGCCACCC GGAACTCCCC GAGGATCTGC GCGGGCTCGG GCAGCCGGTG CTGCTGCCCG GCGACATGAC CCGCTACTCC TACGTCCTGC TCGGCACCCA GGGCGCCTAC GCCGAGACCT TCGGCTCCTG CGCCCACGGC GCCGGACGCC GTCTCAGCCG GCGCCAGGCC AAACGCGCCG CTGAGGGGCG GGACTTGGAT GCCGAGCTGG CCGAGGCTGG TATCGAGGTG CGCGCCTCGT CCCGGCAGAC GGTGGCCGAG GAGCTGGCCG AGGCGTACAA GGACGTGTCC GATGTGGTGG ACGTGGTGGC CCACGCCGGC ATTGGCCGCC GGGTGGCCCG CCTGCGTCCG CTGGGGGTGC TCAAGGGGTG A
|
Protein sequence | MVKRIEAAEG LELERLDSCR WRLPRQGRMQ VDGLIFANDA LIEDIRDTEA VRQVANVACL PGVVGRSIGM PDIHWGFGFP IGGVAAFDPD QGGVISPGGV GYDINCGVRL LRTPLQAEDL GAHLPRLMDR LFERIPAGMG RGYGDTLLRN RDMRRLLREG AAWAVEVGLG EPEDLARIED RGCLPGADPE AVSDRAIQRG RDQVGTVGSG NHFIEIGCVD DVYDEAAARR LGLEAGTLTV MIHSGSRGLG HQVCDDFLVT MERITGRNGI ELPDRQLACA PLSCSAARDY LGAMQAAANF AYVNRQAMTQ QVRRVFAEVL GEEAHLELVY DVSHNIAKFE RHRVDGEERE VCVHRKGATR AFPPGHPELP EDLRGLGQPV LLPGDMTRYS YVLLGTQGAY AETFGSCAHG AGRRLSRRQA KRAAEGRDLD AELAEAGIEV RASSRQTVAE ELAEAYKDVS DVVDVVAHAG IGRRVARLRP LGVLKG
|
| |