Gene Information |
Locus tag | Hhal_0741 |
Symbol | |
ID | 4711337 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 829445 |
End bp | 830503 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639855205 |
Product | hypothetical protein |
Protein accession | YP_001002324 |
Protein GI | 121997537 |
COG category | [S] Function unknown |
COG ID | [COG4913] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
|
|
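The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp, has_stop=True):
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    last codon is a stop codon that does not appear in the protein.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


length_bp = gene_length(829445, 830503)
print(length_bp)                  # 1059
print(protein_length(length_bp))  # 352
```

Both results match the Gene Length (1059 bp) and Protein Length (352 aa) fields listed in the record.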
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCGACC TGCGTGCCGA GCACGCCGGC CTGGAAGAGG AGCTGGAGTC GCTGCGTCAG CGCCGCTCCA ACATCCCGGC GCGCATGCTG GAGCTGCGCC AGCGCCTGTG CCAGGAGCTG AGCCTCGATG AGGAGGCCGT CCCCTTCGCC GGTGAGCTCC TCCAGGTCCA TGAGCAGGAG ACGGACTGGG AGGGGGCCAT CGAGCGGCTG CTCCACAACT TCGGCCTCTC GCTCCTGGTC CCGGAGGCGC ACTACCGCGC CGTGGCCGGC TGGGTGGATC GCAGCCACCT GCGTGGCCGG CTGGTCTACT ACCGGGTGCG CGAGCCGCGC AGCAGTGAGC CGCCCGCGCT GCACCCCGAG TCCCTGGTGC GTAAGATCGC CATCCGTCCC GACTCGGCGT TCTACGCCTG GCTGGAGCAG GAGCTGGGTC GGCGCTTCGA CTACGCCTGC ACCCGGGATC TGGAGACCTT CCGGCGCGAG GATCGCGCCA TCACCCCGGC CGGGCAGATC AAGGCGGGCG GGGATCGCCA CGAGAAGGAC GACCGCCACC GCATCGACGA CCGCTCCCGC TTCGTCCTCG GCTGGTCCAA CGAGGCGAAG ATCGCCGCCC TCCAGGAGGA CGACCTGCCC CGCTTCGAGG CCCGGTTCAA GGAGCTGCTC AACGAGAACA CCATCCGCGA GATCGCCAAC TTCAACGCCC AGCTCAACAA AGAGCGCGCG CAGATCCGCG AGCGCATCGC AACCATCAAC GCCTCGCTGT TCGATATCGA CTACAACTCG GGGCGCTACA TCGAGCTGGT CGCCGACACC ACCACCGACC CCGAGGTCCG CGACTTCCGC GAGCAGCTCC GGGCCTGCAC CGAGGATACG GTGACCGGCT CCGAGGACGC CCAGTACAAC GAGCGCAAGT TCCTCCAGGT CAGGGCGATC ATCGAGCGCT TCGTCGCCAG TGTCGGCTTC GTCCACGGCG AAGGCGGGTG CTTCTCGCTG CTGCGCCATC TGAGCATCGA GGCGTACCAC GCCGAAAAGG CCATCCGGCG CGCGGCAACC AGCGGATGA
|
Protein sequence | MRDLRAEHAG LEEELESLRQ RRSNIPARML ELRQRLCQEL SLDEEAVPFA GELLQVHEQE TDWEGAIERL LHNFGLSLLV PEAHYRAVAG WVDRSHLRGR LVYYRVREPR SSEPPALHPE SLVRKIAIRP DSAFYAWLEQ ELGRRFDYAC TRDLETFRRE DRAITPAGQI KAGGDRHEKD DRHRIDDRSR FVLGWSNEAK IAALQEDDLP RFEARFKELL NENTIREIAN FNAQLNKERA QIRERIATIN ASLFDIDYNS GRYIELVADT TTDPEVRDFR EQLRACTEDT VTGSEDAQYN ERKFLQVRAI IERFVASVGF VHGEGGCFSL LRHLSIEAYH AEKAIRRAAT SG
|
| |
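The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It assumes the gene sequence is shown in coding orientation (it begins with ATG even though the strand is "-"), and it uses the standard codon assignments, which translation table 11 shares with the standard code (the two tables differ only in permitted start codons). The helper names are illustrative, and only the first 30 bases of the gene are used as input.

```python
# Standard codon assignments with bases ordered T, C, A, G.
# Translation table 11 uses these same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon.

    Only unambiguous A/C/G/T bases are handled; any trailing partial
    codon is ignored.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
prefix = "ATGCGCGACC TGCGTGCCGA GCACGCCGGC"
print(translate(prefix))              # MRDLRAEHAG
print(round(gc_fraction(prefix), 3))  # 0.767
```

The translated prefix matches the first ten residues of the protein sequence above. Applying `gc_fraction` to the full gene sequence would be one way to check the reported GC content.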