Gene Information |
Locus tag | Hhal_1497 |
Symbol | |
ID | 4709150 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 1617924 |
End bp | 1619225 |
Gene Length | 1302 bp |
Protein Length | 433 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639855964 |
Product | hypothetical protein |
Protein accession | YP_001003066 |
Protein GI | 121998279 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0439] Biotin carboxylase |
TIGRFAM ID | |
|
|
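The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1617924
END_BP = 1619225
GENE_LENGTH_BP = 1302
PROTEIN_LENGTH_AA = 433


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks pass for the values listed in this record.
    assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
    assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
    print("coordinates, gene length and protein length agree")
```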
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACGATC GCAAGCATAT CTACGTCGTC GGACTCGACG ATTTCCACCT TGAACACCTC AAGACCGTGC GCGGCAGCGA GGGGTACCTA TTCCATCCCC TGGCCGAATA CAGCGCCATC GTGCTCCCTG AGCGCTACGA TATCCCGGCC ATCCTCGATC ATGCGCGTCA AACGCTCGAT CTGGCCCCGC GCGTGGACGC GATCATCGGC CACTGGGACT TTCCCACGAC TTCAATCGTC CCCGTCCTAC GCCGGGAGTA CGGCCTTCCC ACGCCGTCCC TGGAGAGCAT CCTGCTTTGC GAGAACAAGT ATTGGAACCG CCTGGCCTGT GAAGCGTCCG TGCCGGAGTG CACCCCTCCG TTCCAGGCCA TCGATCCCTA CGGCGAGGAT CCGCTCGGCG CCCTCAAGCT CGGCTACCCG TTCTGGCTGA AACCCGCGGT GGCCTTCTCG TCCCTGCTCG GTTTCCGCGT CGAGGACGAC GCGCAGTTCC AGGATGCCAT CGCTGCCATC GCCCAGGGCA TCCCCACCTT CGCCGAGCCG TTCCAGGCGT TCACCGACCT TGTTGAAAAC CCGAAACGCC TGCCGCGGAC GGGGAGCGGC GCCACGGCCC TGGCCGAGGG CATCATCCAG GGCCGCCTCT GCACGCTCGA AGGCTACGTC TACAACGGCG AGGTGGTGAC CTACGCCATC CTCGACTCCC TGCGCGGGGC CAACCAGGTC AGCTTCGTCA GCTACCAGTA CCCGTCCAGC CTGCCGATAC CAGTCCAGGA GCGGATGAAG GACTACGCGC GCCGCCTGCT GACGCACATC GGCCTCGATC AGACCGCGTT CAACATGGAG TTCTTCTGGG ACGAGGATGT CGACAAGATC TGGTTGCTTG AGGTCAACCC ACGGATCTCC AAGTCGCACT GCCCGATCTT CGAGATCGCC ACGGGCAGCT CCCACCACGA GGTCGCCATC GACGTAAGCC TGGGCCGTCG ACCCCAATTT CCTCGCGCCG AGGGGCGCTT CCCCATGGCG GCGAAATTCA TGCCGCGGGT GTACGGCGAC GCTCGGGTCC TGCGCATACC CGACCCGGCG CAGATCCACG CTCTGCAGCT CACCCACCCG GAACTCTCCA TCCACATTGC AGTGGCCGAG GGTATGCAGC TCTCCGAACT ACGGGGCCAG GACAGCTACA GCTACGAGAT CGCCGAGCTG TTTATCGGCG GCGAGGACGA ACAGCACCTC CACGACAAGT TCCGGACGAT CATGAGGCAG CTCGACTTCC GCTTCTCCGC GCCGCTGCCA ACCAACTACT GA
|
Protein sequence | MDDRKHIYVV GLDDFHLEHL KTVRGSEGYL FHPLAEYSAI VLPERYDIPA ILDHARQTLD LAPRVDAIIG HWDFPTTSIV PVLRREYGLP TPSLESILLC ENKYWNRLAC EASVPECTPP FQAIDPYGED PLGALKLGYP FWLKPAVAFS SLLGFRVEDD AQFQDAIAAI AQGIPTFAEP FQAFTDLVEN PKRLPRTGSG ATALAEGIIQ GRLCTLEGYV YNGEVVTYAI LDSLRGANQV SFVSYQYPSS LPIPVQERMK DYARRLLTHI GLDQTAFNME FFWDEDVDKI WLLEVNPRIS KSHCPIFEIA TGSSHHEVAI DVSLGRRPQF PRAEGRFPMA AKFMPRVYGD ARVLRIPDPA QIHALQLTHP ELSIHIAVAE GMQLSELRGQ DSYSYEIAEL FIGGEDEQHL HDKFRTIMRQ LDFRFSAPLP TNY
|
| |
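The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that clean-up and of a GC-content calculation; `strip_blocks` and `gc_percent` are hypothetical helper names, and the short string in the usage section is only the first two blocks of the gene sequence, not the whole gene.

```python
def strip_blocks(seq: str) -> str:
    """Remove the whitespace that separates the 10-character display blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    bases = strip_blocks(seq)
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # First two display blocks of the gene sequence, used only as a demo input.
    fragment = "ATGGACGATC GCAAGCATAT"
    print(len(strip_blocks(fragment)), "bases in fragment")
    print(f"GC content of fragment: {gc_percent(fragment):.1f}%")
```

Applied to the full gene sequence, `len(strip_blocks(...))` can be compared with the Gene Length field and `gc_percent(...)` with the GC content field.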