Gene Information |
Locus tag | Hhal_0020 |
Symbol | |
ID | 4710198 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 21085 |
End bp | 22296 |
Gene Length | 1212 bp |
Protein Length | 403 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 639854476 |
Product | hypothetical protein |
Protein accession | YP_001001617 |
Protein GI | 121996830 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
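The coordinates and lengths in the Gene Information table can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the stop codon is counted in the gene length but not in the protein length. The record states neither convention, although both are consistent with its values. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count, assuming the final codon is an untranslated stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record for Hhal_0020.
length_bp = gene_length(21085, 22296)
print(length_bp)                  # 1212
print(protein_length(length_bp))  # 403
```

Both results agree with the Gene Length (1212 bp) and Protein Length (403 aa) fields.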
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCAA GCGCTATAGC CGGGTGTTGT AACAAGCTCA GGCACAGGAT TGTGGCCGGG GCGGCCCTGG TCACAGCGGG GAACGCTGTG GCCGGGTCGA CCCTGCCCGA ATCCGGTATC CTCGACCCTG GACAGATCGA TGTCAGGGGC ACAGTGGGCT TGAGCTACTT CGGGGCCGGT CAGGGCACCA CTATGGATGT TCTGCCCTCC CTGCGTACGG GGCTCCCCGG TCCGTTCGAT GTGGCGGTTA CGGTGCCTTA TCGAAACGAT CTTGAGCAGG AGCAGTACGC CCTGCGTAGC TCATTCCGCA CCGATTTCTC CTACCGGTTC CTCGATGATG GCCCTCGCCA GGCGGTTCTG GGCGGCTACA TGACCTTCGA TCCCTCGGAT GCTGGGGAAG GTGTAGGGAG CGGGTCGCAC AACTATGGGG TCAGCGCCGA CTACCGCGCA GAGGACATCG TGGGCACCGG GACCTTCTAT CTGCGCGGGG CCGTGGAACG GCTGGACCAT CGCGACGATC CCGGCGCCGA TGAGGTCTCC TACCGGCTCG CCAATCGCCT CACCGCCGAG ATCGGTCTAG GGCTGGATGT GGATGTCGAC GCCGAGCCCT ACTTCGGTCT TCGAGGTACC CAAGGGCTAG GCTCGACGAC ACGTGACCAG CAGAGCTTGT CCTTTCGTCC GGGCATTCGG TTTCGGTACA CCCCTAACAG CGAGGTCCAG TTCCTGGCGC AATTCGATAC GGTACAGCGT AACGCCGAAC CGGAGCGGGC GATCTTCGTG ACCTGGACCT ACCAGCACCG CCCTCCGGAG CGTGATCTGG ACAGCCTGCG GGAGCGTATC TCCGCCAATG AGATGGCCAT CGAGCGCCTG GATCGACGGG TGAGCGACAT CGAACGCCGG CTGTTAACAC GTACCGAGGT ACCGGAACCG GAGACGAGGG AGGGGGTGGT CGTGCTCAAT CACTCCGGGA TCCCGGAGTT GACCACCTTG GTGGTGGACA CCCTGGAGAA CCTCGATCTC TCTGTCGGGG ACACGCGTGA CGAAGACGAC GTCGCCCGTC GGGATCGCAC CAAGATCCTC TACCGGCCGG GGCATGCGGA GCGGGCTCGG GAGATTGCCC GCGCGCTGCC GGGGAATCAG CTGATCGAGC AGCGTGATGA CATGCCGAAT CAGGCCGAGA TCGCCGTCCT GATCGGCTTC GATCTGGAGT GA
|
Protein sequence | MKASAIAGCC NKLRHRIVAG AALVTAGNAV AGSTLPESGI LDPGQIDVRG TVGLSYFGAG QGTTMDVLPS LRTGLPGPFD VAVTVPYRND LEQEQYALRS SFRTDFSYRF LDDGPRQAVL GGYMTFDPSD AGEGVGSGSH NYGVSADYRA EDIVGTGTFY LRGAVERLDH RDDPGADEVS YRLANRLTAE IGLGLDVDVD AEPYFGLRGT QGLGSTTRDQ QSLSFRPGIR FRYTPNSEVQ FLAQFDTVQR NAEPERAIFV TWTYQHRPPE RDLDSLRERI SANEMAIERL DRRVSDIERR LLTRTEVPEP ETREGVVVLN HSGIPELTTL VVDTLENLDL SVGDTRDEDD VARRDRTKIL YRPGHAERAR EIARALPGNQ LIEQRDDMPN QAEIAVLIGF DLE
|
| |
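The Gene sequence and Protein sequence fields can be related to each other with a short sketch. It assumes the gene sequence is the coding strand, written 5' to 3' in space-separated blocks of ten bases. Translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons, so a standard codon table is built here. All helper names are hypothetical, and alternative start codons are not handled.

```python
BASES = "TCAG"
# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three and last two blocks of the gene sequence in the record.
print(translate("ATGAAGGCAA GCGCTATAGC CGGGTGTTGT"))  # MKASAIAGCC
print(translate("GATCTGGAGT GA"))                      # DLE
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence. Passing the full gene sequence to `gc_content` gives a value to compare with the GC content field.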