Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1862 |
Symbol | |
ID | 4711231 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 2035584 |
End bp | 2036465 |
Gene Length | 882 bp |
Protein Length | 293 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639856334 |
Product | UspA domain-containing protein |
Protein accession | YP_001003428 |
Protein GI | 121998641 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0589] Universal stress protein UspA and related nucleotide-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.23265 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCTCA GGCAGATCCT TTTCGCCAGT GATCTCTCGC CGCAGGCGGC GCTGGCCGGC GAGCGGGCCG CGCAGCTGGC CGGAGAGACC GGAGCAGGCC TTGGAGCGGT CTACGTCATC GACAGTGATA TCCCCGCGGA GAGTGCCGAT GGGCAACCCC GTGCCGCCGT CCGGCAGACA GCCGAGGCGG AGTTGCAGAA CACGACTCCC GGGCAGTCAG CGTCGCTCCA GGTGCGTTTC GGCAATGTCC TCGGTGAGCT GGCCCAGGCC ATCGAGGAGG AGCAGGCCGA GCTGCTGGTC GTCGGGGCCC ACGGCCAGCA CTATGCGGCT GACTGGTTAC TGGGGACCTC CGCCGAGCAG TTCGTCCGTC ACCTGCCTGT CCCGACCCTG GTGGTCCGTA ATCCGGCAGA GCATCCTTAT CGGCGTATCC TGGTGGCCAC CGACTTCTCG GCCTGCGCCC GTGCGGCCCT GCAGCGGGTG GCCACCTGGT TCCCGGAGGC GGAACTGGAA GTGGTGCATG TCCTGGATAC CCAGGCACTG GAGCAGATGC GGGCAGCGGG CGTGGGCGAG CGCTGGGTCG AACAGCGGTA TGAGCGTCAG CGTGCTGCGG CCGAGTCGCG ACTGCGCGAG GAGTTGACCG CCTGTGGATT GGGTCCGGGG CGGGTCACCG AGACCCTGCT CGCCGGCTAT CCGGCGGAGG CGCTGCTTGG GCGCGTGCGA ACCGGACAGC CGCCCGATCT GGTGGTATTG GGCAACCACG GGCGCGGGCG CTGGGGCGAT TTGCTGCTTG GGAGTGTCGC CAGCCGCGTT TTGCACCAGA CCAGCAGGGA CTTGATGCTG GTTCGTAGCG GCGAGACTTC GGGGGCTCCC ACGGTCGGAT AA
|
Protein sequence | MSLRQILFAS DLSPQAALAG ERAAQLAGET GAGLGAVYVI DSDIPAESAD GQPRAAVRQT AEAELQNTTP GQSASLQVRF GNVLGELAQA IEEEQAELLV VGAHGQHYAA DWLLGTSAEQ FVRHLPVPTL VVRNPAEHPY RRILVATDFS ACARAALQRV ATWFPEAELE VVHVLDTQAL EQMRAAGVGE RWVEQRYERQ RAAAESRLRE ELTACGLGPG RVTETLLAGY PAEALLGRVR TGQPPDLVVL GNHGRGRWGD LLLGSVASRV LHQTSRDLML VRSGETSGAP TVG
|
| |