Gene Information |
Locus tag | Hhal_1520 |
Symbol | |
ID | 4709508 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1647660 |
End bp | 1648658 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 639855987 |
Product | hypothetical protein |
Protein accession | YP_001003089 |
Protein GI | 121998302 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG0489] ATPases involved in chromosome partitioning |
TIGRFAM ID | [TIGR01007] capsular exopolysaccharide family; [TIGR03018] exopolysaccharide/PEPCTERM locus tyrosine autokinase |
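The coordinate and length fields in this record are related by simple arithmetic. The sketch below checks that relationship, assuming the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAG). The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1647660
END_BP = 1648658
GENE_LENGTH_BP = 999
PROTEIN_LENGTH_AA = 332


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, both comparisons hold for the values listed: the span covers 999 bp, and 999 / 3 = 333 codons, of which 332 encode residues.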
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.00579805 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATTA TCGAGCGGGC GCTGGAGAAG CGCCGAGGCA ACCAGGCACC CGGTGCACAA GCGCAAGGTT CGGTCCAGGG GGCCGGACAA CAGGAGGCCG ATCCGGCGGG CGAGACGACC CAGACGGAAT ATGCGCAGGC CAGGCCCTCC ACGCCGCAGT TCGGACGCGC GCCGCGCACC GTCGAACGAC CGGCCGACGT CCGGATTGAT TACGGGTGGT TGCGCCATCA GGGGATCCAG GTTCCAGGAG AGGCGCGCAG CGGGCTTGAA GAGGAGTTCC GCTTGATGAA GCGCCCGCTG CTCGATAACG CCTTCGGGCG CCATGGCATG CCGGTAGTGG ATAAAGGGCG GTTGATCATG GTGACCAGTG CCGTCCCCGG GGAGGGTAAG ACCTTCTCCA CGATCAACCT CGCGCTGAGC ATCGCCATGG AGGTGGATCG CACGGTTCTG GTCGTCGATG CCGACGTGGC ACGTCCGAGC GTCCCGCGAA CACTTGGATT CGCGGCCGAC AGGGGGCTCA TGGACCTGTT GACCGACTCT GATCTTCGTC TGCCGGACGT GCTCCTGCGC ACCGATATCC CTGATCTCAG TGTGTTGCCA GCCGGACGTC CCCACGGTCG TTCGACGGAG TTACTGGCCA GTCAGGGTAT GACCGATCTG CTGGAGGAGA TCCACGAGCG CTACCCGGAT CGGGTGATCC TCTTCGATTC CCCCCCGCTG CTCTCGACCA GTGAACCGAG TGTGCTGGCC CGGGAGATGG GGCAGGTGCT CCTCGTGATC GAGGCTGAGG GTACGGCGCA AACGGCGGTG ATGCGGGCCG CGGAGCTGCT GGAGGGGTGC GACGTGGTGC TGACCATGCT CAACAAAGCG ACCGGTCATG GAGGGCTGGG ATACAGCGGT TACGGCTACG GCTACGGTTA CGGTTACGGT AAATACGGTG GGGAGCCCCG GAGCAGCGCC GCCAAGGAAG CGGATGGCCG TGTCGCCGAG GGTAGCTAG
|
Protein sequence | MSIIERALEK RRGNQAPGAQ AQGSVQGAGQ QEADPAGETT QTEYAQARPS TPQFGRAPRT VERPADVRID YGWLRHQGIQ VPGEARSGLE EEFRLMKRPL LDNAFGRHGM PVVDKGRLIM VTSAVPGEGK TFSTINLALS IAMEVDRTVL VVDADVARPS VPRTLGFAAD RGLMDLLTDS DLRLPDVLLR TDIPDLSVLP AGRPHGRSTE LLASQGMTDL LEEIHERYPD RVILFDSPPL LSTSEPSVLA REMGQVLLVI EAEGTAQTAV MRAAELLEGC DVVLTMLNKA TGHGGLGYSG YGYGYGYGYG KYGGEPRSSA AKEADGRVAE GS
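The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a display string could be normalized and its GC fraction computed is shown below; the helper names are hypothetical, and the example input is only the first two blocks of the gene sequence, not the full gene.

```python
def normalize_sequence(display_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(display_text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not sequence:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in sequence if base in "GC")
    return gc_count / len(sequence)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence in this record.
    sample = normalize_sequence("ATGAGCATTA TCGAGCGGGC")
    print(len(sample), gc_fraction(sample))
```

Applying the same two functions to the full gene sequence gives a value that can be compared against the GC content reported in the Gene Information fields.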