Gene Hhal_1520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHhal_1520 
Symbol 
ID4709508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorhodospira halophila SL1 
KingdomBacteria 
Replicon accessionNC_008789 
Strand
Start bp1647660 
End bp1648658 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content65% 
IMG OID639855987 
Producthypothetical protein 
Protein accessionYP_001003089 
Protein GI121998302 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG0489] ATPases involved in chromosome partitioning 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family
[TIGR03018] exopolysaccharide/PEPCTERM locus tyrosine autokinase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00579805 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATTA TCGAGCGGGC GCTGGAGAAG CGCCGAGGCA ACCAGGCACC CGGTGCACAA 
GCGCAAGGTT CGGTCCAGGG GGCCGGACAA CAGGAGGCCG ATCCGGCGGG CGAGACGACC
CAGACGGAAT ATGCGCAGGC CAGGCCCTCC ACGCCGCAGT TCGGACGCGC GCCGCGCACC
GTCGAACGAC CGGCCGACGT CCGGATTGAT TACGGGTGGT TGCGCCATCA GGGGATCCAG
GTTCCAGGAG AGGCGCGCAG CGGGCTTGAA GAGGAGTTCC GCTTGATGAA GCGCCCGCTG
CTCGATAACG CCTTCGGGCG CCATGGCATG CCGGTAGTGG ATAAAGGGCG GTTGATCATG
GTGACCAGTG CCGTCCCCGG GGAGGGTAAG ACCTTCTCCA CGATCAACCT CGCGCTGAGC
ATCGCCATGG AGGTGGATCG CACGGTTCTG GTCGTCGATG CCGACGTGGC ACGTCCGAGC
GTCCCGCGAA CACTTGGATT CGCGGCCGAC AGGGGGCTCA TGGACCTGTT GACCGACTCT
GATCTTCGTC TGCCGGACGT GCTCCTGCGC ACCGATATCC CTGATCTCAG TGTGTTGCCA
GCCGGACGTC CCCACGGTCG TTCGACGGAG TTACTGGCCA GTCAGGGTAT GACCGATCTG
CTGGAGGAGA TCCACGAGCG CTACCCGGAT CGGGTGATCC TCTTCGATTC CCCCCCGCTG
CTCTCGACCA GTGAACCGAG TGTGCTGGCC CGGGAGATGG GGCAGGTGCT CCTCGTGATC
GAGGCTGAGG GTACGGCGCA AACGGCGGTG ATGCGGGCCG CGGAGCTGCT GGAGGGGTGC
GACGTGGTGC TGACCATGCT CAACAAAGCG ACCGGTCATG GAGGGCTGGG ATACAGCGGT
TACGGCTACG GCTACGGTTA CGGTTACGGT AAATACGGTG GGGAGCCCCG GAGCAGCGCC
GCCAAGGAAG CGGATGGCCG TGTCGCCGAG GGTAGCTAG
 
Protein sequence
MSIIERALEK RRGNQAPGAQ AQGSVQGAGQ QEADPAGETT QTEYAQARPS TPQFGRAPRT 
VERPADVRID YGWLRHQGIQ VPGEARSGLE EEFRLMKRPL LDNAFGRHGM PVVDKGRLIM
VTSAVPGEGK TFSTINLALS IAMEVDRTVL VVDADVARPS VPRTLGFAAD RGLMDLLTDS
DLRLPDVLLR TDIPDLSVLP AGRPHGRSTE LLASQGMTDL LEEIHERYPD RVILFDSPPL
LSTSEPSVLA REMGQVLLVI EAEGTAQTAV MRAAELLEGC DVVLTMLNKA TGHGGLGYSG
YGYGYGYGYG KYGGEPRSSA AKEADGRVAE GS