Gene Hlac_0901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0901 
Symbol 
ID7401272 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp891568 
End bp892728 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content67% 
IMG OID643707966 
Productcell division protein FtsZ 
Protein accessionYP_002565569 
Protein GI222479332 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG0206] Cell division GTPase 
TIGRFAM ID[TIGR00065] cell division protein FtsZ 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.363847 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACTCTA TTGTAGAGGA CGCCATCGAC GAAGCCGAGG AATCCCCGGT AGATGACTCC 
GGGGAGGCCG GCGCCGGCGA GAACGGCGCA ACCGCCGGAG CCCCGCCTCA GACCGGAACG
ATGACCGACG ACGAGCTGCA GGACGTCCTC CAGGACCTCC AGACGAACAT CACCGTCGTC
GGCTGCGGAG GCGCCGGCGG CAACACGGTC AACCGGATGA CCGAGGAGGG GATCCACGGG
GCGAAGCTGG TCGCGGCCAA CACCGACGTT CAGCACCTCG TCAACATCGA AGCCGACACG
AAGATCCTTA TGGGCCAGCA GAAGACGCAA GGTCGCGGCG CCGGCTCCCT CCCGCAGGTC
GGTGAGGAGG CCGCCATCGA GTCCCAAGAG GAGATCCAGG ACGCCATCGA CGGCTCCGAC
ATGGTGTTCG TCACCGCCGG GCTCGGCGGC GGCACGGGGA CCGGGTCCGC CCCGGTCGTC
GCGAAGGCCG CCCGCGAGTC GGGCGCCCTG ACCATCGCCA TCGTCACGAC CCCCTTCACT
GCCGAGGGCG AGGTCCGACG AACGAACGCC GAGGCCGGCC TCGAACGGCT CCGCGACGTG
AGCGACACCG TCATCGTCGT CCCCAACGAT CGCCTGCTCG ACTCGGTCGG GAAGCTCCCC
GTTCGGCAGG CGTTCAAGGT GTCCGACGAG GTCCTAATGC GCTCGGTGAA AGGTATCACG
GAGCTCATTA CGATGCCCGG ACTCGTCAAC CTCGACTTCG CCGACGTTCG CACCGTCATG
GAGAAGGGCG GCGTCGCGAT GATCGGGCTC GGCGAGTCCG ACTCCGACTC GAAGGCGCAG
GACTCGGTGA AATCGGCGCT CCGCTCGCCC CTGCTCGATG TCGACATCTC CAGCGCGAAC
TCCGCGCTGG TCAACGTCAC CGGCGGGACC GACATGTCCA TCGAAGAGGC AGAGGGCGTC
GTCGAGGAGA TCTACGACCG GATCGACCCC GACGCCCGGA TCATCTGGGG AACCTCCGTT
GACGAGGAGC TGGAAGGCGA GATGCGGACC ATGATCGTGG TGACCGGCGT CGAGTCGCCG
CAGATCTACG GCCGCAACGG CGAATCGGCC GAGGGAGAAG GAGAAACGCC CGAGATGGAA
GACATCGACT ACGTGGAGTA G
 
Protein sequence
MDSIVEDAID EAEESPVDDS GEAGAGENGA TAGAPPQTGT MTDDELQDVL QDLQTNITVV 
GCGGAGGNTV NRMTEEGIHG AKLVAANTDV QHLVNIEADT KILMGQQKTQ GRGAGSLPQV
GEEAAIESQE EIQDAIDGSD MVFVTAGLGG GTGTGSAPVV AKAARESGAL TIAIVTTPFT
AEGEVRRTNA EAGLERLRDV SDTVIVVPND RLLDSVGKLP VRQAFKVSDE VLMRSVKGIT
ELITMPGLVN LDFADVRTVM EKGGVAMIGL GESDSDSKAQ DSVKSALRSP LLDVDISSAN
SALVNVTGGT DMSIEEAEGV VEEIYDRIDP DARIIWGTSV DEELEGEMRT MIVVTGVESP
QIYGRNGESA EGEGETPEME DIDYVE