Gene Hlac_0340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0340 
Symbol 
ID7399732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp362377 
End bp363516 
Gene Length1140 bp 
Protein Length379 aa 
Translation table11 
GC content72% 
IMG OID643707404 
Productprotein of unknown function DUF87 
Protein accessionYP_002565014 
Protein GI222478777 
COG category[R] General function prediction only 
COG ID[COG0433] Predicted ATPase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.0311557 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATGTGC TCGGACGCGA CACCGGCTCG GATGATGGGA CCGGGTCAGC CGGCACGAAC 
GGCGAGATCG GCGACGAACG GCTCCCAACC GTCCAGCTCG GGTCGTTCCT CGCACGCGAC
GGCAGCGCCG GGGCCGCGGT CGGGATTGAC GCCGACAGCC CGCACGCCGG CGTCGTCTTC
GGTAAGCGGG GCACCGGCAA GTCGTACACA CTCGGCGTCC TTGCGGAGGG GCTCGCGGCG
GCCAGCGGCG TCGCGCCGGT TGTGGTCGAT CCAATGGGCG TCTTCGACGG GCTTCGGGCG
ACTGGCGGAC AGGTCGTCGA ACCACGGGTC CGCCCAGCGG CGATTCCCCC AGAGGCGTGG
CCGGACCTGC TCGGGCTCGA CCCGGCGAGC GGGCCGGGAA GTCTGGTGTG GCGCGTCGTC
GCTGACGCCC TCAAATCCCC TGAGGCGGGG GGTTCGGGTG AATCGGACGA ATCGCCGTCG
CTCGCGACGC TCCGCGATCG AGTCGACGCC GCAGACGCGC CCGCTGCAGA TCGCCGCGCG
GCCGCAAACC ACCTGCGGCT CGCGGAGTCG TGGGGCGTGT TCGACGCGGA CGCACCGCCG
ACCGTCCGGC TCGTCGGTGG CGGGGAGCCG ACCGTACTCG ATCTCGCCGG CGTTCCGGAG
GCAGCCGCGG CTGCAGTCGT CAGGGCGGTC GCTCGCGGGC TCTACGACGC CCGGATCGAC
GGCGACCTCG ATCGGCTCCC GTGGCTCCTC GTCGACGAGG CGCACGCTTT CTTCGGCGGC
GTCGCTGATC CCGCGCTCCG AACGCTCCTG ACCCGTGGTC GCGCACCCGG CGTCTCGCTG
GTCTGTGCGA CGCAGCGACC CGGTGCGCTG CCGAGCGTCG CCGTCTCGCA GTCGGACCTG
CTCGTCGCCC ACCGGCTCAC CGCCGAGCGC GACCTCGACC GGCTCGCCGA GGCGGAGGCG
ACCTACCTCG CCGGCGACCT CGCTTCCCGG CTCCCGACTG AAACCGGCGA GGCGCTCGTC
GTCGACGACG CGACGGAGAC GGCTCACACA GTTCGGATCC GAGAGCGACG GACTCCACAC
GGTGGCGGAA GTCCCAGCGC AAGCGGGACC GCCGCCGCGA AGTCCGAAGA CCCAAGATAA
 
Protein sequence
MYVLGRDTGS DDGTGSAGTN GEIGDERLPT VQLGSFLARD GSAGAAVGID ADSPHAGVVF 
GKRGTGKSYT LGVLAEGLAA ASGVAPVVVD PMGVFDGLRA TGGQVVEPRV RPAAIPPEAW
PDLLGLDPAS GPGSLVWRVV ADALKSPEAG GSGESDESPS LATLRDRVDA ADAPAADRRA
AANHLRLAES WGVFDADAPP TVRLVGGGEP TVLDLAGVPE AAAAAVVRAV ARGLYDARID
GDLDRLPWLL VDEAHAFFGG VADPALRTLL TRGRAPGVSL VCATQRPGAL PSVAVSQSDL
LVAHRLTAER DLDRLAEAEA TYLAGDLASR LPTETGEALV VDDATETAHT VRIRERRTPH
GGGSPSASGT AAAKSEDPR