Gene Hlac_3355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3355 
Symbol 
ID7402210 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp112239 
End bp113375 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content47% 
IMG OID643709906 
Producthypothetical protein 
Protein accessionYP_002567472 
Protein GI222481236 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTATCCG GAAGACAAAT ATCCATCAGG AACAAGCGAT CACTGGCTAC ATCGGTGATC 
CGCTTCGGGG CCATCGCTAT AATCGTATCT GCCCTGTTTG TCTTCCTAAA CCAGATCCTG
GACGTGAGCT GGCAGATCCT CACAATAGCC GCAACTGTGG TGATTCTCCT GATCTTAGGA
GGGATGTGGA AACGAGGCTT CTTCCTATCT CCTTGGTCAC TGCAACTGAG GTCGCTTTAC
CAGAAATACA GTTACTCCGG GTTTTTACCG CTTCTTAGAG TTTTCAAATC TCGAGAGCGT
CGGAGAGTGG AAATTCTGAA AGGGCTTTTC CTAGTTGCTG CTGCCGTAAT CGTGGGAGCC
TCGATCTATC TCCTCCATTC TGCCCTGCTA AATGAGTTCA CAGTCAAAGT CCAAAGCGGT
ATACTCGGAA CAGCCTGGCA GGTACACGCC GTGATTATCG GGTTTAGCTT CGTTGCTTTG
ACTTTTGTCT GGGAAGAGAT CTACAGCAAC TCTTTGAGCG ACGAATTAAC AAGACTGTTC
GTAGAGGACA TTGGCTCAAT CTGGACAGTT ACCTTTGTCT TTGGCTCCAA CCTACTTCTT
GGAGTCATAG CCTTCACACA ATCTTCAGCA GAAAATGCAG GTCTTCTAGC GGTCTACATT
ACTGCTGTAC TATTCGTCTC ATCTATAGCA TCCGTAGCCG GACGGTTCTT AGACGCATTA
GATCTCCTGT TCTACACAGA TATAGACGAA GAGGTGAAAG AATACGCAAA GGCGGATTTA
GATAAAGAGC TCATAAGAGA AAGTTCCACC CCGAATAAAG TCCTCTCGGA GGCTATACGC
AGCTTCGAAC AGGTATCTAT AGGTATGCCA AACTTCGGAC TGGAACAGAC CACAATCTCT
TCAAGAGATA TCGAGAAACG AGGGGTAATC ACAGATCTCA ACTTGAAAAG GATGGAAAAT
ATCTCTGAGA TTGTTGACCG GGAACGCGCG GTCAGTATAA GCAAGAACCC GACCGTAGGC
ATGTCTCTCG CTGAAGATAC GACAGTTCTC TCTTTAGAAG GAGACATCGC TGATGAGACA
GTACAAGAGC TCACGGAACA ACTGCGTCGG GGACTGCGAA CACGTAGGGA AAACTGA
 
Protein sequence
MVSGRQISIR NKRSLATSVI RFGAIAIIVS ALFVFLNQIL DVSWQILTIA ATVVILLILG 
GMWKRGFFLS PWSLQLRSLY QKYSYSGFLP LLRVFKSRER RRVEILKGLF LVAAAVIVGA
SIYLLHSALL NEFTVKVQSG ILGTAWQVHA VIIGFSFVAL TFVWEEIYSN SLSDELTRLF
VEDIGSIWTV TFVFGSNLLL GVIAFTQSSA ENAGLLAVYI TAVLFVSSIA SVAGRFLDAL
DLLFYTDIDE EVKEYAKADL DKELIRESST PNKVLSEAIR SFEQVSIGMP NFGLEQTTIS
SRDIEKRGVI TDLNLKRMEN ISEIVDRERA VSISKNPTVG MSLAEDTTVL SLEGDIADET
VQELTEQLRR GLRTRREN