Gene Hlac_3410 details


Gene Information

Locus tag: Hlac_3410
Symbol:
ID: 7402258
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012030
Strand:
Start bp: 159488
End bp: 160684
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 56%
IMG OID: 643709953
Product: Pyrrolo-quinoline quinone
Protein accession: YP_002567519
Protein GI: 222481283
COG category: [S] Function unknown
COG ID: [COG1520] FOG: WD40-like repeat
TIGRFAM ID:
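
The coordinate and length fields in this table can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon, which does not appear in the protein. The function names `gene_length` and `protein_length` are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of residues, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Gene Information fields of this record.
length_bp = gene_length(159488, 160684)
print(length_bp)                  # 1197
print(protein_length(length_bp))  # 398
```

Under these assumptions both results agree with the Gene Length and Protein Length fields reported for Hlac_3410.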


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTGCAGG GATCCGACGA CGATTCCAAC ACAGACGCTC CTAGCCTATA CCGAACCAAG 
TACGATTGGT GTGTTCCAGG AGATCTGGAA GTCGACAGTG TGGTAGGGGA TACAGTACTC
GGTCGGAGAG ATATTACCGA ACGCAGATCG GAGACGAGGG AAGCTATCGC CCTGAGTGCT
GAGACCGGAT CGATTACATG GAGCTATGAA ACCGAACGAG ACGTTGATCA CTTTACTGCC
GTCACAGTTG ACGATGGAAT ATACCTCACC CGGGGTGGCC CCCAAGACGG CAACGGAGTG
GTTGCTCTCA ATATGGACGG CACGAAGCGC TGGATCGTGA ACACTGCAGT CGGTTACGAG
CGTCCACGTG TAGCCAACGA CACAGTCTAC GTCGCCGACA GCCGGGTGTA CGCATTTGAT
GTAGCTTCAG GAGATCTACG CTGGGAGTAC AACCTACAGG GTAGTAGATT GTCGCCGACA
ATCGTCGACG TCACCGACAC TGTCGTTGTC GAAACGGGCT ACACAGTCCT CGGACTCGAT
CCGACTGCGG GAACCGTGCG GTGGGAGTTC GAAACCGGCG ATCAAGTGAT AGGGGAAGTT
CAGCTCTCGG ACGGAATCAG TTACGTTATG ACCAGTGATC GGATCGCTGC GAGTTCGGAC
GGTGCCGAGC AGTGGCGCAC AGAATTCGAC ACTACTACTG CACAAACAGG CGAGAGTATT
GTCGGGACCA CGTCTGATCG TGTGTTCGTC TTTACAAGCG GTGATGACGG AGACGCCCAC
CAACTCCAGG CTTTCGAGGT TGCAACCGGC GAGCGATCCT GGACCTCCGA GCCGATCCAA
CCGATTGTCC AACCAGATTT TGAATGGACT CCGAGGACAA CTGTATACGG AGACATCGCC
TATCTCGGTG GCGAGACGCT CCGTGCAATA GACGCGACAA CTGGAGACGA ACTCTGGCAG
GCGTCAGTCG GCGATGGGCC GATCAAAACG CTGACCGTCA TCAATAACGG CGCTGAGGCC
GATCACACCG TGTTCGTTCA AGGGGGCGAG ACACAGCTCG CAACGTTCAC CCCAGACGGC
GAGCAGACGT GGAGCCACTC GGTCAACGCC CCTGATCGGG TCTCTGCGAT AGGGGAGTAC
GTGTTCGTTG GGACGGACAA CGAGATCTGC TCGCTGAATC GACTGGAAAA GTCGTAA
 
Protein sequence
MLQGSDDDSN TDAPSLYRTK YDWCVPGDLE VDSVVGDTVL GRRDITERRS ETREAIALSA 
ETGSITWSYE TERDVDHFTA VTVDDGIYLT RGGPQDGNGV VALNMDGTKR WIVNTAVGYE
RPRVANDTVY VADSRVYAFD VASGDLRWEY NLQGSRLSPT IVDVTDTVVV ETGYTVLGLD
PTAGTVRWEF ETGDQVIGEV QLSDGISYVM TSDRIAASSD GAEQWRTEFD TTTAQTGESI
VGTTSDRVFV FTSGDDGDAH QLQAFEVATG ERSWTSEPIQ PIVQPDFEWT PRTTVYGDIA
YLGGETLRAI DATTGDELWQ ASVGDGPIKT LTVINNGAEA DHTVFVQGGE TQLATFTPDG
EQTWSHSVNA PDRVSAIGEY VFVGTDNEIC SLNRLEKS
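
The gene and protein sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a listing could be cleaned, its GC fraction computed, and the nucleotides translated. It assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code, and it does not handle alternative start codons. The names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... matches the amino acid string below.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 nucleotides of the gene sequence listed in this record.
prefix = clean_sequence("ATGTTGCAGG GATCCGACGA CGATTCCAAC")
print(translate(prefix))  # MLQGSDDDSN
```

The translated prefix matches the first ten residues of the protein sequence shown in this record. Passing the full cleaned gene sequence to `gc_fraction` and `translate` would allow the GC content and protein sequence fields to be checked in the same way.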