Gene Hlac_3663 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_3663 
Symbol 
ID7402454 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012030 
Strand
Start bp423537 
End bp425117 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content54% 
IMG OID643710194 
Producthypothetical protein 
Protein accessionYP_002567760 
Protein GI222481524 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0285] Folylpolyglutamate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGGATG TCCACTTCGA GGTGTACGAC ACCGAAGCTG GGTGGTGGTG GCGACTGCGA 
ACCGGCAGTC TCGTTTTGAG CCAGTCGCAA ACAACATTTG ATTCGCCCGA TCAGGCTCGT
GCAGCCGTCG ACCGCGTCCG TACAGCGGCA TCAGTCGTCA AAAATATCCC GGAGCGACAG
TTCGAGGGTA CCCAAGCGAG CGATCGCGTT ACTGACGCGC AGTGTGTTAC TGTCAATGTT
ACCGGGCAGT ACGAGTGGGT TCTTGAAGAC GACGGTGAAG TGCTTACGCA ATCGACAACA
GCATACGAAA CCGAGGCCGG TGCTCTAGCG GCTGCCAAGG CATTCTGTAC ACACGCCAGC
GCCACAGTAA CGGTGTTCCT CTTTAGGAAC CAGGAACAGC AGTCGTCATT TGATGTCGGC
TCAACATCTA TACTGGCGGC GCTTCGCTCG TTAGCGACGC TCCCATACCG AGGGGTCAAA
CACAATCAAA AAATCAAGGA GATTGACACT CGGATCGTTG TTTCTGGCAT CCGTGGAAAA
TCATCGACCA CTCGCCGACT TAACGACGTG TTCAGGCGTC GCGGGTACGA TACACTGACA
AAAATCACGG GGAATCAGCC ACATCTGATT CACAATAATG GAGTGATCCC GCTGAACCGC
CAAGGACCCA GAACGACCTT GTACGAAAAT ATTGGCGTCT TACGAGAGTA CGTCCCCAAG
CTTGCAGAAT ACGCTCCTGA CGATGTCGCA ATTTTCGAGA ATCAAGGTAT CACGGAGTAC
ACCACGCGCC TGATTAACGA ATCATTCATA CACCCACATA TAATTGTCCT GACCAACATC
CGGCGTGATC ACCAAGACAC GCTCGGCGAG ACTCGGGCTG AGATCGCACG GTCGTTCGCC
AAATCAGTCC CTTCTAGTGC CCATGTCGTG TGTGGTGAGC AAAATCCAGT CATCTACCAG
TATCTGGAGC GTGAGGTCAC GGCCACCGGG GCGACGATCG AACAAGTAAC AATTCCTGAG
AAACACAAAG GGTTGCTTGG AGCGGAGACG GTTCACGCAG TGAACCCCAC ACTTATAGCC
GTCGATGAAC CCCCCCTTCC TGCGGATGAG ATCCAAACGT ATCTCACACA GATCCAGCCG
AAGTGGACTG CCATCCCGAA CGGGCTCGTA TTCAACGCCG CTGAGGTGAA CGACGTCGAG
AGTACAGAAG CGGTCAGACA GGCCCTTGAG AAATCTGACC GCATCACTCC GTTTGTTTTC
TTGCGTCCGG ATCGGCGCGG GCGAACCGCC TCGTTCGTTT CGTACTTCGA TCACCTCGCC
AATCGTGGTG TTATCGACGT TGGATACGTG ATGGGTAGTG ACAGCTCAGT ATTCGCGAAT
GAAACGACGT GTGAAGTCAA GGAGATCGAC TCCGGCGCCG ATCCGGCAGC CGTGTTGGAT
CGGCTGCTCA ATCATGATCG ACCGGTGATG ATTATGGGAA ACACCGTCGA CGAGTTTATG
CGAGAGCTTG ATGGTAAAAT CGACTCGCGA GCACAGCGCA TGTCTCTAGC AGATAAGCCA
CGAGGGCCCC CAGCCACGTA G
 
Protein sequence
MEDVHFEVYD TEAGWWWRLR TGSLVLSQSQ TTFDSPDQAR AAVDRVRTAA SVVKNIPERQ 
FEGTQASDRV TDAQCVTVNV TGQYEWVLED DGEVLTQSTT AYETEAGALA AAKAFCTHAS
ATVTVFLFRN QEQQSSFDVG STSILAALRS LATLPYRGVK HNQKIKEIDT RIVVSGIRGK
SSTTRRLNDV FRRRGYDTLT KITGNQPHLI HNNGVIPLNR QGPRTTLYEN IGVLREYVPK
LAEYAPDDVA IFENQGITEY TTRLINESFI HPHIIVLTNI RRDHQDTLGE TRAEIARSFA
KSVPSSAHVV CGEQNPVIYQ YLEREVTATG ATIEQVTIPE KHKGLLGAET VHAVNPTLIA
VDEPPLPADE IQTYLTQIQP KWTAIPNGLV FNAAEVNDVE STEAVRQALE KSDRITPFVF
LRPDRRGRTA SFVSYFDHLA NRGVIDVGYV MGSDSSVFAN ETTCEVKEID SGADPAAVLD
RLLNHDRPVM IMGNTVDEFM RELDGKIDSR AQRMSLADKP RGPPAT