Gene Hlac_0471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHlac_0471 
Symbol 
ID7400351 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHalorubrum lacusprofundi ATCC 49239 
KingdomArchaea 
Replicon accessionNC_012029 
Strand
Start bp488561 
End bp490840 
Gene Length2280 bp 
Protein Length759 aa 
Translation table11 
GC content72% 
IMG OID643707535 
Productphosphoribosylformylglycinamidine synthase II 
Protein accessionYP_002565143 
Protein GI222478906 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0046] Phosphoribosylformylglycinamidine (FGAM) synthase, synthetase domain 
TIGRFAM ID[TIGR01736] phosphoribosylformylglycinamidine synthase II 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.275415 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTGT CCGAGCCGGA CCACGAGCTC GTCGTCGCGG AGCTCGGCCG GGAACCGACG 
GCGGCCGAGG TCGCGCTGTT CGAGAACCTC TGGAGCGAGC ACTGCGCGTA CCGCTCCTCG
CGGCCTCTGC TGTCGGCGTT CGAGAGCGAG GGCGACCAAG TCGTCGTCGG CCCCGGCGAC
GACGCGGCGG TCCTCGCGCT GCCCGAGCCG GACGCCGCGG ACACGCCCGC GGCGGAGCGC
GACGCCGACG ATTACGGCGA TCAGTACGTC ACCTTCGGCG TCGAGAGCCA CAACCACCCC
TCCTTCGTCG ACCCGGTCGA CGGCGCGGCC ACCGGGGTCG GCGGCATCGT CCGCGACACG
ATGTCGATGG GCGCGTACCC GATCGCCCTA CTCGACTCGC TGTACTTCGG CGGCTTCGAC
CGCGAGCGCT CGCGGTACCT CTTCGAGGGC GTCGTGGAGG GCATCTCCCA CTACGGCAAC
TGCATCGGGG TGCCCACCGT CGGCGGCAGC GTCGCCTTCC ACGACGGGTA CGAGGGGAAC
CCCCTCGTCA ACGTCGCCTG CGTCGGTCTC ACGAACGAAG ACCGGCTCGT GACCGCGACC
GCACAGGAGC CCGGCAACAC CCTGATGCTC GTCGGCAACG GCACCGGTCG CGACGGGCTC
GGCGGCGCCT CCTTCGCCTC TGAGGACCTC GCCGAGGACG CCGAGACCGA GGACCGCCCC
GCGGTGCAGG TGGGCGACCC CTACGCCGAA AAGCGGCTGA TCGAGTGCAA CGAGGCGCTG
GTCGACGAGG ACCTGGTCCT GTCGGCCCGC GACCTCGGCG CGGCGGGCCT CGGCGGCGCC
TCCTCCGAAC TCGTCGCGAA GGGCGGGCTC GGCGCCCGAC TCGACCTCGA CAGCGTCCAC
CAGCGCGAGC CGAACATGAA CGCGATGGAG ATCCTGCTCG CCGAGAGCCA AGAGCGGATG
TGCTACGAGG TCGCGCCGGA GGACGTGGCG CGCGTCGAGG CGCTCGCCGA GCGGTTCGAC
CTCGGCTGCT CCGTCATCGG CGAGGTGACC GACGGGAACT ACGTCTGCGA GTTCGCGGGC
GCCGGGAAGG GCGAGGACGA TGCGGACGAC TCCGACGCGG AGTCCGAGGT CGTCGTCGAC
GTGGACGCCG AGTACCTCGC CGACGGCGCC CCGATGAACG ACCTCGCGAG CGAGTCCCCG
ACCCAGCCTG ACCGAGACCT CCCGGACCCC GAACCGTCCC TCGACGAGGC GGTCGAATCG
GTCGTCTCGG CCCCCTCGAC CGCGAGCAAG CGCTGGGTGT ACCGCCAGTA CGACCACGAG
GTCGGTGTTC GAACCGCGAT GAAACCCGGC GACGACGCCG CAATCATGGC GATCCGAGAA
ACAGCGTCGA CCGACGCGGC CGACCTCGCC CCGGCCGATC AGGGCGTCGG TCTCGCGCTC
TCGTCGGGCG CGAACCCGAA CTGGACTGAG ACCGACCCCT ACGAGGGCGC CCGCGCCGTC
GCCCTCGAAA ACGCGACCAA CCTCGCCGCA AAGGGCGCGG TCCCGCTGGC CGCAGTCGAC
TGCCTCAACG GCGGCAACCC GGAGAAGCCC GAGGTGTACG GCGGCTTCAA AGGGATCGTC
GACGGGCTCG CCGACGCCTG CGCCGACCTC GACGCCCCGG TGGTCGGGGG CAACGTCTCG
CTGTACAACG ACAGCGTCGA GGGGCCGATC CCGCCGACGC CGACCCTCGC GCTGATCGGC
ACCAGAAAGG GGTACAACGC GCCGCCTGCC GCCCTCGACG CCGACAGCGC GGGCGACTCA
GAACTCCTAC TGATCGGCGC CGGCGGCGCC GACGGCGGCG CGCTCGGCGG CTCCGAGTAC
CTCGCGCAGG CCGGGGGAAC AGACCGGTTC CCGACGCTGC CGGACACGGA GACCAAGAGT
CTTGCCGACC GCGTCGCGTC GCTCGCAGCG GTCGCCCGCC ACGAATCGAC GTTCGCGACC
CACGACGTGA GCGACGGCGG GCTCGCCGTC GCGCTCGCGG AACTCGTGAC TGACGCGGCG
GGCGCCGACG TGACCCTCCC CGACCGCGTG GCCGTCTTCG AGGAGACGCC CGGCCGCCTC
GTCGTCCAGA CGACTGACCC CGAGGCAGTC GCCGAACTCG CCGGCGAGCT ACCCGTTCTC
CGGCTCGGTG AGGTGACGAC CGACGGGGCG CTCTCGCTCT CAGTCGGCGA CGACGAGACC
GCGTTCGACG CCGACACGAT CCGGGAGCTC CGGGGCGTCA TCGACCGCGA ACTCGCCTGA
 
Protein sequence
MSLSEPDHEL VVAELGREPT AAEVALFENL WSEHCAYRSS RPLLSAFESE GDQVVVGPGD 
DAAVLALPEP DAADTPAAER DADDYGDQYV TFGVESHNHP SFVDPVDGAA TGVGGIVRDT
MSMGAYPIAL LDSLYFGGFD RERSRYLFEG VVEGISHYGN CIGVPTVGGS VAFHDGYEGN
PLVNVACVGL TNEDRLVTAT AQEPGNTLML VGNGTGRDGL GGASFASEDL AEDAETEDRP
AVQVGDPYAE KRLIECNEAL VDEDLVLSAR DLGAAGLGGA SSELVAKGGL GARLDLDSVH
QREPNMNAME ILLAESQERM CYEVAPEDVA RVEALAERFD LGCSVIGEVT DGNYVCEFAG
AGKGEDDADD SDAESEVVVD VDAEYLADGA PMNDLASESP TQPDRDLPDP EPSLDEAVES
VVSAPSTASK RWVYRQYDHE VGVRTAMKPG DDAAIMAIRE TASTDAADLA PADQGVGLAL
SSGANPNWTE TDPYEGARAV ALENATNLAA KGAVPLAAVD CLNGGNPEKP EVYGGFKGIV
DGLADACADL DAPVVGGNVS LYNDSVEGPI PPTPTLALIG TRKGYNAPPA ALDADSAGDS
ELLLIGAGGA DGGALGGSEY LAQAGGTDRF PTLPDTETKS LADRVASLAA VARHESTFAT
HDVSDGGLAV ALAELVTDAA GADVTLPDRV AVFEETPGRL VVQTTDPEAV AELAGELPVL
RLGEVTTDGA LSLSVGDDET AFDADTIREL RGVIDRELA