Gene Hlac_2166 details


Gene Information

Locus tag: Hlac_2166
Symbol:
ID: 7401099
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 2155086
End bp: 2156114
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 68%
IMG OID: 643709236
Product: thymidylate synthase
Protein accession: YP_002566813
Protein GI: 222480576
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0207] Thymidylate synthase
TIGRFAM ID: [TIGR03284] thymidylate synthase
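
The start, end, gene length and protein length fields above can be checked against one another with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not say so explicitly) and that the CDS ends in a single untranslated stop codon. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 2155086
end_bp = 2156114

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1029
print(gene_length % 3)  # 0, i.e. a whole number of codons
print(protein_length)   # 342
```

Under these assumptions the computed values agree with the reported gene length (1029 bp) and protein length (342 aa).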


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAACAAT ATCTCGATCT CGTCGACGAC ACCCTGTCGA CGGGCACGTA CAAGCCGAAC 
CGAACGGGCG TCGACACGAT CGCGACGTTC AGCGGGCAGT ACACCGTCGA CCTCTCGGAG
GGGTTCCCGC TCCTCACGAC CAAGAAGATG GACGGCTACC GCTGGAACTC GCTGATCCAC
GAGGTGCTCT GGTACCTCTC CGGCGAGGAG CACATCCGGG ACCTCCGCGA GGAGACGAAG
ATCTGGGACG CGTGGGCCGA CGACGAGGGC CGCCTCGACA CCGCGTACGG TCGGTTCTGG
CGCCGGTTCC CCGTGCCGGA CGGCGTCGAC GCGCTCCCCG GCGAGACGTG GCCGAAGGAT
GCGCACCGCT GGGTCACCGT CGAGGAGGGG CCGGAGGGCG TCGAGCGCCG GACCTTCGAC
CAGATCCAGT ACGTGCTCGA CACCCTCGAC GAGAACCCCC GGTCGCGCCG GATGGTCGTG
AACGCGTGGC ACCCCGCCAA CGCCGCTGTC TCGACGCTGC CGCCGTGTCA CTACACCTTC
GTGGTGAACG TCCAGGACGG GCGGCTCAAC CTCCACCTCA CGCAGCGCTC GGGCGACATC
GCGCTCGGGG TGCCCTTCAA CATCGCCGCG TACGCGCTGC TCGCGAACGC GCTCGCACAG
CGAACGGGGT TCGAGATCGG CGAGTTCGGC CACACCGTCG TCGACGCCCA CATCTACTGC
GGGCGCGGCG ATCGCGGGCA GTGGTACGCG AACAACCTCC GGTACGTGCA AGACCGGCTC
GCGACCGTCG AGAGCAAGGC CGACTACCTC GACGTGAAGA GCTGGGTCGA GCGGACCGCC
CCCGACGAGG CGGACGGCGA GGAGAGGTAC GACCACGTCC CCGGGCTGCT CGAACAGCTC
TCGCGGGAGC CGCGCGAGCG ACCCCGGATC GAGATCGCCG ACAAGCCGCT CGACGAACTC
ACGTACGAGG ACATCGAGGT CGTCGACTAC GACTCCGCGG ACGGCATCTC GTTCGCGGTC
GCGGAGTGA
 
Protein sequence
MQQYLDLVDD TLSTGTYKPN RTGVDTIATF SGQYTVDLSE GFPLLTTKKM DGYRWNSLIH 
EVLWYLSGEE HIRDLREETK IWDAWADDEG RLDTAYGRFW RRFPVPDGVD ALPGETWPKD
AHRWVTVEEG PEGVERRTFD QIQYVLDTLD ENPRSRRMVV NAWHPANAAV STLPPCHYTF
VVNVQDGRLN LHLTQRSGDI ALGVPFNIAA YALLANALAQ RTGFEIGEFG HTVVDAHIYC
GRGDRGQWYA NNLRYVQDRL ATVESKADYL DVKSWVERTA PDEADGEERY DHVPGLLEQL
SREPRERPRI EIADKPLDEL TYEDIEVVDY DSADGISFAV AE
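
As a cross-check, the protein sequence can be recomputed from the gene sequence. The sketch below is illustrative and is not the database's own pipeline. The helper names `clean`, `gc_fraction` and `translate` are hypothetical, and the code assumes the gene sequence is listed 5' to 3' in the coding orientation. Translation table 11 assigns amino acids to codons exactly as the standard code does (the two tables differ in the permitted start codons), so a standard codon table is used here. Only the first three blocks and the last block of the gene sequence are translated, to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (NCBI convention); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display blocks of ten."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1 and stop at the first stop codon.

    Raises KeyError for codons with characters other than A, C, G, T.
    """
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks and last block of the gene sequence shown above.
first_blocks = "ATGCAACAAT ATCTCGATCT CGTCGACGAC"
last_block = "GCGGAGTGA"

print(translate(first_blocks))  # MQQYLDLVDD
print(translate(last_block))    # AE
```

The two outputs match the first block and the final two residues of the protein sequence above. The last block can be translated on its own because the 17 preceding lines hold 1020 bases, a multiple of three, so it starts in frame. Passing the full gene sequence to `gc_fraction` would give a value to compare with the reported GC content of 68%.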