Gene Hlac_0777 details

Gene Information

Locus tag: Hlac_0777
Symbol:
ID: 7400252
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorubrum lacusprofundi ATCC 49239
Kingdom: Archaea
Replicon accession: NC_012029
Strand:
Start bp: 781313
End bp: 782788
Gene Length: 1476 bp
Protein Length: 491 aa
Translation table: 11
GC content: 67%
IMG OID: 643707843
Product: prolyl-tRNA synthetase
Protein accession: YP_002565447
Protein GI: 222479210
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00408] prolyl-tRNA synthetase, family I
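
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below is a consistency check, not part of the record. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical.

```python
# Coordinates copied from the record; assumed 1-based and inclusive.
START_BP = 781313
END_BP = 782788


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1476 491
```

Both values agree with the Gene Length and Protein Length fields.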


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value: 0.941817
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGACG ACGACCAGGA GCTCGGAATC ACCGAGTCCA AGTCACACAA CACCGGCGAG 
TGGTACGCCG AGGTCGTACA GAAGGCGGGG CTCGCCGACT ACGGCCCCGA GGGTATGAGT
GGGTTCATCG TCACCCGACC GCGCGCGTAC GCGGTGTGGG AGCGACTGCA GGGCTTTCTC
GACGCGAAGT TCAAAGACAC CGGAGTCCAG AACGCGTACT TCCCCCTCTT CATCCCCGAG
TCGTACCTCG AACGGGAGAA GGACATCGTC GAGGGATTCG ACCCCGAGGT CGCGTGGGTG
ACCGAGGCGG GCAACAAAGA ACTCGAAGAG CGACTCGCGG TCCGGCCCAC CTCCGAGTCG
ATCATCACTC CGTACATCAG CCAGTGGGTG CGGAGCCACC GCGACCTCCC GCTGCGCGTG
AACCAGTGGT GTTCGGTCGT GCGCTGGGAG GCGACTGAGA CGAAGCCGTT CTTCCGCACG
AAGGAGTTCC TCTGGCAGGA GGGCCACACC GCCCACGCTA CCCACGAGGG CGCCTGGGAG
GAAACGATGA CGCGGCTCGA CCAGTACGCG TCCGTCTACG AGGACCTGCT GGCGATGCCC
GTGTTGAAGG GCCAAAAGCC CGACCACGAC AAGTTCCCGG GCGCAGAGAC GACCACGACC
GTCGAGGCGC TGATGCCGGA CGGGAAGTCG GTGCAGGCAG GCACCTCCCA CCACCTCGGA
CAGTCGTTCG CGGAGGCGTT CGACATCACG TTCTCCGACG AGGACGAGGA AGAGCGGATC
GCGCACACCA CCTCGTGGGG GCTCTCGTGG CGCGCACTCG GCGCGCTCAT CATGACTCAC
TCCGACGAGC AGGGGCTCGT GCTCCCGCCC GGCGTCGCCC CCGAGCAGGT CGTCGTCGTC
CCCATCTGGC AGGAGGACAC GAAAGACGAA GTGCTCGAGT ACGCCGAGGG CGTCGCCGAC
GACCTCGACG ACGCGGGGAT CCGCGTCGAG CTCGACGACC GCGACGGGCG CAACCCCGGA
TTCAAGTTCA ACGAACACGA GCTCAACGGC GTTCCCCTCC GGATCGAGAT CGGCCCCCAC
GAGGTCGAGG ACGGCGAGCT CACCCTCGTC CACCGGCCCG ACGGCGAGAG CGTCGTCGAG
GACCGAGAGG GCGTCGTTGC GACCGTCCAA GACCACTTCG ACGAGGTGTA CGCGAAGCTG
TACGCGACCG CCGAGGAGAC CCTCGACGGC GCGGTTCGCG AGGCCGACGA CCGTGCCGAC
ATCCTCGGCA CGCTCGGCCA GCACGGCGGC TACGTGACGG CTCCGTGGTG CGGCGACGAG
GCGTGCGAGG AGCCGATCAA AGAACCGATG GCCGCCGAAA TCGTGATGGT CCCGTTCGAA
GACGACGACC CTCTCGCCGA GGCGGACCAC GGCGAGACCT GCGCGATCTG CGACGACGAC
GCCGAGCGGA CGGCGTACTT CGCGAAGTCG TACTGA
 
Protein sequence
MSDDDQELGI TESKSHNTGE WYAEVVQKAG LADYGPEGMS GFIVTRPRAY AVWERLQGFL 
DAKFKDTGVQ NAYFPLFIPE SYLEREKDIV EGFDPEVAWV TEAGNKELEE RLAVRPTSES
IITPYISQWV RSHRDLPLRV NQWCSVVRWE ATETKPFFRT KEFLWQEGHT AHATHEGAWE
ETMTRLDQYA SVYEDLLAMP VLKGQKPDHD KFPGAETTTT VEALMPDGKS VQAGTSHHLG
QSFAEAFDIT FSDEDEEERI AHTTSWGLSW RALGALIMTH SDEQGLVLPP GVAPEQVVVV
PIWQEDTKDE VLEYAEGVAD DLDDAGIRVE LDDRDGRNPG FKFNEHELNG VPLRIEIGPH
EVEDGELTLV HRPDGESVVE DREGVVATVQ DHFDEVYAKL YATAEETLDG AVREADDRAD
ILGTLGQHGG YVTAPWCGDE ACEEPIKEPM AAEIVMVPFE DDDPLAEADH GETCAICDDD
AERTAYFAKS Y
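
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The minimal sketch below assumes the sense-strand codon assignments of the standard genetic code, which translation table 11 shares (the tables differ in permitted start codons). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first and last 10-base-grouped lines of the gene sequence are used here. The last line is in frame because the 1440 bases before it are a multiple of 3.

```python
from itertools import product

# Codon table in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in tens."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last lines of the gene sequence, copied from the record.
FIRST_BLOCK = "ATGAGCGACG ACGACCAGGA GCTCGGAATC ACCGAGTCCA AGTCACACAA CACCGGCGAG"
LAST_BLOCK = "GCCGAGCGGA CGGCGTACTT CGCGAAGTCG TACTGA"

print(translate(FIRST_BLOCK))  # MSDDDQELGITESKSHNTGE
print(translate(LAST_BLOCK))   # AERTAYFAKSY
```

The two outputs match the first 20 and last 11 residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field.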