Gene Hhal_0701 details

Gene Information

Locus tag: Hhal_0701
Symbol:
ID: 4710810
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Halorhodospira halophila SL1
Kingdom: Bacteria
Replicon accession: NC_008789
Strand:
Start bp: 785255
End bp: 786961
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 72%
IMG OID: 639855164
Product: prolyl-tRNA synthetase
Protein accession: YP_001002285
Protein GI: 121997498
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0442] Prolyl-tRNA synthetase
TIGRFAM ID: [TIGR00409] prolyl-tRNA synthetase, family II
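
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(785255, 786961)
print(length)                     # 1707
print(protein_length_aa(length))  # 568
```

Both values agree with the Gene Length and Protein Length fields listed in the record.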


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.269166
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCGTCA CCCGTTTTCC ACTATCCACC ACCCGTGAGA CCCCGGCCGA CGCCGAGATC 
GTCAGCCACC AGCTGATGCT GCGCGCCGGC ATGATCCGCC GCCTCTCCTC GGGGCTCTAC
ACCTGGCTGC CCCTGGGCCT GCGCGTGCTG CAGAAGGTGG AGCGCATCGT GCGCGAGGAG
ATGAACCGCG CCGGGGCGCT GGAGGTGCTG ATGCCGGCGG TGCAGCCGGC GGAGCTCTGG
CAGGAGTCCG GCCGCTGGGA GAAGTACGGC CCGGAGCTGC TGCGCATCCG CGACCGGCAC
GACCGCGAGG GCTGCTTCGG CCCCACCCAC GAGGAGGTGA TCACCGACCT CTTCCGCCGG
GAGATCCGCA GCTACCGCCA GCTGCCGGTG AACTACTACC AGATCCAGAC CAAGTTCCGG
GACGAGATCC GGCCGCGCTT CGGGGTCATG CGCGCCCGCG AGTTCCTGAT GAAGGACGCC
TACTCCTTCC ACCTCGACGA CGACGACCTG CGCGCCGAGT ACCAGCGCAT GCACGAGGCC
TACTGCCGGA TCTTCCAGCG CACCGGCCTG GCCTTCCGCC CGGTGGAGGC CGACACCGGG
GCGATCGGCG GCAGCGTCTC CCACGAGTTC ATGGTCCTGG CCGACTCCGG CGAGGACGCC
ATCGCCGTCT GCGAAGCCAG CGGCTACGCC GCCAACGTCG AGCTGGCCCC GGCGGTGGCA
CCCACCGAGC CGCGCCCGGC CCCCCAGGCG GAGCGGGCGG AGGTGGCTAC CCCGGGGCAG
CGGACCATCG CCGAGGTGGC CGCCTACCTG GGTCTGCCCG AGGCCCGCAA CCTCAAGACC
CTGCTGGTCG AGGGGGCCGA CGGCGGCCTG GTGGCGCTGC TGCTGCGCGG CGACCACGAG
CTCAACGAGC TCAAGGCCGA GAAGCATCCG GCGGTGAAGG CGCCGCTGAC CTTCGCCGAG
GCCGAGCGCG TCGAGCGCCA GCTCGGCTGC CCCTTCGGCT CCCTGGGGCC GGTGGGGCTG
ACGGGGGTGA CGCTGATCGC CGATCACGCC GCCGCCCACC TGGCCGACTT CGCCTGCGGC
GCCAACCGCG AGGGCTACCA CCTCACCGGC GTCAACTGGG GCCGCGACCT GCCCGAGCCG
GAGACCGCCG ACCTGCGCGA GGTGACCGCC GGCGACCCGA GCCCCGACGG CGAGGGCACG
CTGACCCTGC GCCGCGGCAT CGAGGTCGGC CACATCTTCC AGCTCGGCAC CACCTACAGC
GAGGCCATGG GCGCCAGCGT CCTCGACGAG CAGGGCCAGG AGCGCACGGT GACCATGGGC
TGCTACGGCA TCGGCGTCTC GCGCGTGGTG GCCGCGGCCA TCGAGCAGAA CCACGACGAC
CGGGGCATCT GCTGGCCGGC GCCCATCGCG CCGTTCCAGG TGGCCCTGGT GGCGATCAAG
GCCGAGGACC CGGCGGTGGC CGAGGCCGCC GAGGCGCTCT ATGCGGACCT GACCGCCAGC
GGCATCGACG TCCTCTACGA CGACCGCGAC GCCCGCCCCG GGGTGAAGTT CGCCGACATG
GAGCTCATCG GCATCCCCCA CCGGGTGGTG GTCAGCCCCC GGGCCATCCA GGAGGGCAGC
GTCGAATACA AGGGGCGCCA GGATGCGGAC CCGACCCACG TCCCCCGAGC GGAGATCGTG
ACATGGCTGA AGAACCGTCT GACGTAA
 
Protein sequence
MRVTRFPLST TRETPADAEI VSHQLMLRAG MIRRLSSGLY TWLPLGLRVL QKVERIVREE 
MNRAGALEVL MPAVQPAELW QESGRWEKYG PELLRIRDRH DREGCFGPTH EEVITDLFRR
EIRSYRQLPV NYYQIQTKFR DEIRPRFGVM RAREFLMKDA YSFHLDDDDL RAEYQRMHEA
YCRIFQRTGL AFRPVEADTG AIGGSVSHEF MVLADSGEDA IAVCEASGYA ANVELAPAVA
PTEPRPAPQA ERAEVATPGQ RTIAEVAAYL GLPEARNLKT LLVEGADGGL VALLLRGDHE
LNELKAEKHP AVKAPLTFAE AERVERQLGC PFGSLGPVGL TGVTLIADHA AAHLADFACG
ANREGYHLTG VNWGRDLPEP ETADLREVTA GDPSPDGEGT LTLRRGIEVG HIFQLGTTYS
EAMGASVLDE QGQERTVTMG CYGIGVSRVV AAAIEQNHDD RGICWPAPIA PFQVALVAIK
AEDPAVAEAA EALYADLTAS GIDVLYDDRD ARPGVKFADM ELIGIPHRVV VSPRAIQEGS
VEYKGRQDAD PTHVPRAEIV TWLKNRLT
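
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below assumes the gene sequence is given 5' to 3' in frame, as displayed above, and uses the codon assignments of translation table 11, which are the same as those of the standard code for internal codons. The helper names are hypothetical. Alternative start codons such as GTG or TTG are not special-cased, which is sufficient here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last display lines of the gene sequence in this record.
first_line = "ATGCGCGTCA CCCGTTTTCC ACTATCCACC ACCCGTGAGA CCCCGGCCGA CGCCGAGATC"
last_line = "ACATGGCTGA AGAACCGTCT GACGTAA"

print(translate(first_line))  # MRVTRFPLSTTRETPADAEI
print(translate(last_line))   # TWLKNRLT
```

The two outputs correspond to the first 20 and the last 8 residues of the protein sequence. Passing the full gene sequence to `translate` in the same way would allow the whole record to be compared.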