Gene Cagg_2187 details

Gene Information

Locus tag: Cagg_2187
Symbol:
ID: 7266760
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2680352
End bp: 2681791
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 59%
IMG OID: 643567018
Product: histidyl-tRNA synthetase 2
Protein accession: YP_002463506
Protein GI: 219849073
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0124] Histidyl-tRNA synthetase
TIGRFAM ID: [TIGR00442] histidyl-tRNA synthetase
            [TIGR00443] ATP phosphoribosyltransferase, regulatory subunit
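
The gene and protein lengths in this record follow from the start and end coordinates. The short sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 2680352
end_bp = 2681791

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# a stop codon and contributes no amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1440 bp
print(protein_length, "aa")   # 479 aa
```

Both values agree with the Gene Length and Protein Length fields listed above.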


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGTTG ATCTTGTACG CGGTATGCGA GACGTGATGC CGGCGGAATA TCTCACCAGC 
CGGCATATAC AAACGATATT AGAGCGAACG ATAGCCGGCT TTGGTTATCA AATGATCGAT
GCTCCGATCA TCGAGCATCG TGAATTGTAT CTGCGCAAAT TAGGTGAAGA TCTCGTCGGC
AAAGTGTATG AATTTAGCTT TGGTGGGCGT GATTTAGCGT TGCGTCCCGA ATGGACAGCA
TCGGTGCTAC GGGCCTACGT GGCCGGGATG CAAGACCAAC CGCTCCCGCT GCGGCTGAGC
TATGCCGGAC CGGTATTCCG CTACGAACGG CCTCAACGTC ACACCTACCG GCAGTTTACT
CAAGTCGGTG GCGAGATCAT CGGTGGGCTG CCACCACGGG CCGATGCTGA GGTGATAGCC
CTTGCCTGTG CCGGCCTTAA CGCAGCCGGG GTAACCGATT ACACAGTACG GATCGGTCAT
ATTGGGCTGG CCCGCGAGCT GCTCAGCCGG TTTGACTTAC CCGAACGAAC CCGCGGGTTA
TTGGTTTGGA GCCTCGAACG GTTACGCGCC GAAGGCCTCG CACCGGTACG CGAGCGCGTC
CTGGCCAATC TCGGTACACC ACCGGAGGGT CTAGAATTGC CGGCCGGGCT TGATGACGCG
CAGGCCGCAC TCTGGCTCGA ACGGGTGTTA GCCGCAATGG GGATCGATTT ACGCACCGGT
ACGCGCAGTC CGGCTGAAGT TATTCAACGG CTATTACGGA CTCTCCGCCG CGCCGATGAA
CAACCGCTGG TCGAAGCAGC ATTGGCGCAA TTGGCCCGCT TAGCCGCCAT CACCGGTACG
CCGCACCACA CCTTGCCGGC CCTTACCGAT TTACTCGGGA CCGAATCGGC AGCGTTTCGC
GAATTGCAAG CGATCCTCGA TCTGCTCATC GCCCACGGCG TATCCCCCCA ACAGGTGATC
ATCGATGGTG CCTTGGGCCG AGGTCTACAT TACTACACCG GCTTGATCTT CGAAATCTAC
GACAGCGAAG GTAATCAACT CTGCGGTGGC GGGCGCTACG ATGATCTGGT CGGCGCACTC
GGTGGGCGGC AATCAGTACC GGCGGTAGGT TTTGCCTATG GACTGGAGCG GGTGGTTGCA
GCCGCGACAC CCCCAGAGCC AACCGATCCG CGCACCGTCT TGGTGGCCGC CGTGAGTGAT
GAAGACTACC CGTATGCCGT GCTGGTTGCC AACCTCCTGC GAATGAGTGG TCATACAGTC
GTGCTCGATG TGCGCAAGCG CAGCATTAAA GACAACTTGC GTGATGCCAC GCGCCGTGGC
TTCGCCGCAG CGGTGATCGC CGGAGCTGCC GAACGCGAAG GGGAGTATGT CGTGTGGCGC
GATTTAGCGA CCCGTACCGA ACGGCGCATT ACCCTTGCTG AGTTAGGAGG GGGGGTATGA
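
The gene sequence is printed in blocks of 10 bases, 60 per line. To work with it programmatically, the spaces and line breaks need to be removed first. The sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the listing above.
first_line = "GTGACCGTTG ATCTTGTACG CGGTATGCGA GACGTGATGC CGGCGGAATA TCTCACCAGC "
fragment = clean_sequence(first_line)

print(len(fragment))                       # 60 bases per line
print(f"{gc_fraction(fragment):.1%}")      # GC fraction of this fragment only
```

Passing the full 1440 bp listing to `clean_sequence` and then to `gc_fraction` would give the value for the whole gene, which can be compared with the GC content field.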
 
Protein sequence
MTVDLVRGMR DVMPAEYLTS RHIQTILERT IAGFGYQMID APIIEHRELY LRKLGEDLVG 
KVYEFSFGGR DLALRPEWTA SVLRAYVAGM QDQPLPLRLS YAGPVFRYER PQRHTYRQFT
QVGGEIIGGL PPRADAEVIA LACAGLNAAG VTDYTVRIGH IGLARELLSR FDLPERTRGL
LVWSLERLRA EGLAPVRERV LANLGTPPEG LELPAGLDDA QAALWLERVL AAMGIDLRTG
TRSPAEVIQR LLRTLRRADE QPLVEAALAQ LARLAAITGT PHHTLPALTD LLGTESAAFR
ELQAILDLLI AHGVSPQQVI IDGALGRGLH YYTGLIFEIY DSEGNQLCGG GRYDDLVGAL
GGRQSVPAVG FAYGLERVVA AATPPEPTDP RTVLVAAVSD EDYPYAVLVA NLLRMSGHTV
VLDVRKRSIK DNLRDATRRG FAAAVIAGAA EREGEYVVWR DLATRTERRI TLAELGGGV