Gene Acry_1404 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_1404 
SymbolcysS 
ID5160514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp1559131 
End bp1560480 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content69% 
IMG OID640553319 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_001234533 
Protein GI148260406 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCAGA TCAAGCTGCA CAACACGAAA ACACGCCGGC GCGAGCCCTT CGCCCCCGCC 
GACCCCGCGC ATGTGAAGCT CTATGTCTGC GGTCCCACGG TCTATGACCG CGCGCATCTC
GGCAATGCGC GCACGGTGGT GGTGTTCGAC ACGCTGGTCC GCCTGCTGCG CCACCTGTTC
CCGCGCGTCA CCTATGTGCG GAACATCACC GACATCGATG ACAAGATCAA CGCCCGCGCC
GCCGAAACCG GCGAGACGAT CGGCGAGATC ACCGCGCGGA CGACAAGCTG GTTCCACGAG
GACATGGCCG CCCTGTACTG CGCCCCGCCC GATATCGAGC CGCGCGCCAC CGGGCATATC
GGCGACATCA TCGCCCTCAT CGAGCGCCTG ATCGCCCGCG GCCACGCCTA TGCCGCCGAG
GGCCATGTCC TGTTCGCCGT CGCGACCGAT GCGGAATACG GCAAGTTTTC CGGCCGCTCG
CCCGAGGAAC TGCTGGCCGG CGCGCGGGTG GACGTCGCCA CCTACAAGCG CGATCCGGGC
GATTTCGTGC TCTGGAAACC CTCGCCGCCC GACCTGCCGG GCTGGGACAG CCCCTGGGGC
CGCGGCCGGC CGGGCTGGCA CATCGAATGC TCGGCGATGA TCCACGCCAC CCTCGGCGAG
ACGATCGACA TCCATGGCGG CGGCGCCGAC CTGATCTTCC CGCATCATGA GAACGAGATC
GCCCAGTCCT GCTGCGCCTT TCCCGGCTCG GAGTTCGCCC GCGTCTGGGT GCATGCCGGC
ATGTTGCAGG TGGACGGGCA GAAAATGTCG AAATCCCTCG GCAATTTCCG CACTGTGCAG
GACGTGCTGG GCGAGGCGCC GGGCGAGGCG GTGCGCTTCC TGCTGCTCAA GACGCATTAT
CGCGGCGTGC TTGACTTCTC CACCGCCGCG CTGGCCGAAG CGAAGCGGGA GCTCGACCGG
TTCTACCGCG CATTGGAAAA GCATGCCGAC CCCGCGCCCG CCGCCACACC GCCCGCCGCC
TTCATCGAGG CCCTGGCGGA TGACCTGAAC ACCCCCGGCG CGATTGCCGA ACTCCACGCG
CTGGCCGATG CGGCGATGCA GGGCGATGCC GCCAGCGCCG CCGGCCTGCG CGCCGCCGGC
ATGCTGATCG GCATTTTCAA TCACACTGCC GACCAGTGGT TCCGCGGTGA GGCGACCGAC
GATGCGCGGA TCGACGCGCT GATCGCCGAG CGTCTGGCCG CGCGGAGGAA CAAGGATTTC
GCCCGCGCCG ACGCGATCCG CGCCGAACTC GCCGCCGCCG GCATCCTGCT CGAGGACGGT
CCCGGCGGTA CCACCTGGCG GCGCGCATGA
 
Protein sequence
MIQIKLHNTK TRRREPFAPA DPAHVKLYVC GPTVYDRAHL GNARTVVVFD TLVRLLRHLF 
PRVTYVRNIT DIDDKINARA AETGETIGEI TARTTSWFHE DMAALYCAPP DIEPRATGHI
GDIIALIERL IARGHAYAAE GHVLFAVATD AEYGKFSGRS PEELLAGARV DVATYKRDPG
DFVLWKPSPP DLPGWDSPWG RGRPGWHIEC SAMIHATLGE TIDIHGGGAD LIFPHHENEI
AQSCCAFPGS EFARVWVHAG MLQVDGQKMS KSLGNFRTVQ DVLGEAPGEA VRFLLLKTHY
RGVLDFSTAA LAEAKRELDR FYRALEKHAD PAPAATPPAA FIEALADDLN TPGAIAELHA
LADAAMQGDA ASAAGLRAAG MLIGIFNHTA DQWFRGEATD DARIDALIAE RLAARRNKDF
ARADAIRAEL AAAGILLEDG PGGTTWRRA