Gene Haur_3108 details


Gene Information

Locus tag: Haur_3108
Symbol: cysS
ID: 5734980
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3921346
End bp: 3922746
Gene Length: 1401 bp
Protein Length: 466 aa
Translation table: 11
GC content: 51%
IMG OID: 641280252
Product: cysteinyl-tRNA synthetase
Protein accession: YP_001545874
Protein GI: 159899627
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0215] Cysteinyl-tRNA synthetase
TIGRFAM ID: [TIGR00435] cysteinyl-tRNA synthetase
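
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; whitespace between blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    length = gene_length(3921346, 3922746)
    print(length)                  # 1401
    print(protein_length(length))  # 466
```

Passing the gene sequence from this record to gc_percent gives a value to compare against the reported GC content; the space-separated 10-base blocks can be pasted as they are.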


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00597205
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCATTAG CAATTTACAA CACGCTTACC CGCCAGACCG AGCCGTTTAC ACCGTTGGTT 
GCTGACCATG TGTCGATGTA TGTCTGTGGG CCGACGGTGT ATTCTGATGC CCATATTGGT
CATGCCATGT CAGCGGTGGT GTTCGATGTG GTGCGGCGCT ATTTGGAATG GTCGGGCTAT
ACGGTGCGCC ATGTGATGAA TTTTACCGAT GTTGATGATA AAATTATTCG CCGCGCCAAC
GAGCAAGGCC GTGATCCCAT GGAATTGGCC GAGAGCTATA CCTTGGCCTT TCTTGATCAA
TTGGGCCAAT TGAATGTGCT GCCCGCTACG GCCTACCCCC GCGTTTCAAC CACGATTCCG
CAAATTATTC AATTTATCGA AGGCTTGATT GCCAAAGATG CGGCATACCA CGCCAGCAAT
GGCGATGTCT ATTTTCGGGT ACGAGCCGAC GAAGATTATG GCAAACTTTC GCGCCGCGCT
GTCGATGATA TGCGCTCTGG TGCGCGAATT GCTCCCGATG AAGCCAAAGA TGACCCGTTA
GATTTTGCGC TGTGGAAATC GGCCAAACCA GGCGAGCCAG CTTGGGAAAG CCCATGGGGT
CAAGGCCGAC CTGGTTGGCA TATCGAATGC TCGGCCATGA GTTTGCACGA ATTAGCTGAG
CAAATTGATA TTCATGGTGG TGGAAACGAC TTGATCTTCC CCCACCACGA AAATGAAATT
GCCCAAACCG AATCATTAAC TGGCAAAAAT TTCGCCCAAG TCTGGATGCA CAATGGCATG
CTGCAATTGG CTGGCGAGAA AATGAGCAAA TCGCTGGGCA ATTTGATCAC GATTGATCAA
TTTTTAAGCG AACACTCCGC CGATATTATG CGCTTGCTGG TGCTTTCTGG CTCGTATCGT
GCACCATTGG TTTATAATGA TGAGGTTTTG GCTGATACCC AACGCAAACT TGAGCGAATT
ATGTCGGCGT TGAAACCAGC CCATGGCACG GCAACCAACG GCCCAGTCGT TGAGACGCTA
AATGCAATTG TTGCCAAAGC CCCAGCCGAT TTCCGTGCCG CGATGGACAG CGATTTCAAT
AGTGCAGCAG CCTTGGCGGT CTTGTTTGAT TTGGTGCGTT CGATCAACGC TGCCCGTGAT
GCAGGCGTTG GTGGCGAGCC ATTCGCAGCA GGTCAAGCCC GTTTACGTGA ATTAGCTGCG
GTGCTCGGCT TACGCTTAGA AGCGCCCAGC GCCAGCAAAA CCGATGCTGC ACCTTTTATC
GAATTGTTGA TTGAGCTACG CGCCGAGTTG CGCAAAGCTA AACAATGGGC ACTCTCCGAT
TTAGTACGCA ACCGCCTGAG CGAGCTTGAT GTACAACTCG AAGATAGTCC CAACGGCACA
ACCTGGACGA CGAAAGGCTA A
 
Protein sequence
MALAIYNTLT RQTEPFTPLV ADHVSMYVCG PTVYSDAHIG HAMSAVVFDV VRRYLEWSGY 
TVRHVMNFTD VDDKIIRRAN EQGRDPMELA ESYTLAFLDQ LGQLNVLPAT AYPRVSTTIP
QIIQFIEGLI AKDAAYHASN GDVYFRVRAD EDYGKLSRRA VDDMRSGARI APDEAKDDPL
DFALWKSAKP GEPAWESPWG QGRPGWHIEC SAMSLHELAE QIDIHGGGND LIFPHHENEI
AQTESLTGKN FAQVWMHNGM LQLAGEKMSK SLGNLITIDQ FLSEHSADIM RLLVLSGSYR
APLVYNDEVL ADTQRKLERI MSALKPAHGT ATNGPVVETL NAIVAKAPAD FRAAMDSDFN
SAAALAVLFD LVRSINAARD AGVGGEPFAA GQARLRELAA VLGLRLEAPS ASKTDAAPFI
ELLIELRAEL RKAKQWALSD LVRNRLSELD VQLEDSPNGT TWTTKG