Gene Haur_4135 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4135 
SymbolargS 
ID5735996 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5280847 
End bp5282622 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content50% 
IMG OID641281289 
Productarginyl-tRNA synthetase 
Protein accessionYP_001546895 
Protein GI159900648 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATACCT TTGCTCGCTT CGAGCAAGCC ATTCGCGAGG CCTTGCTTGC CACCAATTTA 
ATTAGCGCCG CCGATATTGA TTTAGGCGCA CCCAAAGCTG CTGGCGTACA GGCCGATTTA
GCCTTGCCTT GTTTTCGTGC TGCCAAAAGC CGTGGCAGCA CCCCTGCTCA AGTTGCCCAA
GAATTAGTTG CCGCGCTGCA ATTTGCCCCC GATAGTTTGG TTGCCAGTGC GACTATTTCT
GGCCCCTATG TTAATTTCAA TCTCAACCCT CAAACCTTTG CTAAAGCCGT TTTGGCCGAT
ATTCAGGCTG GCGGCGCAAC CTATGGCAGT AGCACCAAGG GCAACAATCG CAAAGTAATT
GTCGAATATT CATCGCCCAA TATTGCCAAG CGTATGCACG TTGGTCATAT TCGCTCAACG
ATCATCGGCC AAGCGATTGC CAATCTCTAC CAACGCTTGG GCTACGAAGT GATTCGCGAT
AATCACTTAG GCGATTATGG CAAACAATTC GGGGTCAATA TTGCCGCCAC CTTGCGTTTT
GGCAAGCCCG AAGGCGAAGG TGAGGCCGTG CTCGCAGCGA TTGAAGAACA ATACAAACGC
TATAATTTGT TGATGAAGGG CGCAGTTGCC GAAGATACCG AGTATGACCC TGATTCAGAT
GCTGGCTTGG ATGATGAAGC CCGCGCTTGG TCGTTGAAAT TAGAACAGGG CGATCCCCAA
GCAGTTGAAA TTTGGCAATG GATGGTTGAT TTGACCAAAA CTGCCAATCA GCCCAATTAT
GATCGTTTGG GCGTGCATTT CGATCTGCAA CATGGCGAAA GTTTTTACAA AGATATGTTG
GCCGAAATCA TCAGCGATGC TGGCGAGAGT GAGCTGGCAG AACGTGATGG CAATGCTATT
ATTGTCAAAG ATTTACCCGA CCATCGCGGC AAAAAATTAC CAACCTTTTT GATTCAGCGC
TCGGATGGCG GCACGCTCTA CATGACCCGC GATATTGCCA CCATCAAATA TCGTGAGCAA
ACTTACAATC CCGATGCGAT GATTTACATT GTGGGTCAGC CACAAGAATT GCACTTCCGC
CAAACCTTTG CCATCAGCAA GGCCTTGGGC TACACCGATG CCGAGTTGAT TCATATTTCG
TTTGGTACGG TGTTTGATGC CAAGGGCCAG CCACTTTCAA CCCGCAAGGG CAATATGATC
TATCTCGAAA CCTTGCTGGA TGAAGCCCGT AATCGCGCCA AAGCCTTGAT TGAACAAAAA
ATGGCTGAAG GCAAAACTCA ACTTACCGCC GAATTGATCG ATCAAGTTGC CGAGCAAGTT
GGGGTTGGCG CGGTGATGTA CAACGATTTG TACCAAGATA CCAAGCGCAA TATCACCGTC
GATTGGGATC GCATGTTGGC ATTCGAGGGC AATAGCTCGC CCTATTTGCA ATATATGCAT
GCTCGTTGCT GCTCGATTCT GCGCGATTTT GGCAAATTAC CCGCTAGCTA CGATGGCAGT
TTGTTGAGCC ATTCAGCTGA AACTGGCTTG TTGAAAGAGC TTGCCCGTTT GCCACAAATT
ATTGAAGAAG CAGCGGCACG GTATGCGCCG TTCGTGGTCG CCGATTGGCT GTATGCCACG
GCGCGGGCCT TCTCGGCTTT CTACGATGCC TGTTCAGTGC TCAAAGCCGA AACGCCAGAG
TTACGGGTTG CACGTGGTCA TGTAGTTGCC GCCACCGCCC AAGCGCTCCG CAATGGTTTA
GCGTTGCTCT CAATTGCTGC TCCTGAACGC ATGTAA
 
Protein sequence
MYTFARFEQA IREALLATNL ISAADIDLGA PKAAGVQADL ALPCFRAAKS RGSTPAQVAQ 
ELVAALQFAP DSLVASATIS GPYVNFNLNP QTFAKAVLAD IQAGGATYGS STKGNNRKVI
VEYSSPNIAK RMHVGHIRST IIGQAIANLY QRLGYEVIRD NHLGDYGKQF GVNIAATLRF
GKPEGEGEAV LAAIEEQYKR YNLLMKGAVA EDTEYDPDSD AGLDDEARAW SLKLEQGDPQ
AVEIWQWMVD LTKTANQPNY DRLGVHFDLQ HGESFYKDML AEIISDAGES ELAERDGNAI
IVKDLPDHRG KKLPTFLIQR SDGGTLYMTR DIATIKYREQ TYNPDAMIYI VGQPQELHFR
QTFAISKALG YTDAELIHIS FGTVFDAKGQ PLSTRKGNMI YLETLLDEAR NRAKALIEQK
MAEGKTQLTA ELIDQVAEQV GVGAVMYNDL YQDTKRNITV DWDRMLAFEG NSSPYLQYMH
ARCCSILRDF GKLPASYDGS LLSHSAETGL LKELARLPQI IEEAAARYAP FVVADWLYAT
ARAFSAFYDA CSVLKAETPE LRVARGHVVA ATAQALRNGL ALLSIAAPER M