Gene Arth_2168 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2168 
Symbol 
ID4445194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp2441708 
End bp2442988 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content66% 
IMG OID639689977 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_831648 
Protein GI116670715 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase
[TIGR03447] cysteine--1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0428201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAATCCT GGATCTCCCG CCCTGTCCCT CAGCTCCCGG GCCGCATGCC CGCCGTTCGG 
ATTTTCGACA CCGCCGTGGG GGCTTACAGC ACCCTCGACG CCACGGGTGA ACAATCACTA
TATGTCTGCG GAATCACCCC GTATGACGCA ACCCATATGG GGCACGCGGC CAGCTATGTC
GCTTTTGACC TGCTGAACCG GGCGTGGCGG GACGGCGGCC AGCGCGTTGC CTATGTCCAG
AACGTCACCG ACGTCGACGA CCCCCTTCTC GAGCGGGCCA CGGCAACAGG CGTTGACTGG
CGCGACCTTG CCGCAAGCCA GATCGAACTC TTCCAGACCG ACATGGCTGC CCTTAACGTC
CTGGCACCGG ACCACTACGT CGGTGCGGTC GAATCCATCC CGGAAATCGT TCCCGCCATT
GAACGCCTCC TGCACCTCGG CCTTGCATAC CGCGTTACCG GCACTCCCGG TGAGCCGGAC
GGCGACGTCT ACTACGACGT CGAGGCCGCC AGCAAGCACT CCGCGGAGGC CAAGGACGCC
TGGACCCTGG GATCAGTGTC CGGGCTCTCC GAAACCGAGA TGCTGGAACT CTTCGCCGAA
CGCGGTGGCG ACCCGGGCCG CCGTGGCAAG CGGCAGGCCC TGGATCCGCT GCTTTGGCGC
GTGGCCCGCG ACGGCGAACC CAGCTGGGCG GGCGGCGAAC TGGGTTCAGG CCGGCCAGGC
TGGCACATTG AGTGCACGGT CATTGCACAG AAGTACCTGC CGGCACCCTT CACGGTTCAA
GGCGGAGGAT CCGACCTCAT CTTCCCGCAC CACGAGATGG GCGCGGGACA CGCATACTCG
CTGACCGGCG TTCCCCTGGC ACGGCATTTC GCCCACGCCG GGATGGTGGG CCTCGACGGC
GAAAAGATGA GCAAATCCAA GGGAAACCTG GTGCTCGTGT CCAAACTCCG GGCTGCCGGC
GAGGAACCCG CGGCCATCCG CCTGGCAATC CTCGCCCACC ATTACCGCAC GGACTGGTCC
TGGACAGAGG CAGGCTTCGC GCAAGCAAAG ACAAGGCTTG CCGAGTGGCG GGACGCCCTC
ACTATGGCTC CGGGGGAGTC AGCCGCCACA CTTATCGCCG AGATGCGGAG CGAACTGGCC
AACGACCTGA ACGCCCCGGG CGCCCTCGCT GCCGTGGACC GCTGGGCCGT CGCCGCAAAG
CAGCAGGCAG GGGCCGGCTC GCCGATGGAC CAGGCGCTGG TCAGTGACGC CGTCAATGCC
CTGCTCGGCG TCGAACTCTA A
 
Protein sequence
MKSWISRPVP QLPGRMPAVR IFDTAVGAYS TLDATGEQSL YVCGITPYDA THMGHAASYV 
AFDLLNRAWR DGGQRVAYVQ NVTDVDDPLL ERATATGVDW RDLAASQIEL FQTDMAALNV
LAPDHYVGAV ESIPEIVPAI ERLLHLGLAY RVTGTPGEPD GDVYYDVEAA SKHSAEAKDA
WTLGSVSGLS ETEMLELFAE RGGDPGRRGK RQALDPLLWR VARDGEPSWA GGELGSGRPG
WHIECTVIAQ KYLPAPFTVQ GGGSDLIFPH HEMGAGHAYS LTGVPLARHF AHAGMVGLDG
EKMSKSKGNL VLVSKLRAAG EEPAAIRLAI LAHHYRTDWS WTEAGFAQAK TRLAEWRDAL
TMAPGESAAT LIAEMRSELA NDLNAPGALA AVDRWAVAAK QQAGAGSPMD QALVSDAVNA
LLGVEL