Gene Acel_1172 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcel_1172 
Symbol 
ID4485060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidothermus cellulolyticus 11B 
KingdomBacteria 
Replicon accessionNC_008578 
Strand
Start bp1301464 
End bp1302675 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content70% 
IMG OID639729948 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_872930 
Protein GI117928379 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase
[TIGR03447] cysteine--1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACGCCT GGCCAGCTCC GAAGATTCCG AGTCTGCCGG GTCGCAACCG CTTGCCGAGC 
CTGTTCGATA CGGCGAGCCG CCGTCTGGTC ACCGTTGGCT CGCAGCAGGG CGCGTCGATG
TACGTCTGTG GCATCACGCC GTACGACGCC ACCCACCTCG GGCACGCCGC CACGTACGTC
GCCTTCGACC TGCTCGTCCG GACCTGGCAC GACGCCGGGG TGACGGTCCG TTACGCACAG
AACGTCACCG ACGTTGACGA TCCGCTGCTG GAACGGGCCC GGCAGGTTGA CCAACCGTGG
GAAGCCATCG CCGCCCGGGA AACCGCGAAA TTCCGGGCGG ACATGGCGGC GTTGCGCGTC
GTCCCGCCGG ACCGCTACGT CGGCGTCGTC GAATCCCTGC CCCAGATCAT CGGCCTCATC
GAGGTGCTCC GATCCCGCGG TTTGACCTAC GAGCTGGACG GCGACCAGTA CTTCGCGACG
CACGCGATCC CCGACTTCGG GGCCGTCAGC CATCTCGGCC GCGACGACAT GATTGCCCTG
TTCGCCGCGC GTGGCGGCGA CCCCGACCGT GCCGGGAAGA AAGACCCGCT GGACGCCTTG
CTCTGGCGGG GCAAGCGGCC GGAGGAGCCA AGTTGGCCGG CGCCGTTCGG CCGTGGCCGG
CCGGGCTGGC ACGTCGAGTG CGCCGCGATT GCGCTGACCC ATCTGCCGCT GCCGCTGGAC
GTGCAGGGCG GAGGAGCGGA CCTGGTCTTC CCGCATCACG ACATGACCGC CGCACAGGCC
GAGGCGGCAA CCGGACGCCG GTTCGCCCGC GCATACGTGC ACACCGGCCT CGTCGCCTAT
CAAGGCGAGA AAATGTCGAA ATCTCTGGGA AATCTGGTCT TCGTCTCGGA CCTGTGTGCA
GCCGGCGCCG ACCCGATGGC GGTCCGGCTC GCCTTGCTCG ATCACCACTA CCGCACCGAA
TGGGAGTGGA CGCCGCGGCT GCTCGACGAA GCGACCGATC GACTCGCCGA ATGGCGGGCT
GCGGTCCGGC GGCCGAGGGG AGCGCCGGGG GACGGCCTGC TCGCCGCCGT CCGGGACCGG
CTCGCCGACG ACCTCGACGC ACCCGGGGCG ATTGCCCTCA TCGACGAGTG GACGACGCAG
GACGGCGACG ACCCGGATGC GCCCACCTTG GTCGCCGCGA TGGCCGACGC GTTACTCGGC
GTACACCTGT GA
 
Protein sequence
MHAWPAPKIP SLPGRNRLPS LFDTASRRLV TVGSQQGASM YVCGITPYDA THLGHAATYV 
AFDLLVRTWH DAGVTVRYAQ NVTDVDDPLL ERARQVDQPW EAIAARETAK FRADMAALRV
VPPDRYVGVV ESLPQIIGLI EVLRSRGLTY ELDGDQYFAT HAIPDFGAVS HLGRDDMIAL
FAARGGDPDR AGKKDPLDAL LWRGKRPEEP SWPAPFGRGR PGWHVECAAI ALTHLPLPLD
VQGGGADLVF PHHDMTAAQA EAATGRRFAR AYVHTGLVAY QGEKMSKSLG NLVFVSDLCA
AGADPMAVRL ALLDHHYRTE WEWTPRLLDE ATDRLAEWRA AVRRPRGAPG DGLLAAVRDR
LADDLDAPGA IALIDEWTTQ DGDDPDAPTL VAAMADALLG VHL