Gene Acry_1020 details


Gene Information

Locus tag: Acry_1020
Symbol:
ID: 5160674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009484
Strand:
Start bp: 1132168
End bp: 1133169
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 69%
IMG OID: 640552937
Product: tryptophanyl-tRNA synthetase
Protein accession: YP_001234156
Protein GI: 148260029
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0180] Tryptophanyl-tRNA synthetase
TIGRFAM ID: [TIGR00233] tryptophanyl-tRNA synthetase
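
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1132168, 1133169)
print(length, protein_length(length))  # 1002 333
```

Both results match the Gene Length and Protein Length values in the record.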


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCACGCA TATTTTCGGG CATCCAGCCC ACCGGAATCG CGCATCTCGG CAATTATCTC 
GGCGCGATCC AGAACTGGGT CGCGCTGCAG GAGGGCAACG AGGGCATCTA CTGCCTGGTC
GACCTGCACG CGCTCACCGT CTGGGTCGAG CCCGAGGCGC TGCGCCAGCA GACGCGGGTG
AACGCCGCCC TGCTCGTCGC CTGCGGCATC GACCCCGCCC GCAGCATCCT GTTCCACCAG
TCGGCGGTGC ACGCCCATGC GCGCCTCGCC TGGATCTTCA ACTGCGTCGC CCGCTTCGGC
TGGCTGAACC GCATGACCCA GTTCAAGGAC AAGGCCGGCA AGGACCGCGA GGCCGTCTCC
ACCGGCCTGT TCGTCTATCC CAACCTGATG GCCGCCGACA TCCTCGCCTA TCACGCGACC
GAGGTGCCCG TCGGCGAGGA CCAGAGCCAG CATCTCGAAC TCGCCAACGA CATCGCGCAG
AAATTCAACC ACGATTACGG GGTGGAATTC TTCCCCGCCA TCACGCCGCG CATCATCGGC
GGCAGCGCGC GCATCATGAG CCTGCGCGAC GGCACCAGGA AAATGTCGAA ATCCGACACC
TCGGACCAGA GCCGCATCAA CCTCACCGAC GATGCCGACG CCATCGCCCA GAAGATCCGC
CGCGCGAGGA CCGACCCGGA ACCGCTGCCC GAGCGCCCGG CGCAGCTCGA TTCCCGCCCC
GAGGCCCGCA ACCTCGTCGG CATCTACGCC GCCCTCGCCG GCATCACCGC CGAGGAGGTG
CTGCGCCAGC ACGCCGGCAG CGGATTCGGC CCGTTCAAGG AACGGCTGAC CGAGCTCGCG
GTGCAGAAAC TCACCCCGAT CGGCGCCGAG ACCAAACGCC TCGCCGCCGA TCCCGCCGAG
ATCGACCGCA TGCTCCAGGC CGGCGCCGCC CGTGCGGCGG CCATCGCCGA GCCGATCGTC
GCCGAAGCCG AGCGCCTCGT CGGCCTGCTG CCCGCGCGCT GA
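
The GC content reported for this gene can be recomputed from the sequence itself. A minimal sketch, assuming GC content means the percentage of G and C bases over the whole gene; `gc_content` is an illustrative name, and the function ignores the spaces and line breaks used in the listing above.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# Only the first line of the gene sequence is used here to keep the example short;
# paste the full sequence to compare the result with the reported value.
first_line = "ATGGCACGCA TATTTTCGGG CATCCAGCCC ACCGGAATCG CGCATCTCGG CAATTATCTC"
print(round(gc_content(first_line), 1))
```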
 
Protein sequence
MARIFSGIQP TGIAHLGNYL GAIQNWVALQ EGNEGIYCLV DLHALTVWVE PEALRQQTRV 
NAALLVACGI DPARSILFHQ SAVHAHARLA WIFNCVARFG WLNRMTQFKD KAGKDREAVS
TGLFVYPNLM AADILAYHAT EVPVGEDQSQ HLELANDIAQ KFNHDYGVEF FPAITPRIIG
GSARIMSLRD GTRKMSKSDT SDQSRINLTD DADAIAQKIR RARTDPEPLP ERPAQLDSRP
EARNLVGIYA ALAGITAEEV LRQHAGSGFG PFKERLTELA VQKLTPIGAE TKRLAADPAE
IDRMLQAGAA RAAAIAEPIV AEAERLVGLL PAR
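
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hand-rolled translator, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons), and it does not handle alternative start codons, which is sufficient here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGCACGCA TATTTTCGGG CATCCAGCCC"))  # MARIFSGIQP
```

The output matches the first ten residues of the protein sequence; the last four codons of the gene (CCC GCG CGC TGA) likewise give the final residues PAR followed by a stop.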