Gene Acid345_3860 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_3860 
Symbol 
ID4071012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp4570109 
End bp4571188 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content60% 
IMG OID637985884 
Producttryptophanyl-tRNA synthetase 
Protein accessionYP_592934 
Protein GI94970886 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0180] Tryptophanyl-tRNA synthetase 
TIGRFAM ID[TIGR00233] tryptophanyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.968888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGAACA TGAGCAAAGA ACGCGTGTTG AGTGGGATGC GTCCCACCGG GAAGCTGCAC 
CTGGGGCACT ATGTCGGCGC CCTTCAGAAC TGGGTGAAGT TGCAGGAGCA GTACGACTGT
TTCTATTTCG TCGCCGATTG GCACGCGCTG ACCACGAACT ACGCCGATAC CTCCGACATC
AAGGCGAGTT GCATTGAACT GATCATCGAC TTCCTCTCCG CCGGACTCGA TCCCGAGAAG
TCCACGCTGT TCATCCAGTC ACACGTGCCG CAGCATGCCG AGTTGTACCT GCTGCTGTCG
ATGATCACGC CGCTCGGCTG GCTCGAGCGT GTCCCCACGT ACAAGGAACA GCTGGAGAAC
ATCAAGGACA AAGACCTCGG GATGTACGGC TTCCTCGGCT ACCCCGCGCT GCAAACCGCC
GACATCATCA TCTACAAGGC CAAGTATGTA CCGGTAGGCC AGGACCAGGT GCCGCACCTC
GAGATCAGCC GGGAAATCGC GCGTCGCTTC CACCAGTTCT ATCCGCGCAA AATGCACGCC
GGCATTGCCG CTCCGGAGCG CGACTACGTT TTTCCCGAGC CCAAGCCGCT GCTTACGCCG
GCTGCAAAAC TGCCCGGCAC CGACGGCCGC AAGATGTCGA AGTCGTACGG CAACAGTATT
CTGCTCAGCG ATCCGGAAGC GGAAATTCGC GCAAAGCTGA AGACCATGGT CACCGACCCA
GCGCGCGTGC GCCGCACCGA TCCCGGCAAT CCGGATGTGT GCCCGGTCGG CGACCTGCAT
AAAATCTTCA GCGACGCCGA GACCATGGCG AAGGTGAACG AAGGCTGCCG TACCGCTGGG
ATTGGCTGCA TCCAGTGCAA AGGATGGGCC GCCGACTCCA TCGTGAGAGT CCTGGCTCCG
ATTCAAGAGC GCCGCGCGAA ATACGAGGGC AATCCGAAGA TGGTCTGGGA TATCCTCGAA
GCCGGCTCGG CGAAGGCACG CGTTGCCGCC GAGGCCACAA TGGTCGAAGT GCGCGAGGCG
ATGGGAATGT CACACCAGTA CGAAGCGCCG AACACGTCGG CAGCAGCGGA GTCGAAGTAA
 
Protein sequence
MSNMSKERVL SGMRPTGKLH LGHYVGALQN WVKLQEQYDC FYFVADWHAL TTNYADTSDI 
KASCIELIID FLSAGLDPEK STLFIQSHVP QHAELYLLLS MITPLGWLER VPTYKEQLEN
IKDKDLGMYG FLGYPALQTA DIIIYKAKYV PVGQDQVPHL EISREIARRF HQFYPRKMHA
GIAAPERDYV FPEPKPLLTP AAKLPGTDGR KMSKSYGNSI LLSDPEAEIR AKLKTMVTDP
ARVRRTDPGN PDVCPVGDLH KIFSDAETMA KVNEGCRTAG IGCIQCKGWA ADSIVRVLAP
IQERRAKYEG NPKMVWDILE AGSAKARVAA EATMVEVREA MGMSHQYEAP NTSAAAESK