Gene Acid345_4033 details

Gene Information

Locus tag: Acid345_4033
Symbol:
ID: 4071172
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4767751
End bp: 4768962
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 60%
IMG OID: 637986063
Product: threonyl/alanyl tRNA synthetase, SAD
Protein accession: YP_593107
Protein GI: 94971059
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0013] Alanyl-tRNA synthetase
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.22843
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGGAAC GGCTTTATTA CAACAATAGT TTCTTGCTGA ACTTTACGGC GGCGGTGCTG 
GACGCGCGTG TGGAGGACGG GCGCGCGATC GTCGTGCTCG ACCGGACGGC GTTTTATCCA
ACGAGCGGTG GGCAGATTTT TGATACCGGC TGGATGGAGT TGGAGAAGGA CGCGCGGAAG
TTGCGCGTGA GTGAAGTCGG CGAGACGGAA GAAGGCGTCA TTCAGCATTA CGTGGACACG
TCGGATGTAG AGACGCTGAA AGACGGGCGG GTCCGCGGGT TCATTGACGT GGAGCGGCGT
CGCGACCACA TGCAACAGCA CACGGGGCAG CACGTGTTGT CGTCGGCGTT CGAGTCGTTG
TTTGAGATGA AGACGGTGTC GTTCCATATG GGCGCGGAGA GTTGCACCAT CGATCTCGAT
ACCAAGGCCC TGGCGCCGGA ACAAGTGAAG AAAGCCGAGG CCGTGGCCAA TGAGGTGATC
GCCGAGGACC GTCCGGTGGA GATCAAGTAC GCGACGGTGG ATGAGGCGCG TGCGATGGGG
GTGCGGAAGA TTCCACCGGC GGAGCGCGAG AAGCTGCGGC TGATTGATAT CAAAGATTTC
GATCTGAATG CATGCGGTGG AACCCATGTG CGTGCGACGG GACAGATCGG GGGACTCCTG
ATCCGGAAGA TCGCGAAGGA GAAGCAGGGG TTTCGGGTGG AGTTTGTCTG CGGCGGACGC
GCGGTGAACA CGGCGCGCAG GGATTTCGAA ACGCTCACAG ACGCAGCGAC TTTGTTCTCA
AGCCACATCT ACGATGTGCC GGTGCAGGTG CGGAAGCTGA TTGAAGAAAA CAAGGCAGGA
ACGAAGCGCG AGCACAAACT GCTGGAAGAA GTCGCGTCGC TCACGGCGGA CGTGATGCTG
GCGCAGCTCG GTGACAAGAA GGTCGTGAGG CAGTTTTACA CGGACCGGGA TATGACGTTC
ATCAAGCTTC TGGCGCAGCG CCTGACCCGA CAGGGAAGCG TGGTGGCATT GCTTGGGTGC
GGGGGCACGC AGCCCGCCGT TATATTCGCC CAGACTTCCG GGCTCCCGAA TGACATGGGT
GGGCTGATGA AAGAGGCGCT GGTGGAACTT GGCGGGCGCG GTGGCGGGAA CAAAGATATG
GCGCAGGGCG GAGCTACGGA CGCGTCTAAG ATAGAGGCGG TACTGGAGAA GATCGCAGGC
AGAATCGCGT AA
 
Protein sequence
MTERLYYNNS FLLNFTAAVL DARVEDGRAI VVLDRTAFYP TSGGQIFDTG WMELEKDARK 
LRVSEVGETE EGVIQHYVDT SDVETLKDGR VRGFIDVERR RDHMQQHTGQ HVLSSAFESL
FEMKTVSFHM GAESCTIDLD TKALAPEQVK KAEAVANEVI AEDRPVEIKY ATVDEARAMG
VRKIPPAERE KLRLIDIKDF DLNACGGTHV RATGQIGGLL IRKIAKEKQG FRVEFVCGGR
AVNTARRDFE TLTDAATLFS SHIYDVPVQV RKLIEENKAG TKREHKLLEE VASLTADVML
AQLGDKKVVR QFYTDRDMTF IKLLAQRLTR QGSVVALLGC GGTQPAVIFA QTSGLPNDMG
GLMKEALVEL GGRGGGNKDM AQGGATDASK IEAVLEKIAG RIA
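
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal Python sketch, not part of the original record, of how the blocks might be joined and how the listed lengths relate: for an unspliced CDS, the protein length should equal the number of codons minus one stop codon (1212 / 3 - 1 = 403). The helper names `clean_sequence`, `gc_percent`, and `expected_protein_length` are hypothetical and chosen only for illustration.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS: codon count minus one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# First line of the gene sequence exactly as printed in this record.
first_line = "ATGACGGAAC GGCTTTATTA CAACAATAGT TTCTTGCTGA ACTTTACGGC GGCGGTGCTG"
dna = clean_sequence(first_line)

print(len(dna))                        # 60 bases on the first printed line
print(expected_protein_length(1212))   # 403, matching the listed protein length
```

Passing the full gene sequence through `clean_sequence` and then `gc_percent` would give a value to compare against the GC content listed in the Gene Information section.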