Gene Sare_3866 details


Gene Information

Locus tag: Sare_3866
Symbol: ileS
ID: 5705897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 4398782
End bp: 4401961
Gene Length: 3180 bp
Protein Length: 1059 aa
Translation table: 11
GC content: 69%
IMG OID: 641273287
Product: isoleucyl-tRNA synthetase
Protein accession: YP_001538649
Protein GI: 159039396
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0060] Isoleucyl-tRNA synthetase
TIGRFAM ID: [TIGR00392] isoleucyl-tRNA synthetase
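
The coordinates and lengths in this record can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; neither convention is stated in the record. The function names are illustrative only.

```python
# Values copied from the Gene Information record.
START_BP = 4398782
END_BP = 4401961
GENE_LENGTH_BP = 3180
PROTEIN_LENGTH_AA = 1059


def span_length(start: int, end: int) -> int:
    """Length of a coordinate span, assuming 1-based inclusive positions."""
    return end - start + 1


def coded_residues(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 3180
print(coded_residues(GENE_LENGTH_BP))   # 1059
```

Under those assumptions the figures agree: the span is 3180 bp, which is 1060 codons, or 1059 residues plus one stop codon.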


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.265551
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00623467
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCTATC CGTTGCATGA CCCGGCCACG ACCGGAGTCC CCGCGAGCCC GGACCTGCCC 
GCGGTCGAGC GTCGGGTCCT GGAGCACTGG ACGGCCGACA AGACCTTCGA GGCATCTGTC
GAGGCCCGGC CTGCTTCCCG GCAACCGGCA AGCGGTGTCG TCGCCGACGC CGCACCCGCC
GACAACGAGT ACGTCTTCTA CGACGGTCCG CCCTTCGCCA ACGGCCTGCC ACACTACGGC
CACCTCCTCA CCGGCTACGT CAAAGATGTG GTCCCGCGCT ACCAGACCAT GCGCGGCCGG
CGGGTGGAGC GTCGGTTCGG CTGGGACTGC CATGGGCTGC CCGCTGAGGT GGTCGCCGAG
AAGCAGCTCG GCATCACCAG CAAGGCGGAG ATCCTCGACC TGGGGGTGGC CCGGTTCAAT
GAGGCGTGCC GCACGTCGGT GCTGGAGTTC ACCCAGGACT GGGAGCGCTA CGTCACCCGC
CAGGCTCGTT GGGTCGACTT CGCCAGCGAC TACAAGACCC TCGACCTGGA CTACATGGAG
AGCGTCCTGT GGGCCTTCCG GACCCTGCAC GACAAGGGCC TGGTCTACGA GGGCTTCCGG
GTGCTCGCGT ACTGCTGGCG TTGCGAGACG CCGTTGTCGA ACACCGAGAC CCGGATGGAC
GACGTCTACC GGGACCGGCA CGATCCGACG CTGAGCGTGT GGTTCGCGCT CACACCGGAC
GAGTCGGCGC CGGAGCCGGT GCGCGGTGCC GTTCAGCTGG GGGTCTGGAC CACCACGCCG
TGGACGCTGC CGTCCAACCT GGCGCTCGCC GTCGGCCCGG ACATCGAGTA CGCGGTGCTG
GAGCGGGACG GGCAGCGCTA CGTTCTCGGC GCCGCCCGGC TCGCCGCGTA CGCCAAGGAG
CTGGAGGGGT ACCGTCAGGT CGGCACGGTG TACGGCCGGG ATCTGATCGG GCGCCGGTAC
ACCCCGTTGT ACGACTTTCT GGTCGGGCCG GCGGGTGAGC ACGCGTACCA GGTGCTCGGC
GCGGATTTCG TGACCACCGA GGACGGTACC GGGATCGTGC ACCTGGCACC GGCGTTCGGC
GAGGACGACC AGAACACCTG CCACGCCGCC GGCATACCGA CCGTCGTCAC GGTGGACGAC
CGCACCCGGT TCACCGCGCT GGTCCCGCCG TACGAGGGTG AACAGGTCTT CGACGTCAAC
AAGCCGGTGA TCCGGGAGCT GAAGGAGCGC GGTGTGGTGC TCCGGCAGGA CACCTACACC
CACGCGTATC CGCATTGCTG GCGCTGCGAC ACCCCGCTGG TCTACAAGGC GGTCTCGTCC
TGGTTCGTCG CGGTGACCCG GCTCAAGGAG CGAATGGTCG AGCTGAACCA GCAGATCAAC
TGGACACCGG GTCACATCAA GGACGGCTCG TTCGGCAAGT GGCTGGCCAA CGCCCGGGAC
TGGTCGATCA GCCGCAACCG GTTCTGGGGG TCGCCGATCC CGGTGTGGAA GTCCGACGAC
CCAGCCTATC CGCGGGTGGA CGTGTACGGC TCACTCGCGG AACTGGAGCG GGACTTCGGC
GTGCGCCTGA CCGACCTGCA CCGGCCGGCG GTGGACGAGC TGGTCCGTCC GAACCCGGAC
GACCCGACGG GGAAGTCCAT GATGCGCCGG GTTCCGGAGG TGCTGGACTG CTGGTTCGAG
TCCGGATCGA TGCCGTTCGC CCAGGTGCAC TACCCGTTCG AGAACGCCGA GTGGTTCGAG
TCCCACTATC CGGGTGACTT CATCGTCGAG TACATCGGGC AGACCCGAGG CTGGTTCTAC
ACCATGCACG TGCTCGCCAC GGCGCTGTTC GATCGGCCAG CCTTCCGTAA CTGCCTGAGT
CACGGCATCC TGCTCGGGTC CGATGGGCGC AAGATGTCCA AGAGCCTCCG TAACTACCCG
GACGTTTACC ACATCTTCGA CACGTACGGC TCGGACGCGA TGCGCTGGAT GCTGATGTCC
TCACCGGTGC TGCGCGGTGG TGACATGGCG GTGACCGAGG CCGGCATCCG GGACGCGGTC
CGGCAGGTGC TGCTGCCGTT GTGGAACGTC TGGTACTTCT TCTCGCTCTA CGCCAACGCC
GACGGCCACC TGGCCCGGCG GAGCACCACC TCGACGCACC TGCTCGACCG GTACGTGCTG
GCGAAGACGA ACGAGCTGGT GTCAATGGTG CAGGCGCAGC TGGAGGCGTA CGACATCTCC
GGCGCCTGCG GCACCGTCCG GTCCTACCTG GACGCATTGA CCAACTGGTA CGTACGTCGT
TCGCGAGATC GGTTCTGGTC TGGCGACGCG GACGCGTTCG ACACGCTGTG GACGGTGCTG
GAGACGCTCT GCCGAGTGGT TGCGCCGCTG GCGCCGCTGA CCGCCGAGGA GATCTGGCGA
GGCCTGACCG GCGAGCGCTC GGTGCACCTG ACCGACTGGC CGGCGGCGGA GGAGTTTCCC
GCCGACCACG ATCTGGTCGC CGCAATGGAT TCCGTTCGTG CGGTCGCCTC GGCCGCACTG
TCGCTGCGCA AGTCCCAGGG CCTGCGGGTG CGGCTGCCGC TGTCGGTGTT GACCGTCGCC
ACGCCGGCCG CAGACGCGCT GCGACCCTTC GCCGACCTGG TCGCCGACGA GGTCAACGTG
AAGCGGGTCG AGTTCACCGA CGAGGTGGGC AACTACTGCC AGCAGGTGTT GACGGTGGTC
CCCCGAGCGC TCGGCCCGCG GGTGGGCAAG GCAGTGCAAC AGGTGATCCG GGCGGTCAAG
GCCGGGCAGT GGGAGCTGGT CGACGGCGCT CCGGTCGCCG CCGGGGTCAC CCTCGCCGAA
GGCGAGTACG AGCTGCGGCT GGTCGCCGCC GACGCCGAGC ACTCGGCGCC GCTGCCCGGG
GGCGACGGCG TGGTCGTGCT GGACACCGAG GTGACTCCGG AACTGGCTGC CGAGGGGCTG
GCCCGGGACG TGGTCCGAGT GGTGCAGCAG GCCCGCCGGG ACGCCGACCT GGACGTTTCG
GACCGCATCG TGGTCGCGCT CGCCGCCTCC GACGAGGTGT GGGCGGCGGT GTCCGCGTAC
CGTGACGTCG TGTCCCGGGA GGTGCTGGCC GACTCGGTCG ACCTCACCCC AGGGCTGGCC
GGGTTCACCG GTGAGGTGGG TGACGGTGAG CAGGTTGCCG TGACCGTCCG CCGGATCTAG
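
The gene sequence is listed in space-separated blocks of ten bases, so it needs light cleaning before any analysis. The following is a minimal sketch of how the listing could be joined into one string and how a GC fraction could be recomputed for comparison with the 69% reported in the record. The helper names are hypothetical, and how the database rounds its GC figure is not stated.

```python
def clean_sequence(listing: str) -> str:
    """Join a block-formatted sequence listing into one uppercase string."""
    return "".join(listing.split()).upper()


def gc_fraction(listing: str) -> float:
    """Fraction of bases that are G or C in a sequence listing."""
    seq = clean_sequence(listing)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used here as a small example.
sample = "ATGGCCTATC CGTTGCATGA"
print(clean_sequence(sample))  # ATGGCCTATCCGTTGCATGA
print(gc_fraction(sample))     # 0.5
```

Passing the complete listing to `gc_fraction` would give the fraction for the whole gene.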
 
Protein sequence
MAYPLHDPAT TGVPASPDLP AVERRVLEHW TADKTFEASV EARPASRQPA SGVVADAAPA 
DNEYVFYDGP PFANGLPHYG HLLTGYVKDV VPRYQTMRGR RVERRFGWDC HGLPAEVVAE
KQLGITSKAE ILDLGVARFN EACRTSVLEF TQDWERYVTR QARWVDFASD YKTLDLDYME
SVLWAFRTLH DKGLVYEGFR VLAYCWRCET PLSNTETRMD DVYRDRHDPT LSVWFALTPD
ESAPEPVRGA VQLGVWTTTP WTLPSNLALA VGPDIEYAVL ERDGQRYVLG AARLAAYAKE
LEGYRQVGTV YGRDLIGRRY TPLYDFLVGP AGEHAYQVLG ADFVTTEDGT GIVHLAPAFG
EDDQNTCHAA GIPTVVTVDD RTRFTALVPP YEGEQVFDVN KPVIRELKER GVVLRQDTYT
HAYPHCWRCD TPLVYKAVSS WFVAVTRLKE RMVELNQQIN WTPGHIKDGS FGKWLANARD
WSISRNRFWG SPIPVWKSDD PAYPRVDVYG SLAELERDFG VRLTDLHRPA VDELVRPNPD
DPTGKSMMRR VPEVLDCWFE SGSMPFAQVH YPFENAEWFE SHYPGDFIVE YIGQTRGWFY
TMHVLATALF DRPAFRNCLS HGILLGSDGR KMSKSLRNYP DVYHIFDTYG SDAMRWMLMS
SPVLRGGDMA VTEAGIRDAV RQVLLPLWNV WYFFSLYANA DGHLARRSTT STHLLDRYVL
AKTNELVSMV QAQLEAYDIS GACGTVRSYL DALTNWYVRR SRDRFWSGDA DAFDTLWTVL
ETLCRVVAPL APLTAEEIWR GLTGERSVHL TDWPAAEEFP ADHDLVAAMD SVRAVASAAL
SLRKSQGLRV RLPLSVLTVA TPAADALRPF ADLVADEVNV KRVEFTDEVG NYCQQVLTVV
PRALGPRVGK AVQQVIRAVK AGQWELVDGA PVAAGVTLAE GEYELRLVAA DAEHSAPLPG
GDGVVVLDTE VTPELAAEGL ARDVVRVVQQ ARRDADLDVS DRIVVALAAS DEVWAAVSAY
RDVVSREVLA DSVDLTPGLA GFTGEVGDGE QVAVTVRRI
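
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ in which codons may act as start codons). It assumes the listed gene sequence is already in coding orientation and begins at the start codon; the `translate` helper is illustrative, not the database's own pipeline.

```python
from itertools import product

# Codon assignments in NCBI order: bases T, C, A, G, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(listing: str) -> str:
    """Translate a DNA listing in frame 1, stopping at the first stop codon."""
    seq = "".join(listing.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGGCCTATC CGTTGCATGA CCCGGCCACG"))  # MAYPLHDPAT
```

The output matches the first ten residues of the protein sequence listed above. Codons containing characters other than A, C, G, or T would raise a `KeyError` in this sketch.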