Gene Apre_0284 details

Gene Information

Locus tag: Apre_0284
Symbol: ileS
ID: 8397058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerococcus prevotii DSM 20548
Kingdom: Bacteria
Replicon accession: NC_013171
Strand:
Start bp: 316754
End bp: 319867
Gene Length: 3114 bp
Protein Length: 1037 aa
Translation table: 11
GC content: 41%
IMG OID: 644994644
Product: isoleucyl-tRNA synthetase
Protein accession: YP_003152056
Protein GI: 257065800
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0060] Isoleucyl-tRNA synthetase
TIGRFAM ID: [TIGR00392] isoleucyl-tRNA synthetase
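
The start and end coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(316754, 319867)
print(length)                  # 3114
print(protein_length(length))  # 1037
```

Both results agree with the Gene Length (3114 bp) and Protein Length (1037 aa) fields of the record.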


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCAGAAT TTAAATCTTT AGACAGCAAA GATGTAAGGA AAAGAGAAGG AGAAGTCGAA 
AAGTATTGGC AAGAAATAGA CCTACTTAAC GAGACTTTCG CTACAAGAGA GGATGCCGAA
GAATACATAA TCTACGACGG ACCTCCAACA GCAAACGGTA GACCAGGAAT CCACCACGTT
ATCGCTCGTA CCCTTAAGGA CATGACAAGC AGATACAAAA ACATGCAAGG CTACAAGGTC
CTAAAAAAGG CAGGCTGGGA TACACACGGT CTTCCTGTAG AAATAGAAGT AGAAAAGACC
CTAGGCTTTC ACGACAAAAA CGACATAGAA GAATACGGAA TCGAAAAGTT CAACAAGTTA
TGTAAGGAAT CTGTTTGGAA ATACTCCGAC CAATGGAGAG AGATGTCAGA TAGGATGGCC
TATCTCTACA ACATGGATGA TCCTTATGTC ACAATGGATA ACGACTATGT AGAAACAGAA
TGGTACCTCC TAGATAAGGC CTTCAAAAAC GGCTATATCT ACGAGGGAGC CAAGGTTATG
CCTTACTGTC CACGTTGTGG TACAGGTCTT GCAAGCCACG AGGTTGCCCA AGGTTACCAA
ATGGATAAGA CCATTACTCT TACAGTTAAG TTTAAGAAAA AGGGGACTGA TAACGAGTAC
TTCCTAGCAT GGACCACAAC CCCATGGACC CTCCCATCAA ACGTGGCCCT ATCAGTTCAC
CCAGAGCTTA CTTACGCTAA GCTTTACGTA GCAGATGAAG ATTCTTACTA TTATTGTGCC
AAATCCCTAG CTGAAAATCT TATGGGAGAA AGGGACTACG AAGTCGTAGA AGAATTCCCA
GGCAAGGATA TGGAATACTG GGAATATGAA CAACTTATGC CATATGTAAA TGTAGGAGAT
GACAAGGCCT TTATCATAAC CCTTGCCGAC TACGTTTCTG CAGAAGATGG TACAGGTATA
GTTCACTCCG CTCCAGCCTT TGGTGAGGAC GACTACCAAA TAGGAAGAAA ATACGGACTT
GCCTTTGTCC AACCAGTTGA CCTTGAAGGA TGCTTTACAG AAACTCCTTG GAAGGGCGAA
TTTATCTTTG ATACCAACGA AAAAATCTGG AGACACCTCC AAGAAGAAGG CAAGGTATTT
AACAAACAAA CTATAGAACA CAACTACCCA CACTGCTGGA GATGCCACAC ACCACTAGTA
TATTATGCAA AACCATCCTG GTATATAGAA ATGAGCAAGT TCTCAGATGC CATGGTAGAA
AACAACAAAA GCGTAAACTG GTACCCACAA ACTATAGGGG ACAAGAGATT TGGAAACTGG
CTCGAAAACG TAAAAGACTG GGCTATATCT AGGTCAAGAT ACTGGGGAAC TCCACTAAAT
ATCTGGAAAT GTGATGAATG CGGCCACACA GATACAGTTG GATCAAGAGC CGAGCTAAAG
GAACGTGCAA TCGAGGATAT ATCAGAAGAT ATAGAACTTC ACAGACCATA TGTAGATAAC
GTATCTATCA AATGTGATAA GTGCGGAGGC ACCATGCATA GGGTTCCTGA TGTAATCGAC
GTTTGGTTTG ACTCAGGAGC AATGCCATTT GCCCAACTTC ACTATCCATT CGAACACAAG
GAAGACTTTG AAGAATACTT CCCAGCAGAC TTTATCTGTG AAGGAATCGA CCAAACAAGA
GGCTGGTTCT ACTCACTAAT GGCCATATCT ACCATCACAA CAGGCAAGGC ACCTTACAAA
AATGTCTTAG TAAATGACCT TGTAGTAGAT AAGAATGGTC AAAAGATGAG TAAATCCAAA
GGAAACACCC TCGATCCATT TGCCCTCTTT GATAAATACG GGGCAGATGC CGTAAGATTC
TACTCACTTT ATGTATCTCC ACCATGGATG CAAACCAAGT TTGACGAAAA GGGCCTAATC
GAAGTTAAGA ACAACTTCTT TAGAACATTC GAAAATGTCT ACAACTTCTT CGGCCTATAC
GCTGAAACAG ATAAACTAAG TGCTGAGGAA ATAGCAGGAT TTAGCGACCT AAAACTTGAA
AAAATAGATA AATGGCTCTA CTCCAAACTA AACACCCTAA TCAAAAACTA CACAGAAGCA
ATGGACGCCT TTGACTACAA CAAGGTAGTT CACATGATTT CAGACTTCGT AGTAGAAGAT
CTATCAAACT GGTACATCAG AAGAAATAGA AAGAGATTCT GGAACTCAGA GCTTACAGAT
AGCAAGAAAG CAGTATACAA GACTACATTT GATGCAATCC TTACAATATC CAAGCTCATC
GCTCCAATCA CACCATTTAT AGCTGAAGAA GTCTTTAGAT CACTTACAGG AGAAAAGACT
GTTCACACAA GCCTCCTCCC TAAGGCAGAC GAGAAAATGA TTGATACAGA CCTTGAAGAA
AGCATGGACC TTGTAAGAAA GATAGTAAAT CTCGGTAGAG CTTCAAGAGA GAAAGAATCA
ATCAAAGTTC GTCAACCACT AGCCAAAATC ATAGTAGACG GAGCCTACAA GGAAAAAATC
GCAGACCTTA CAGGTCTAGT TAAGGAAGAG CTAAACATAA AGGATGTCGA CTTCGAAGAC
GACCTATCAG ACTTCATGGA TTACTTCCTA AAACCAGACT TTAGAGTAGT AGGAAGAATC
TTCCAATCCA AAGTCAACGA CTTCGCTAAA TTCCTAGCAT CCACCGATGC CAAGAAATTC
ATAGAAGCTG TTGAAGAAAA GCCTCAAGAA ATCACCCTAG GAGATGAGAC TTATCAAGTC
ACAAAAGACT ATCTAGACAT AAGAATCTCT GCCAAAGAAG GATTCGATGT AGAAATGGAC
GGCAATGTCT TCGTAATACT CGATACAGAA ATCACAGAAG ACCTCCGTGA TGAGGGATAT
GCGAGAGAAT TTACCTCCAA GGTTCAAAAC ATGAGAAAAG ATAGTGGATT TGAAGTAACA
GACAGAATCA ATATCTACTA CCAAGCATCC GACGAACTAA ACAAATCTCT AGAGAAATTC
AAAGAAGAAA TCACAAAAGA AACCCTAGCA GATAAGTTCG AAAGAAAAGA CCTCACAAGT
GAAGAAATAG AACTAAATGA CAAGACAGTT AAAATCGAGC TTGAGAGATT ATAA
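
The gene sequence is printed in blocks of 10 bases, so whitespace has to be stripped before any computation. As a hedged illustration, the sketch below shows one way to normalize such a listing and compute a GC percentage comparable to the record's "GC content" field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the exact rounding used by the source database is not known.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Usage: paste the full listing into `raw` to check the record's values.
raw = """ATGTCAGAAT TTAAATCTTT AGACAGCAAA GATGTAAGGA AAAGAGAAGG AGAAGTCGAA"""
seq = clean_sequence(raw)
print(len(seq), round(gc_percent(seq)))
```

Applied to the complete listing, `len(seq)` should match the Gene Length field.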
 
Protein sequence
MSEFKSLDSK DVRKREGEVE KYWQEIDLLN ETFATREDAE EYIIYDGPPT ANGRPGIHHV 
IARTLKDMTS RYKNMQGYKV LKKAGWDTHG LPVEIEVEKT LGFHDKNDIE EYGIEKFNKL
CKESVWKYSD QWREMSDRMA YLYNMDDPYV TMDNDYVETE WYLLDKAFKN GYIYEGAKVM
PYCPRCGTGL ASHEVAQGYQ MDKTITLTVK FKKKGTDNEY FLAWTTTPWT LPSNVALSVH
PELTYAKLYV ADEDSYYYCA KSLAENLMGE RDYEVVEEFP GKDMEYWEYE QLMPYVNVGD
DKAFIITLAD YVSAEDGTGI VHSAPAFGED DYQIGRKYGL AFVQPVDLEG CFTETPWKGE
FIFDTNEKIW RHLQEEGKVF NKQTIEHNYP HCWRCHTPLV YYAKPSWYIE MSKFSDAMVE
NNKSVNWYPQ TIGDKRFGNW LENVKDWAIS RSRYWGTPLN IWKCDECGHT DTVGSRAELK
ERAIEDISED IELHRPYVDN VSIKCDKCGG TMHRVPDVID VWFDSGAMPF AQLHYPFEHK
EDFEEYFPAD FICEGIDQTR GWFYSLMAIS TITTGKAPYK NVLVNDLVVD KNGQKMSKSK
GNTLDPFALF DKYGADAVRF YSLYVSPPWM QTKFDEKGLI EVKNNFFRTF ENVYNFFGLY
AETDKLSAEE IAGFSDLKLE KIDKWLYSKL NTLIKNYTEA MDAFDYNKVV HMISDFVVED
LSNWYIRRNR KRFWNSELTD SKKAVYKTTF DAILTISKLI APITPFIAEE VFRSLTGEKT
VHTSLLPKAD EKMIDTDLEE SMDLVRKIVN LGRASREKES IKVRQPLAKI IVDGAYKEKI
ADLTGLVKEE LNIKDVDFED DLSDFMDYFL KPDFRVVGRI FQSKVNDFAK FLASTDAKKF
IEAVEEKPQE ITLGDETYQV TKDYLDIRIS AKEGFDVEMD GNVFVILDTE ITEDLRDEGY
AREFTSKVQN MRKDSGFEVT DRINIYYQAS DELNKSLEKF KEEITKETLA DKFERKDLTS
EEIELNDKTV KIELERL
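
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is a minimal pure-Python version, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the set of permitted start codons. Since this gene starts with ATG, no special handling of the first codon is needed here. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments in tables 1 and 11).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA listing (whitespace allowed) up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence give the first block of the protein.
print(translate("ATGTCAGAAT TTAAATCTTT AGACAGCAAA"))  # MSEFKSLDSK
```

The output matches the first 10 residues of the protein listing. Running `translate` on the full gene sequence gives a way to check the remaining residues.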