Gene Apre_1664 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApre_1664 
Symbol 
ID8398476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerococcus prevotii DSM 20548 
KingdomBacteria 
Replicon accessionNC_013171 
Strand
Start bp1805136 
End bp1807058 
Gene Length1923 bp 
Protein Length640 aa 
Translation table11 
GC content39% 
IMG OID644996028 
Productthreonyl-tRNA synthetase 
Protein accessionYP_003153406 
Protein GI257067150 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0441] Threonyl-tRNA synthetase 
TIGRFAM ID[TIGR00418] threonyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000605435 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTAAGA TTAAATTACC TGACGGATCA GTTAAGGAAT ATGAAAGCGG AGTTCTCGTT 
GGAGATGTTA CAAAGGATAT CTCCATGGGA CTTTTTAGAG CTGCAGTAGG AGCTGTAGTA
AATGGAGAAA TCAAAGGCTT TGCTGAACCT ATAGAAGAAG ATAGTGACTT TAGAGTTGTT
AAGTTTGAAG ATAAGGAAGG AAAGGAAATT TTCTGGCATA CTTCTTCTCA CGTGATGGCT
GCTGCTATTA AGGAAATGTA TCCAGAAACA AAATTTGCTA TAGGACCAGC TATTGATGAT
GGTTTCTATT ATGATATGGA CCTTGAGCAC AGATTTACTC CAGAAGATCT AGAAAAAATC
GAAAAGAAAA TGCTAGAGAT TGCAAAACGT GATCTTAAGC TTGAAAGAGT TGAAATTTCT
AGGGCTGAAG CTTTGGAGAA ATTTAAGGAA GAAGGTCAAG ACTACAAGGT TGAGCTTATA
GAAGATCTTC CAGAAGATGA AAAGATTACT CTATATAAGA TGGGAGATGC CTTCACAGAC
CTTTGCAGGG GACCTCACCT TCTATCAACC AAAGCTATCA AGGCTGTTAA GCTTAAGACA
ATTGCCGGTG CTTATTGGAG AGGAGATTCT GATAGACAAA TGCTTCAAAG AATCTACGGA
ATTTCCTTCG AGAAGGCTAA GCAATTAGAA GAATACGAAG AACTTCAAAA AGAAATCGAA
AAGAGAGACC ATAGAAAGAT TGGTCGTGAG ATGAATCTTT TCACCTTCCA CGAAGAAGGA
CCAGGCTTCC CATTCTTCCA TCCAAATGGA ATGATCCTAA TGAACGAGCT ATTAGGTTGG
TGGACAGATG TCCTAAATGA AAATGGATAC GGAGAAATCA AGACTCCACT TATCCTAAAT
GAAGACCTAT GGCATAGATC AGGTCACTGG GACCACTACA AGGAAAACAT GTACTTCACA
AAAATTGACG GTGAAGCTTA CGCTATTAAG CCAATGAACT GCCCAGGTTC TGTTTTGACT
TATGCTTCTA ACCAACACTC TTATAGAGAC CTTCCTATTA GACTTGCTGA GTACGGTCAA
GTTCACAGAC ACGAGCTTTC AGGAACCTTA CACGGACTTT TCAGAGTAAG AACATTTACC
CAAGACGATG CTCACGTATA TTGTCTTCCA AGCCAAATCA AGGAAGAAGT TTATAGGATG
ATTGACCTAG CAGACCTCCT CTATTCAACA TTTGGTTTCA AATACACAAT CGAGCTATCA
ACAAGACCAG ATGACTTCAT GGGAGATATC AAGGATTGGG ACTTTGCTGA AGACCAACTT
AAGAAAGCCC TAGAAGAACG TGGAATCGAA TATGAAATCA ACCCAGGAGA CGGTGCTTTC
TACGGTCCAA AGATCGACTT CCACCTCCTA GATGCTGCTA AGAGAGAATG GCAATGTGGT
ACTATCCAAC TTGACTTCCA ACTACCACAA AACTTTGACC TAACTTATAT AGACGAAAAT
GGTGAAAAAC AAAGACCAGT TATGCTTCAC CGTGCCCTAC TTGGATCAAT CGAAAGATTT
ATCGGAGTTC TAACAGAGCA CTTTGCAGGA AGATTCCCAC TATGGCTCAA CCCACAACAA
GTTGAAATAA TCCCAGTATC TGACAAGTTT ATCGACTACT GTGAAGATCT AAAAGAAAAA
ATCAAAGAAG CAGGATTTAG AGTAAATATT GACTACAGAA GTGAGGGAGT AGGATACAAG
ATTCGTCAAG CTCAACTAAT GAGAACAAAC TACATGCTTG TAGTAGGAGA AAAGGAAGAA
ACTTCTGGCA AACTTACTGT AAGAAATAGG GACGGAGAAG AGACAGCTGA CGTTTCAGTA
GATGAATTTA TAGAAAAAAT CTCTAAAGAA AGAGATTCAA GAAGCATGGA AAATATATTC
TAA
 
Protein sequence
MIKIKLPDGS VKEYESGVLV GDVTKDISMG LFRAAVGAVV NGEIKGFAEP IEEDSDFRVV 
KFEDKEGKEI FWHTSSHVMA AAIKEMYPET KFAIGPAIDD GFYYDMDLEH RFTPEDLEKI
EKKMLEIAKR DLKLERVEIS RAEALEKFKE EGQDYKVELI EDLPEDEKIT LYKMGDAFTD
LCRGPHLLST KAIKAVKLKT IAGAYWRGDS DRQMLQRIYG ISFEKAKQLE EYEELQKEIE
KRDHRKIGRE MNLFTFHEEG PGFPFFHPNG MILMNELLGW WTDVLNENGY GEIKTPLILN
EDLWHRSGHW DHYKENMYFT KIDGEAYAIK PMNCPGSVLT YASNQHSYRD LPIRLAEYGQ
VHRHELSGTL HGLFRVRTFT QDDAHVYCLP SQIKEEVYRM IDLADLLYST FGFKYTIELS
TRPDDFMGDI KDWDFAEDQL KKALEERGIE YEINPGDGAF YGPKIDFHLL DAAKREWQCG
TIQLDFQLPQ NFDLTYIDEN GEKQRPVMLH RALLGSIERF IGVLTEHFAG RFPLWLNPQQ
VEIIPVSDKF IDYCEDLKEK IKEAGFRVNI DYRSEGVGYK IRQAQLMRTN YMLVVGEKEE
TSGKLTVRNR DGEETADVSV DEFIEKISKE RDSRSMENIF