Gene Amuc_1397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1397 
Symbol 
ID6275698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1669309 
End bp1670583 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content59% 
IMG OID642613454 
ProductHistidine--tRNA ligase 
Protein accessionYP_001878002 
Protein GI187735890 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0124] Histidyl-tRNA synthetase 
TIGRFAM ID[TIGR00442] histidyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000267418 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGACG CTCGCTTTCA ACCACTTCCC GGATTCCGCG ATTTTGCACC GGCGGATTGC 
GCCGTGCGCA ACTATCTTTT CAACGCATGG AAGAAGGTGG CGCATCGTTA CGGATTCCTG
GAATGGGAGG GCCCGACTGT GGAAGCCACG GAGCTTTATC TTAAAAAGAG CGGAGGCGAG
CTGCCCACCC AGCTCTTCCG CTTCACGGAC CAGGGGGACC GGGACATCAC CATCCGCCCG
GAACTCACCG CCTCCCTGGG ACGCATCGCA GCCGCTTATC AGCGCGAATA CACCAAACCG
CTCAAGTGGT TTGAAATCGG CTCATGCTTC CGTTATGAAA AACCCCAGAA AGGGCGTCTG
CGCGAATTTT ATCAATTCAA TGCGGATATT CTGGGAGAAG CCTCCGCGTG GGCGGATTCC
GAACTGATAG CCCTGGCCAT CGACTGCATG AGGGAACTGG GCTTTACGCA AAACGACTTC
ATTGTGCGCG TCTCTGACCG GGAAGCCTGG ATCAGGTTTG CTTCGGAACA CGGCGTGCAG
GAACAGGATA TTCCCGCTTT CCTCGGCATC GTGGACAAAT TTGAACGTGA CCGCCCGGAA
GAATGCCAGC GCAAGCTGGA CGCCTTTCAT ATCAGCCGCG GCGATCTGGT GGCCTTCATT
GAAAACCCGC CCGCCGGAGC GTCCGAACGC TATGACATCC TGATGAAAGA CCTGACGGCC
CGCGGGCTGG ACGGCTATGT CAAGCTGGAC TTATCCGTAG TGCGCGGACT GGCATACTAC
ACCGGTCTGG TTTTTGAAAT CTTTGACACA CAGCGCAGCC TGCGCGCCGT AGCCGGTGGA
GGACGTTACG ACACGCTGGT AGGCGCCCTT TCCAACAACG CAGTGGACAT GCCGGCAACC
GGTTTTGCCA TGGGGGATGC CGTCATCACC CACCTTATTG AACAGACGCC CCATGCCAGG
GCGCTGAAAG ATGCGGCGCT GGCCTCCGCA GGCTGCGACA TCTTCATGGT TCAGGCATCT
GAAAGCCGCC GCGCGGAAGT TCTTGCCATC GTCTCCGCCC TCCGGGACCA GGGATACAGT
GTGGACCTTC CCCTCACGCT CACCAAAGTC AACGGACAGC TCCAGAAGGC CGTCAAATCC
GGCGCCCGCG CGGCCTTGAT CGTGGGAGAT GAATTCCCGG TCATGGAACT TCGCGACCTG
GGCGCGCGCA CCTCATCCCC CGTCGCCATG GACGACCTGT TTGACGCCGT GGCTTCACTC
ACCGGAAGCA ATTGA
 
Protein sequence
MPDARFQPLP GFRDFAPADC AVRNYLFNAW KKVAHRYGFL EWEGPTVEAT ELYLKKSGGE 
LPTQLFRFTD QGDRDITIRP ELTASLGRIA AAYQREYTKP LKWFEIGSCF RYEKPQKGRL
REFYQFNADI LGEASAWADS ELIALAIDCM RELGFTQNDF IVRVSDREAW IRFASEHGVQ
EQDIPAFLGI VDKFERDRPE ECQRKLDAFH ISRGDLVAFI ENPPAGASER YDILMKDLTA
RGLDGYVKLD LSVVRGLAYY TGLVFEIFDT QRSLRAVAGG GRYDTLVGAL SNNAVDMPAT
GFAMGDAVIT HLIEQTPHAR ALKDAALASA GCDIFMVQAS ESRRAEVLAI VSALRDQGYS
VDLPLTLTKV NGQLQKAVKS GARAALIVGD EFPVMELRDL GARTSSPVAM DDLFDAVASL
TGSN