Gene Amuc_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1999 
Symbol 
ID6274518 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2428414 
End bp2429439 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content62% 
IMG OID642614059 
Productphenylalanyl-tRNA synthetase, alpha subunit 
Protein accessionYP_001878591 
Protein GI187736479 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0016] Phenylalanyl-tRNA synthetase alpha subunit 
TIGRFAM ID[TIGR00468] phenylalanyl-tRNA synthetase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.19411 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGAAG AGATTGTACG CATCCAACGG GACGCCCTGG CACGCATCGC CCAGGCGTCT 
GACAGGCGCG GCGTGGAAGA CGCGCGCGTG GCCATCCTCG GCAAAAAAGG GGAACTGACC
CTGGCCCAGA CGGGCATGAA GGACGTTCCC AGAGAGGAAA AGCCCGCTGT CGGCCAGTTG
CTTAACGAGG CCCGCAAGGC CATTACGGAA GCTCTGGACG CCAAGCTGGA GGAAGTGCAG
GCCCAGGCGG ACAAGGCCGC CGTGGCCGGT GTGGACCTGA CGCTTCCGGC CCGCTCCCTG
CCGCCTGGCG GTCTGCACCC CCTCACCATC GTTAGGGATG AAGCCGTCCG CATTCTGCGC
CACATGGGCT TTGCCCTGGC GGACGGCCCG GAGATTGAGG ACGAGTTCCA CTGCTTTGAC
GCGCTGAACA CGCCGGAAGA TCACCCGGCC CGCAATGAGA AGGATACATT TTACTTTGAT
TCCGGCAAGC TTCTGCGTAC GCACACGTCT TCCGTGCAGA TCCGCTCCAT GGAAAAGCAG
CTGCCGCCCG TGCGTGTCAT CGCTCCCGGT TCCGCCTACC GCCGCGACGA AATTGACGCC
ACGCACCTTT CCGCCTTCAA TCAGCTTGAA GGCCTGTATG TGGATACGGA CGTTTCCGTG
GGCGACCTGA AAGGAACGCT GGAATATTTT CTGCGCGCCC TTTTCGGTTC CGGAACGGAG
GTGCGCTTCC GCCCCCATTT CTTCCCGTTC ACGGAACCCA GCTTTGAAAT TGACGTCAAG
CTGAAGGTGG ACGGCCAGGC CCCCCGCTGG GTGGAGATTG CCGGCTGCGG CATGGTGGAT
CCCAATGTTT TTGAAGCCGT GGACCGCGAA CTGGGCCTGG ACCCCGGAGC GCAGGCCCGC
TACACGGGGC TGACCGGCTT TGCCTTCGGC ATCGGCCTGG ACCGCCTGGC GATGATCCGC
TGGGGCATCA GGGACATTCG CGCCCTGATT GAGAATGATG TGCGCTTCCT TGCCCAATTC
CAATAA
 
Protein sequence
MKEEIVRIQR DALARIAQAS DRRGVEDARV AILGKKGELT LAQTGMKDVP REEKPAVGQL 
LNEARKAITE ALDAKLEEVQ AQADKAAVAG VDLTLPARSL PPGGLHPLTI VRDEAVRILR
HMGFALADGP EIEDEFHCFD ALNTPEDHPA RNEKDTFYFD SGKLLRTHTS SVQIRSMEKQ
LPPVRVIAPG SAYRRDEIDA THLSAFNQLE GLYVDTDVSV GDLKGTLEYF LRALFGSGTE
VRFRPHFFPF TEPSFEIDVK LKVDGQAPRW VEIAGCGMVD PNVFEAVDRE LGLDPGAQAR
YTGLTGFAFG IGLDRLAMIR WGIRDIRALI ENDVRFLAQF Q