Gene Amuc_1031 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1031 
Symbol 
ID6274084 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1223090 
End bp1224505 
Gene Length1416 bp 
Protein Length471 aa 
Translation table11 
GC content59% 
IMG OID642613080 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_001877638 
Protein GI187735526 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones75 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACACC TTTACGATAC CCGCACCAGA ACGGCCCAGG ACATTTCTCC CATGGATGGA 
AAAACACTGC GCTTTTACTG TTGCGGCCCC ACGGTGTACG GCCCTACCCA CATCGGCAAT
TTCCGCACTT TCGTGATGCA GGACGTCTTC CGCCGCGTCC TGGAACTGGG GGGGGTGCCC
ACCACGCATA TCCGCAATCT GACGGATGTG GACGACAAAA CTATCCGGGA TTCTCAAAAG
GCTGGCGTTT CTCTGGCGGA ATTCACCGCA GGCTGGGCGG ATCTGTTCCA CCGGGACTGT
GCCGCCCTTA ATTGCCTGCC TCCCCATGCG GAACCCTCCG CCGTGGGCCA TATTCCCGAA
CAAATACGGA TGGTTCAAAC ACTGGTGGAA AAGGGCCATG CCTATGTATC GGAAGACGGT
TCCGTGTATT TCAGAATTTC TTCCTTCCCG GAATACGGAA GGCTTTCCCA CCTGGACGAA
CGTGAACTGG ATTTAGGAAA AACCGCCAAT ACCCGGTCCA ACGCAGACGA ATATGAAAAA
GACTCCGTGG CAGACTTCGT GCTGTGGAAG AGCCGCAGGC CGGAAGACGG AAACAACTTC
TGGCCCTCTC CCTGGGGAGA AGGCCGCCCC GGCTGGCACC TGGAATGCTC CGCCATGATC
CATAAATACT TCGGCAATGA CTTTGATCTC CACTCCGGCG GCGTGGATCT GGTATTCCCC
CACCATGAAA ACGAAGTGGC CCAGTCCCGC TGCGCCTGCG GCGGCGGCTT CGCGCGCCTG
TGGTTCCACA TCACGCACCT GCTGGTGGAC GGAGGCAAGA TGTCCAAATC CCTGGGCAAC
ATGTACACGC TGGCGGATTT GGACAAACTG GGCCACAGGC CGTCCGCGGT CCGGTACGTG
CTGGCGGGGG GCTATTACCG CCGTCCGTTG AATTTCACCC TTTCCTCTCT GGAAGACGCT
AAAGCCGCGC TGAACCGCCT GTCCAAATTC GATATGCAGC TCAGGAACGC CTCCGGAACG
GATTCCGTTC CCTCCTATGA GGAATTCTGC GCGGCATTCC CGGAATTGGG AATTTTCCAG
CCGGCATGGG ACAGCCTGAA CGATGACCTA AACACTCCGG AAGCCCTGGG CCATGTTTTC
AGCGCCATCA GGAAGGCGGA TATCCCCTCC CTTTCACCGG AGGAGGCGGC CCGCCTGCGG
AATGCCTTCC ACTTTATTCT GGCCGCCTTC GGCATTATTC TGCCGGAGGA GGGACAGGAG
GAAGCCCCGG AAGAAATCCG CACCCTGGCG GATCAGCGCT GGCAGGCCAA GCAGAACCGG
GACTGGACGG AAGCCGACCG CCTGAGGGCG GAAGTGGCAG CGCTGGGCTG GGTCATTAAA
GACCGCAAGG ACGGATACGA CCTGGCACGC AAATAA
 
Protein sequence
MLHLYDTRTR TAQDISPMDG KTLRFYCCGP TVYGPTHIGN FRTFVMQDVF RRVLELGGVP 
TTHIRNLTDV DDKTIRDSQK AGVSLAEFTA GWADLFHRDC AALNCLPPHA EPSAVGHIPE
QIRMVQTLVE KGHAYVSEDG SVYFRISSFP EYGRLSHLDE RELDLGKTAN TRSNADEYEK
DSVADFVLWK SRRPEDGNNF WPSPWGEGRP GWHLECSAMI HKYFGNDFDL HSGGVDLVFP
HHENEVAQSR CACGGGFARL WFHITHLLVD GGKMSKSLGN MYTLADLDKL GHRPSAVRYV
LAGGYYRRPL NFTLSSLEDA KAALNRLSKF DMQLRNASGT DSVPSYEEFC AAFPELGIFQ
PAWDSLNDDL NTPEALGHVF SAIRKADIPS LSPEEAARLR NAFHFILAAF GIILPEEGQE
EAPEEIRTLA DQRWQAKQNR DWTEADRLRA EVAALGWVIK DRKDGYDLAR K