Gene Amuc_1145 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1145 
Symbol 
ID6273890 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1372509 
End bp1374116 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content54% 
IMG OID642613197 
Producthypothetical protein 
Protein accessionYP_001877752 
Protein GI187735640 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3209] Rhs family protein 
TIGRFAM ID[TIGR01643] YD repeat (two copies) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0989818 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0000170395 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAAAT ATTCATCCTG CCTTATCCAC CGGAATTTTC TGGTGGGAAT TAGCTATTCG 
GACGCCACGC CCGGCCAGAG CTTCGCCTAC AATCCCTACG GAGAACTGGA AACCGACAAC
CTGCGAGTGG GCAACCACAC CCACCTCATC ACCGAGAAAA AAGACGCCTT CGGGCGCAGC
ACAGGCTACA TCTACGCGCG GAGCGGCGCG GCGCAGCACA CCGTAAGCAT CGGCTACGGG
GAAGACGGCC GCATCGCCAC GGCCGGGTTC CTGCAGGGCA GCGCGCCCCA GACCTTCACC
TGGCAATACA TGGAAGGAAG CGGCCTGCCC TCCGTGATGG CCATGCCCAA CGGCATGACG
CTGGAATGGG GCTATGAGGA AAAGCGGAAT CTGGTCTCTA CCATGGTCTA CAAACGCGGC
GCCACAAGAG TTGTGGAAAG GGAATACGTT TACGACAGCC TGGCAAGGCC CGTTACGCGG
CGCACGGCCC GCCAGGGGAA TACCGTCAAC GACAGCTTTG CCTGCAACAG CCGGAGCGAA
CTGACCGCCG CCACCGTCAG CGGCAAGACA TACGGCTACA GTTACGACAA CATAGGCAAC
CGGAAAACCG CGCGGGAAGA CGCGGAAGAA GCGACCGCCT ACACAACCGG CCCGCTCAAC
CAATACACCG CCATCGAGCG CGGAGAAGAA GCCGCCTTTG AGCCGGTTCA CGACGCGGAC
GGCAATCAGA CATTGATCAG GACATCCACC GGCATCTGGC AAGTCGCCTA CAACGCGGAA
AACCGCCCGG TGCGCTTCGT CAATGAAAGC GCCAAAACCG TGGTGGAATG CACCTACGAC
TACATGGGCA GGCGGCACAC CCGCAAAGTG AGCGTCAACG GGACGGTGAG CAGCTACCTG
CGCTACATGT ACCGTGGCTA CCTGCAAATA GCCGCCATAG ATGCCGTCAG CGGAGTCTTT
CGATGGTTCC TGTTCTGGGA CCCGACGCAG CCTGAGGCTG CGCGTCCGCT GGCCATCCGC
AAAGACGGCG CCTGGTACGC CTACGGGTGG GATCTGACCG GGAACGTCAC GGGAATCTTC
GGGAAAGCCG GTTACCTGCG GACGGTCTAC ACCTACACGC CTTACGGAGA AGTCACCGCC
GAAGGGGATG TCACCCAGCC CATCCAGTGG AGCAGCGAAT ACAGTGACGA AGAACTGGGG
TTGGTCTACT ACAACTACCG GCATCTCAAT CCGCACGACG GAAGGTGGAT CAGCCGCGAT
CCCATCGAGG AAGAAGGTGG TTGGAATTTG TTCGCGTTTG TAGGAAATAA AATTTTTAAT
CAATCTGATA TTTTAGGGTT GATATGTACA ATAGAATATA GTATAAAATT ACATACAATA
TTAATAAGGA AGGTAGATAA AGATAGTAAT ATACTTCGTT TAACGACGAG TAGAGTTTTT
TCTGGAAATG GTGATGGGAA AAATAATCCG GACAATGTTG GAAATAAAGA TAACGGTCCT
ATACCACCAG GGAAATACTA TGTGATAAAA AGACAATCTG GAGGAATTCG GAGCCAAATT
AAAGATTGGA CCTACAAATT ATGGAATGAT AATGATAAGA ATCAATAG
 
Protein sequence
MSKYSSCLIH RNFLVGISYS DATPGQSFAY NPYGELETDN LRVGNHTHLI TEKKDAFGRS 
TGYIYARSGA AQHTVSIGYG EDGRIATAGF LQGSAPQTFT WQYMEGSGLP SVMAMPNGMT
LEWGYEEKRN LVSTMVYKRG ATRVVEREYV YDSLARPVTR RTARQGNTVN DSFACNSRSE
LTAATVSGKT YGYSYDNIGN RKTAREDAEE ATAYTTGPLN QYTAIERGEE AAFEPVHDAD
GNQTLIRTST GIWQVAYNAE NRPVRFVNES AKTVVECTYD YMGRRHTRKV SVNGTVSSYL
RYMYRGYLQI AAIDAVSGVF RWFLFWDPTQ PEAARPLAIR KDGAWYAYGW DLTGNVTGIF
GKAGYLRTVY TYTPYGEVTA EGDVTQPIQW SSEYSDEELG LVYYNYRHLN PHDGRWISRD
PIEEEGGWNL FAFVGNKIFN QSDILGLICT IEYSIKLHTI LIRKVDKDSN ILRLTTSRVF
SGNGDGKNNP DNVGNKDNGP IPPGKYYVIK RQSGGIRSQI KDWTYKLWND NDKNQ