Gene Amuc_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0043 
Symbol 
ID6275154 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp59317 
End bp61533 
Gene Length2217 bp 
Protein Length738 aa 
Translation table11 
GC content60% 
IMG OID642612085 
Productprimosomal protein N' 
Protein accessionYP_001876671 
Protein GI187734559 
COG category[L] Replication, recombination and repair 
COG ID[COG1198] Primosomal protein N' (replication factor Y) - superfamily II helicase 
TIGRFAM ID[TIGR00595] primosomal protein N' 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.401318 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGCGG CGCGCATTCT GGTTGACGGA CAAAGCGATC TGGTACTGGA TTACGGTATT 
CCTCCGGAAG CGGGAGACGT CAAGCCGGGC TGCCGCGTAC AGGTGCCCCT GCGCAACAGA
ACGGCTACCG GAACCGTGCT TACGCTGTCA GAACCGGCCC CGGCCTGGAA GGACAGGCTC
AAGCCCATTC TGAAACTGAT CGATCCGGAG CCCCTGATTT CCCCGGTGAT GATGAATCTG
GCCTCATGGG CGGCGGACTA TTACTCCGTG GCTCTGGATC AGATGATCCG GTGTCTGCTC
CCGGAAACCG TTCGTCAGGA AAATACGGCG GAAAAAATGC GCAAAATGGT GTATCTGGAA
AAAACTCCGG CACGGGAGGA ACTGGACGCC CTGTACCGCA AAGCACCCCG GCAGGCCCAA
ATGCTGGATT ATTTCTCATC CGCGAAACAA CAGTCCGCCC CCCTGGCGGC ATTCGGCGCC
GGAGCCCTGA ACGTTGCCCG CAGCCTGGAA GCCAAAGGCT TCATCTCCCT GAAAGAAGAG
GCCGTGCACC GGGACCCCAG CACCGGGGAG CAATTCGTCC CCACCCAGCC CATGAAGCTG
AACAGCCAGC AGCAGAAAGC GCTGGAAGAA ATCACGGCCA TGTGCGCGGC AGAGCGCAAA
AAACCGGTCC TGCTGCAAGG AGTTACCGGT TCCGGCAAGA CGGAGGTTTA CTTACAGGCC
GTTTCCCAAA TAGTAAAATC CGGAAAATCC GCCCTGATCA TGGTGCCGGA AATTTCCCTG
ACGCCCCAGA CGGTTCAGCG TTTCAAATCC CGTTTCGCAG AACTGCCTTC CTCCGTGGCA
GTGCTGCACA GCCTCCTCTC GGACGGGGAG CGCTTTGACG AATGGCATGC CATCCGTTCC
GGAAAAGCCC GCATCGTCAT CGGTCCCCGT TCCGCCGTCT TCGCCCCCCT CCAAAATCTG
GGCCTGGTGA TTGTGGATGA AGAACACGAC GCCTCTTACA AACAGGAAAG CTCCCCCCGC
TACCACGGGC GGGATCTGGC CGTGCTGCGC GCCCATCTGG AAAATTGCGC CGTGCTTCTT
GGCTCTGCCA CTCCCTCCCT GGAAAGCATC CATAACGCCC TGATTGGAAA ATATTCCCTG
GTAAAACTGA CGGAAAGGGC GGACGGCCAG CAGCTCCCGC TCATTCGCAT CCTGGACATG
AAAACGGAAG GAAGGAACAA ATCCGGTCCC AACGTTATCT CCGAACGCCT CAGGATGTCC
ATTGACCGGC GTCTCGACAA GGGGGAACAG GTCATCCTGC TGCTCAACAG GCGCGGATTC
GCACGCTCCA TCCAATGCCC GGACTGCGGC CACGTAGTCA CGTGCCTGCA CTGCTCCCTG
CCCCTGACCT ACCACCGTAC GGAAGACCGC CTCATGTGCC ACCTGTGCGG ATTCAAAGCC
CTCCCGCCCC GTTCCTGCCC GGAATGCCGG TCCGCCAACA TCCTGCTCCA GGGGTACGGA
ACCCAGAAAG TGGAGGAACT CCTGCGCCGC ACCTTCCCTG CGGCGCGCAT CACGCGCGTG
GATGCGGACG TGGCCCGGCG GAAAAACGCC GTCCGGACCA TTCTGAACCA GTTTCGCGCC
CATAAGATAG ATATTCTCCT GGGCACCCAA ATGATCGCCA AAGGTTTGGA TTTTCCCAAC
GTGACGCTGG TAGGCGTACT GAACGCGGAC CTGGGGCTCC ACATCCCGGA TCCCCGTGCA
GGGGAACGCA CTTTCCAGCT CCTGACGCAG GTAGCGGGCC GCGCCGGCCG CGGCGATCTT
TCCGGAGAAG TCATTATCCA GACTTTTACG CCCCAGTCCC CCTCCCTGCA ATACGCCCGG
CACCATGACA CGGACGGCTT TGCAGCCCAG GAACTGGAAA TGCGGCGTAC TTTCGACCTC
CCCCCCTTCA CCCATATCGC CGTCTTGACC ATACGTTCCC AGCATGAAAG CATGGCGGAA
TTCGCCACGC AAACCCTCGC GGCCCGCCTG CGCGGCATGC TCCCCCCTCC CGCCACGATG
ACCGACCCCA TGCCTGCCCC CATCCCCCGT GCGCACGGAC AGTTCAGATT CCAGATTACG
GTCAAGGGGC CATCCGCCCG CATCCTCTCC CGCACTCTCC GGAAACTGGT GCAGGAAGCC
GGCCTGGGGG AAGACTTGAC GGCTGTCATT GATGTGGATG CCATGTCATT CATGTAA
 
Protein sequence
MQAARILVDG QSDLVLDYGI PPEAGDVKPG CRVQVPLRNR TATGTVLTLS EPAPAWKDRL 
KPILKLIDPE PLISPVMMNL ASWAADYYSV ALDQMIRCLL PETVRQENTA EKMRKMVYLE
KTPAREELDA LYRKAPRQAQ MLDYFSSAKQ QSAPLAAFGA GALNVARSLE AKGFISLKEE
AVHRDPSTGE QFVPTQPMKL NSQQQKALEE ITAMCAAERK KPVLLQGVTG SGKTEVYLQA
VSQIVKSGKS ALIMVPEISL TPQTVQRFKS RFAELPSSVA VLHSLLSDGE RFDEWHAIRS
GKARIVIGPR SAVFAPLQNL GLVIVDEEHD ASYKQESSPR YHGRDLAVLR AHLENCAVLL
GSATPSLESI HNALIGKYSL VKLTERADGQ QLPLIRILDM KTEGRNKSGP NVISERLRMS
IDRRLDKGEQ VILLLNRRGF ARSIQCPDCG HVVTCLHCSL PLTYHRTEDR LMCHLCGFKA
LPPRSCPECR SANILLQGYG TQKVEELLRR TFPAARITRV DADVARRKNA VRTILNQFRA
HKIDILLGTQ MIAKGLDFPN VTLVGVLNAD LGLHIPDPRA GERTFQLLTQ VAGRAGRGDL
SGEVIIQTFT PQSPSLQYAR HHDTDGFAAQ ELEMRRTFDL PPFTHIAVLT IRSQHESMAE
FATQTLAARL RGMLPPPATM TDPMPAPIPR AHGQFRFQIT VKGPSARILS RTLRKLVQEA
GLGEDLTAVI DVDAMSFM