Gene Amuc_1914 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1914 
Symbol 
ID6275373 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2322961 
End bp2324121 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content43% 
IMG OID642613974 
Productrestriction modification system DNA specificity domain 
Protein accessionYP_001878508 
Protein GI187736396 
COG category[V] Defense mechanisms 
COG ID[COG0732] Restriction endonuclease S subunits 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.00381757 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAATGAGA AATCGTTGAT TCCGTCTATC CGCTTTGCCG GATTTACTGA CGCATGGGAA 
CGGCGTAAGC TGGGGGATTT AGCGGAGTTT AGAAGAGGGC TAACCTATTC ACCAAGAGAT
ATATCAACAT CTGGAATCAG GGTATTGCGC TCGTCAAATA TAGATGAGGA TTCTTTCGTT
TTAGCAGAGG ATGATGTTTA TGTAAAAGAG ACGGCTGTGT GCATCCCGCT TGTTGAAAAA
GGCGACATTT TAATTACCGC AGCTAATGGC TCAAGCAGAT TAGTCGGAAA GCATGCTTTG
ATTATTGACG ATAAGGGTAA AATGGTACAC GGCGGGTTCA TGCTGCTCGC GCATCCGTAT
ACGCATTCTG CTTTCGTTAA TGCTCTTATG CATGCACCCT GGTACTCATC GTTTATCCGC
ACTAACGTTG CTGGAGGAAA TGGAGCTATA GGAAATCTGA ATAAAAGCGA TTTGGAAGAA
CAAGATATTG CGGCGACCTC TGAGCAAGAG CAAGAAAGAA TCGGTTCCTT GTTTGCCTCC
CTCGACCATC TCATCACCCT TCATCAGCGT AAGTATGAAA AGCTCCTTAA CATCAAAAAA
TCGATGTTGG ACAAAATGTT CCCGAAAAAT GGTGAGCTTT TCCCCGAAGT TCGCTTTGCC
GGATTTACTG ACGCATGGGA ACGGCAGAAG CTGGGGGATT TGGTAGAGTC TGTTCCGTTT
AAGCAGTATA TAGCATCACC TGAACCTGAC GGAAAATTCG AAATTATCCA ACAAGGAAGT
GAGCCTATTA TTGGATATGG AAACGGAATC CCTTGTGAAG ATTATGCAAA GATAACGATT
TTCGGAGACC ATACAGTTTC AATCTACAAA CCACAAAAGC CCTTTTTTGT AGCCACTGAT
GGCACAAGAC TCCTTACAGC AAGAGTTCTA GATGGAGATT TTTTTTATTT CCTCTTGGAG
CGATACAAAC CAATCCCTGA AGGATATAAG CGGCATTACA CGATATTGAT TGAAAGGTAT
GGATGTTTTC CTTCCCATCG AGAGCAAAAG TTAATTGCCA TATTTTTTAG GAACATCGAC
CACCTCATCA CCCTTCATCA GCGTAAGTTG GAAAAACTGC AAAACATCAA GAAAGCCTGT
CTGGAAAAAA TGTTTGTTTA A
 
Protein sequence
MNEKSLIPSI RFAGFTDAWE RRKLGDLAEF RRGLTYSPRD ISTSGIRVLR SSNIDEDSFV 
LAEDDVYVKE TAVCIPLVEK GDILITAANG SSRLVGKHAL IIDDKGKMVH GGFMLLAHPY
THSAFVNALM HAPWYSSFIR TNVAGGNGAI GNLNKSDLEE QDIAATSEQE QERIGSLFAS
LDHLITLHQR KYEKLLNIKK SMLDKMFPKN GELFPEVRFA GFTDAWERQK LGDLVESVPF
KQYIASPEPD GKFEIIQQGS EPIIGYGNGI PCEDYAKITI FGDHTVSIYK PQKPFFVATD
GTRLLTARVL DGDFFYFLLE RYKPIPEGYK RHYTILIERY GCFPSHREQK LIAIFFRNID
HLITLHQRKL EKLQNIKKAC LEKMFV