Gene Amuc_1504 details


Gene Information

Locus tag: Amuc_1504
Symbol:
ID: 6274586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1796305
End bp: 1797363
Gene Length: 1059 bp
Protein Length: 352 aa
Translation table: 11
GC content: 57%
IMG OID: 642613563
Product: hypothetical protein
Protein accession: YP_001878106
Protein GI: 187735994
COG category: [R] General function prediction only
COG ID: [COG1611] Predicted Rossmann fold nucleotide-binding protein
TIGRFAM ID: [TIGR00725] conserved hypothetical protein, DprA/Smf-related, family 1
            [TIGR00730] conserved hypothetical protein, DprA/Smf-related, family 2
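
The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming Start bp and End bp are 1-based inclusive positions on the replicon and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of the source database:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(1796305, 1797363)
print(length_bp)                  # 1059
print(protein_length(length_bp))  # 352
```

Both results agree with the Gene Length and Protein Length fields of this record.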


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.0023514
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCCATGG ATTCCAGCTA CCAGACTCTG GAAGGCGCCG TTAAAAAGCT TTCCGAAATA 
GCCTCCACCA GGCACGATCC CCGCTACCTC CAAGAGTATA TCAAAACGGG AATCAACATG
GCCCAGTCCG CCGCCTCGGA CCATGATTTT ACCGTCCTGA TCCGCTCCGG GCGGGAGATG
TACCGCGCCA ACTGCGTTTT CGCCCCGTAC CGCCACATCC GCAAGATTTC CGTCTTCGGC
TCCGCCCGCA TCAGGAATGA CGAACCGGCG TATGAAACGG CGAGGGAATT CGCCAGGGAA
GCCAGCGAAC ACGGCTACAT GGTCATTACC GGAGGCGGAC CGGGCATCAT GCAGGCAGCC
AATGAAGGGG CGGGAGAGCA ACGCTCCTTC GGCCTGAACA TCACCCTGCC GTATGAACAG
ACCTCCAACC ATGTGGTGGC CCACAGCGAC AAACTCATCA ATTTTTATTA CTTTTTCGTC
AGAAAACTGA ACTTCGTGGC GGAAAGCGAC GCCATGGTGG CATTCCCCGG AGGCTTCGGA
ACCATGGATG AAGTGTTTGA AACACTCACT CTGATCCAGA CGGGAAAAGC GACCATTTAC
CCGATCGTCC TTCTGGATTC CCCCGGCAAA ACCTTCTGGC TGAACTGGCT GGCCTTCATT
CGCGTGGAAC TGGTGGATTC CGGACTGATT TCCGCAGACG ATCTTCATCT CATCCATGTC
ACTAAAAATC CGGCGGAAGC CATGGAACAC ATCGACCGTT TTTACCGGAT TTTCCACTCC
TACCGTTTTG TCGGAGATTC CATCGTCATC CGGCTGAATG CGCAGCTTCC CGCCCAGTGG
GTGGAACATC TGGAACGGGA CTTTTCAGAC CTGATTCTGC CTGGGGGGAA AATGATCCAG
AGCGGCCCCC TGCCGGATGA AGCGGACGAA CCACACCTGG ACCGCCTGCC CCGGCTCGTC
TTCCCCATCA AACGCGGCAA CTACGGCAGG CTGAGGTTGC TGATTGACCG CATCAACCAG
ACGCCCAGCC GAACCTATTC CCCGCCCACC CATGCCTGA
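
The GC content field can be checked against the gene sequence. A minimal sketch, assuming the sequence is stored as printed here, in whitespace-separated blocks of 10 bases; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used as sample input:

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, line breaks) and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGTCCATGG ATTCCAGCTA CCAGACTCTG GAAGGCGCCG TTAAAAAGCT TTCCGAAATA"
seq = clean_sequence(first_row)
print(len(seq))                       # 60
print(round(100 * gc_fraction(seq)))  # GC percentage of this row only
```

Passing the full gene sequence to the same two functions gives a percentage that can be compared with the reported GC content.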
 
Protein sequence
MSMDSSYQTL EGAVKKLSEI ASTRHDPRYL QEYIKTGINM AQSAASDHDF TVLIRSGREM 
YRANCVFAPY RHIRKISVFG SARIRNDEPA YETAREFARE ASEHGYMVIT GGGPGIMQAA
NEGAGEQRSF GLNITLPYEQ TSNHVVAHSD KLINFYYFFV RKLNFVAESD AMVAFPGGFG
TMDEVFETLT LIQTGKATIY PIVLLDSPGK TFWLNWLAFI RVELVDSGLI SADDLHLIHV
TKNPAEAMEH IDRFYRIFHS YRFVGDSIVI RLNAQLPAQW VEHLERDFSD LILPGGKMIQ
SGPLPDEADE PHLDRLPRLV FPIKRGNYGR LRLLIDRINQ TPSRTYSPPT HA
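
The protein sequence corresponds to the gene sequence translated with translation table 11. A minimal sketch, assuming the NCBI bacterial table, whose amino-acid assignments match the standard genetic code (the tables differ in permitted start codons); the helper name `translate` is illustrative:

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon.

    Alternative start codons are not special-cased; this gene starts with ATG.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First row of the gene sequence, used as sample input.
first_row = "ATGTCCATGG ATTCCAGCTA CCAGACTCTG GAAGGCGCCG TTAAAAAGCT TTCCGAAATA"
print(translate(first_row))  # MSMDSSYQTLEGAVKKLSEI
```

The output matches the first 20 residues of the protein sequence listed above.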