Gene Amuc_1942 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1942 
Symbol 
ID6275172 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2357359 
End bp2358612 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content59% 
IMG OID642614002 
Productpeptidase T 
Protein accessionYP_001878536 
Protein GI187736424 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2195] Di- and tripeptidases 
TIGRFAM ID[TIGR01882] peptidase T 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.366024 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.129304 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACATCT CCGTTTTAGA CCGTTTTTTG AAGTATGTCT CCGTGGGCTC TCAGTCGGAC 
GCGGATTCCG CCACCGTGCC ATCCACCCCC GGCCAGACGG AGCTGGCCCG GCTGTTGGCC
GGGGAATTGA AGGACATGGG AGCGCAGGCG GAACTGGATG ATGATTCCGG AATTGTTTAC
GCCTCCATCC CTTCCAACGT GGAGGGGAAT GTTCCCGTCA TCGGCTGGGT GGCCCATGTG
GATACGGCTC CGGGAGTCTC CGGCAGCGGA GTGAAGCCCC GCATCGTGCG CTCGTACGAC
GGCGGAGACA TCCTGCTCAA TCCGGAACAG GGAATGGTGC TTTCCCCCGC CGTTTTTCCC
GAACTGACGG ATTATTTGGG CCAGGACCTG GTCGTGACGG ATGGAACTAC CCTGCTTGGC
GCGGACGACA AGGCCGGTGT GGCGGAAATC ATGGATATGG CAGCCTCCTT CCTGCTGAAC
CCCGAACGGC CTCACGGCGA GATACGGATC GCCTTCACGC CGGATGAGGA AATAGGACGC
GGAACGGATG CCTTTGACGT GGCGCGATTT GGTGCGGATT TCGCCTATAC CGTGGACGGA
GGCGCCCTGG GGGAAATTGA ATACGAAAAT TTCAATGCGG CTTCCGCAGT GGTGACGGTG
CAGGGCTCTT CCATTCATCC CGGAAGCGCC AAGGGCAGGA TGTTGAATGC CTGCCTGGTT
CTCATGGAAT TCCAGGGAAT GCTGCCCGCG TTCCAGAATC CCGCTTTTAC GGAAGGGTAC
GAGGGCTTTT ATCATTTGGA TTCACTCCGG GGAGATGTGG AACGGGCGGC TGCGGAGTAC
CTCATCAGGG ATCATGACAG GGAGGCCTTT GAGCGCAAGA AGGAATTCAT GCAGGAATGT
GCGGCCCTGC TGAACCGTAA ATACGGGGAG GGGACGGTAA AGGCGGAAAT TACGGATTCC
TATTACAACA TGAAGGAAAA GATACTTCCG CACATGCACC TGGTGGAACA TGCCCGCAGG
GCCATGGAAG CCGTGGGAGT GAGGCCGGAA ATCGTCCCTG TGCGCGGAGG CACGGACGGA
GCCCACCTCT CCTTCATGGG GCTGCCGTGC CCCAACCTGT GCGCCGGAGG CCACAACTTC
CATGGAAAGT ATGAATACGT CCCCGTCCGT TCCCTGGAAA AAATTTCCGC CATCCTTCAG
GAAATCGTCT CCGGTTATGC CCGGTACGGC CTGGAGCCGC CGCCAAGGGA ATGA
 
Protein sequence
MNISVLDRFL KYVSVGSQSD ADSATVPSTP GQTELARLLA GELKDMGAQA ELDDDSGIVY 
ASIPSNVEGN VPVIGWVAHV DTAPGVSGSG VKPRIVRSYD GGDILLNPEQ GMVLSPAVFP
ELTDYLGQDL VVTDGTTLLG ADDKAGVAEI MDMAASFLLN PERPHGEIRI AFTPDEEIGR
GTDAFDVARF GADFAYTVDG GALGEIEYEN FNAASAVVTV QGSSIHPGSA KGRMLNACLV
LMEFQGMLPA FQNPAFTEGY EGFYHLDSLR GDVERAAAEY LIRDHDREAF ERKKEFMQEC
AALLNRKYGE GTVKAEITDS YYNMKEKILP HMHLVEHARR AMEAVGVRPE IVPVRGGTDG
AHLSFMGLPC PNLCAGGHNF HGKYEYVPVR SLEKISAILQ EIVSGYARYG LEPPPRE