Gene Amuc_1423 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1423 
Symbol 
ID6275672 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1707539 
End bp1708750 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content56% 
IMG OID642613480 
Producthypothetical protein 
Protein accessionYP_001878026 
Protein GI187735914 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.994888 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.140536 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATGC CTCCCGTCTT ACCCCTGATG AAAAAGCCCG CACGCCTCTG CCTGGCGGCG 
GGAACTCTGT TCACCCTTTT CCTTACCCCC CTTGCACAGG GGCGCACCTG GACCAACCTC
CAAGGGAAAA AACTGGAAGC AGAATTCATC AGGCTGGACG GCCAGAAAGC CGTGCTGAAA
CGTTCCGGCG GCCAAACCGT CTCCATTCCC CTCACCCAGC TCTCCCGGGA AGACAGGGAT
TTCATCGCGG AACAGGAAAA AGGAGGGGCA CTCCCCTCCA ATACGGCGGA CAATTACCAC
CTGCCGTGGC CCAGGAGCGT CAAATGCCCG GACAATTTCA AGGTGGAAAC CATCAAGGAG
GAACCGGGAG AATATATTTA TGAAACACCC CATTTCCGCT TCATCTGCGA CGCCAAGCTG
GGCACCGGCA TGATCAAGCG CCTGGGCCTC CTCTTTGAGG CCACCCACTT GGCCAACAAA
ACCCTTCCTA TAGGAAACTC CCCTGCCCAT GACGATTCCG CCAAATTCCC CGCCTACCTG
TATGAAAAAT TCAGCACCTA TCTGGAAAAC GGCGGACGCG AAGGCACGGC GGGCATCTTC
CTGGGGACAA CGCGGCCAGG GGACCGCGGA AGAATTCTGG TTCCGTTCGA TTCCCTGGGA
GTCAAAACCA TGGGAAGCAC ATACGTCATT GACCGTGACA AGGACGCTTC CACCCTCATC
CATGAACTGA CGCACCAGCT CATGTCTTCG CAGGCCAAGC AGGCCAGCTG GTTTTGTGAA
GGCTCCGCGG AATACATGGG CATGACGCCC TATGCCGGAG GCCGCTTCAA CTTTGGAGCC
AACCGATCCC ACATTGTCTC CCGCGTGACG GAATACGGCA AAAAAAATAC GGGGGGACGG
GCCCTTGGGG ATGACTTTGA GGCGCCCGGC CTGGAAGCTT ACATGAACAT GCCCTATTCC
CAGTTCACGG GAGAAAACGC CAACCTGAAC TACGGCCTGG CCGCCCTGAT GGCCTACTAT
TTTTACCACA TGGACGGCAA GGGCGATGCC CGGCGCATCA AGAATTACAT GAAAGCCATT
CAATCCGGAA CCAGTGAAAA GGAAGCTCAG AAACTCCTCC TTGACGGACG GAGCTATGAA
GAACTGGCCA AAGAAATTGA ACAGAAATGG CGCAAGGCCG GCGTTAAAAT CCGCTTCCGT
TCCTCTTCCT GA
 
Protein sequence
MFMPPVLPLM KKPARLCLAA GTLFTLFLTP LAQGRTWTNL QGKKLEAEFI RLDGQKAVLK 
RSGGQTVSIP LTQLSREDRD FIAEQEKGGA LPSNTADNYH LPWPRSVKCP DNFKVETIKE
EPGEYIYETP HFRFICDAKL GTGMIKRLGL LFEATHLANK TLPIGNSPAH DDSAKFPAYL
YEKFSTYLEN GGREGTAGIF LGTTRPGDRG RILVPFDSLG VKTMGSTYVI DRDKDASTLI
HELTHQLMSS QAKQASWFCE GSAEYMGMTP YAGGRFNFGA NRSHIVSRVT EYGKKNTGGR
ALGDDFEAPG LEAYMNMPYS QFTGENANLN YGLAALMAYY FYHMDGKGDA RRIKNYMKAI
QSGTSEKEAQ KLLLDGRSYE ELAKEIEQKW RKAGVKIRFR SSS