Gene Amuc_1420 details


Gene Information

Locus tag: Amuc_1420
Symbol:
ID: 6275745
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1699183
End bp: 1701132
Gene Length: 1950 bp
Protein Length: 649 aa
Translation table: 11
GC content: 57%
IMG OID: 642613478
Product: hypothetical protein
Protein accession: YP_001878024
Protein GI: 187735912
COG category:
COG ID:
TIGRFAM ID:
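
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record above.
START_BP = 1699183
END_BP = 1701132
GENE_LENGTH_BP = 1950
PROTEIN_LENGTH_AA = 649


def span_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp):
    """Number of codons minus one terminal stop codon (unspliced CDS assumed)."""
    codons, remainder = divmod(gene_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1


print(span_length(START_BP, END_BP))            # 1950
print(expected_protein_length(GENE_LENGTH_BP))  # 649
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with an unspliced CDS whose last codon is a stop codon.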


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.828826
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value: 0.205294
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTTGTAA AATCACTCAT TATGCTCAGC AGTACGGCAG GATGCCTCCT GACCGGCGCC 
GGACAGGCGG AGGAAACGCA CGCATCCCTG CAATCTCCGG ACGGGCGGCT GTCCTGGAGG
CTCCACGCGG GCAGCCAGGC CGCCCATTCT CTGAAAAAAG GCGGCAAAAC GATTCTGGAA
CCATCCGGGC TGGGCGTCGT CGTGAATGGA AAAAACCTCG CCGGAGGCAT CACGGGATGG
AATGTGGAAA AAGTGAAGGA TAACGTTCGC GACACCTTCG AGACGCGGGG CAAATACCCC
ACCTCTTCCG TCTGCTTCAA TGAATACGTG GTTTCCGGGA AGAACTCCTC CCTCCGGCTC
CGGGCCCGTG TCTTCAACAA CGGGGTCGCC TTCCGCTATG AGTGGACGGA AGAGGTGAAG
AAAGTGCCCT CGCTGAATAT CAGTGAAGAA AAAACCTCCT TCGCCTTTCC GGAAAAAACC
GTTCTGTGGA CCCAGGACGC CTCCTCCGCC CTGGGCCCCT GCGAGGGGGT CTGGTCTCCT
TCCAGAATAA CGGATTTCAA AAAAGATCCC GGCAATCCCC GCAGCTGTGT CCGCACCATG
CCGATTACGG CGGAACTGCC GGACGGAGGC GTCGCGCTCA TACAGGAAGC AGCCAACTTC
AACAGGAAAT GGAGCGGCAT TAAATTTTCC CTCCAGGATG GCGCATGCCA TACGGTATAT
TTTCAGGATC CCGGCGGCTT TTCCGCCCCC TCATCCTCGG AAATGCCCTG GCGTGTCATC
CTAGTGAATG ACGACCTGAA CGGACTGGTC CGGAATGACG TCATCCCTTC CCTGGCTCCG
GAACCGGACA GGAAGCTTTT TCCGGAAGGT TCCAAAGCCT CCTGGATCAG GCCGGGCCGC
TCTACCTGGA CCTGGTGGGA CCGCGGGAAT GTCCTGGAAA ACGACCAATA CGCTTTTGCC
GATATGGCTG CCGAATTCGG CTGGGAATAC CACCTCGTTG ACGAAGGGTG GAAAAAATGG
GGGCCATCCC TACCAGAAAG CATGGGCAAG CTGGCCAAAC TGGCCAGCTA CGCCGCTGGC
AAAAATGTAG GCATCTGGGT ATGGGTGCGA TGGTCAGACG TCAACAACCC CGCCAATGAC
TGGGAAAATA TGCGCAGCTT TTTCGGCTCA CTCTCCAAAA CGGGAATCAG GGGAATCAAG
ATAGACTTTA TGGACTCCGC CTCTCAGGAA CGCTTGGCCT TCTACGACGC CGTAGCGGAA
AATCTGGCGA AAAACAAGCT TATGGTCAAC TTCCACGGAG CCAATACCCC CACGGGGGAG
GAACGTTCCT GGCCGCACGA AATGAGCCGG GAAGGCATTT ACGGCGGAGA ACAGAACATC
TGGGCCGCCA TTGGCGGGCA GCACTACTGC GCGCTGCCTT TCACCCGCCT GATATCCGGC
CACGCCGATT TCACGGGAGG CTACTTTGGC CACGGCCCCA AGCTGCGCGG CTCTTCCTGG
ACCCTCCAAA TGGCTGCCAA TATCATTTAC ACGTCCTCCA TGCTTCACTG GGTCTCCAAC
CCTGCGGACA TGGAAGCTGC TTTCCCGAAG GATTCCCCTG AACGGGAAGT TGTCCGGAAC
ATCCCTTCCG TATGGGAGGA AACCATCGTC CTTCCGCCGT CTGCCATTGG GGAATGCGCT
GCCTTCGCCC GGCGGTCAGG AAACCAGTGG TACATTGCTC TGATGAACGG AGACGGCAGG
GAACGCACCG TTTCCATTCC TTTGAATTTC CTGGACAAAA ACACGGCATA CCAGGCCACC
ATTCTCCGGG ATCTGGCGGA AAAAAATGAC GGATGGAGCG TGGAAACCCG GAAAGTCACT
TCCGGAAATG CCCTTTCCTT CACCATGCGC ATCAAAGGCG GAGGTATCGT CCGCATGGTT
CCCTCCGGAA CGCGGCATCC TGCCCCCTAA
 
Protein sequence
MLVKSLIMLS STAGCLLTGA GQAEETHASL QSPDGRLSWR LHAGSQAAHS LKKGGKTILE 
PSGLGVVVNG KNLAGGITGW NVEKVKDNVR DTFETRGKYP TSSVCFNEYV VSGKNSSLRL
RARVFNNGVA FRYEWTEEVK KVPSLNISEE KTSFAFPEKT VLWTQDASSA LGPCEGVWSP
SRITDFKKDP GNPRSCVRTM PITAELPDGG VALIQEAANF NRKWSGIKFS LQDGACHTVY
FQDPGGFSAP SSSEMPWRVI LVNDDLNGLV RNDVIPSLAP EPDRKLFPEG SKASWIRPGR
STWTWWDRGN VLENDQYAFA DMAAEFGWEY HLVDEGWKKW GPSLPESMGK LAKLASYAAG
KNVGIWVWVR WSDVNNPAND WENMRSFFGS LSKTGIRGIK IDFMDSASQE RLAFYDAVAE
NLAKNKLMVN FHGANTPTGE ERSWPHEMSR EGIYGGEQNI WAAIGGQHYC ALPFTRLISG
HADFTGGYFG HGPKLRGSSW TLQMAANIIY TSSMLHWVSN PADMEAAFPK DSPEREVVRN
IPSVWEETIV LPPSAIGECA AFARRSGNQW YIALMNGDGR ERTVSIPLNF LDKNTAYQAT
ILRDLAEKND GWSVETRKVT SGNALSFTMR IKGGGIVRMV PSGTRHPAP
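
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted, which this sketch ignores). The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and the spaces that group the sequence into blocks of 10 are stripped before processing.

```python
from itertools import product

# Codon assignments shared by the standard code and translation table 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove whitespace (block separators, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last rows of the gene sequence listed above.
first_row = "ATGCTTGTAA AATCACTCAT TATGCTCAGC"
last_row = "CCCTCCGGAA CGCGGCATCC TGCCCCCTAA"
print(translate(first_row))  # MLVKSLIMLS
print(translate(last_row))   # PSGTRHPAP
```

The two translated fragments match the beginning and the end of the listed protein sequence. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field of the record.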