Gene Amuc_1119 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1119 
Symbol 
ID6273950 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1338657 
End bp1339814 
Gene Length1158 bp 
Protein Length385 aa 
Translation table11 
GC content53% 
IMG OID642613170 
Producthypothetical protein 
Protein accessionYP_001877726 
Protein GI187735614 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones78 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAAAC GACTTCTCTC CGCATTTTTT TCTCTGTTCT TTCTGGGAGC GGCCTCCGGG 
ACATCCTTTG CGGAAGTCAC CGTTCCGGAC GCCCTGAAAG ACCGGATTGC TCTGAAAAAA
ACGGCCCGTC AGCTCAATAT CGTTTATTTT CTGGGCAGTG ATACGGAACC CGTTCCGGAT
TATGAACGGC GCCTCAGCGA ACTGCTCCTT TACCTCCAGC AGTTTTACGG CAAGGAAATG
CAGCGGCATG GCTATGGCGC GCGTTCCTTC GGCCTGGACA TCAAATCCCC AGGCCGCGTG
AACATCATTG AATACAAGGC CAAAAATCCG GCGGCCCATT ATCCTTATGA AAACGGAGGC
GGCTGGAAAG CGGCCCAGGA ACTTGACGAA TTTTTCAAAG CCCATCCGGA CAGGAAAAAA
AGCCAGCACA CGCTCATCAT CATGCCCACC TGGAATGACG AAAAGAACGG CCCCGACAAT
CCCGGCGGAG TTCCCTTTTA CGGCATGGGG CGCAACTGTT TCGCCCTGGA TTATCCGGCC
TTCGATATCA AACACCTGGG GCAGAAAACA AGGGAAGGAA GGCTGCTGAC CAAATGGTAC
GGAGGCATGG CCCACGAATT GGGGCACGGC CTTAATCTGC CGCACAACCA CCAAACCGCC
TCGGACGGTA AAAAATACGG CACGGCCCTG ATGGGTTCGG GCAATTACAC GTTCGGGACC
AGTCCCACGT TCCTGACCCC GGCCAGCTGC GCCCTGCTGG ATGCCTGTGA AGTGTTTTCC
GTCACCCCGT CCCAGCAATT CTACGAAGGC AAGCCGGAAG TGGAGGTCGG GGACGTAGCC
ATTTCTTTTA AAGGAGACCA GATTCTGGTT TCCGGCAATT ATAAAAGCCC CCAGACCGTC
AAAGCTCTGA ATGTTTACAT CCAGGATCCT CCTTATGCGG TCAACCAGGA CTATGACGCC
GTTTCCTTCT CCCGGCGCCT TGGAAAAAAG AGCGGGAAAT TCTCCATGAA AATTGACAAA
AAAGAGCTGG AAGGATTGAA CAATAACGAA TTCCGCATTT CCCTCATGTT CATTCTCGCC
AACGGGCTGC ACATGCAGAA GCATTTCACG TTCCATTGGG ACGCTCTCCA GGATTACAGG
GACGGAAGCA AATCCTGA
 
Protein sequence
MLKRLLSAFF SLFFLGAASG TSFAEVTVPD ALKDRIALKK TARQLNIVYF LGSDTEPVPD 
YERRLSELLL YLQQFYGKEM QRHGYGARSF GLDIKSPGRV NIIEYKAKNP AAHYPYENGG
GWKAAQELDE FFKAHPDRKK SQHTLIIMPT WNDEKNGPDN PGGVPFYGMG RNCFALDYPA
FDIKHLGQKT REGRLLTKWY GGMAHELGHG LNLPHNHQTA SDGKKYGTAL MGSGNYTFGT
SPTFLTPASC ALLDACEVFS VTPSQQFYEG KPEVEVGDVA ISFKGDQILV SGNYKSPQTV
KALNVYIQDP PYAVNQDYDA VSFSRRLGKK SGKFSMKIDK KELEGLNNNE FRISLMFILA
NGLHMQKHFT FHWDALQDYR DGSKS