Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_1934 |
Symbol | |
ID | 6275248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | - |
Start bp | 2347223 |
End bp | 2348329 |
Gene Length | 1107 bp |
Protein Length | 368 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 642613994 |
Product | hypothetical protein |
Protein accession | YP_001878528 |
Protein GI | 187736416 |
COG category | [R] General function prediction only |
COG ID | [COG3489] Predicted periplasmic lipoprotein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 45 |
Fosmid unclonability p-value | 0.0548586 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACGCAT ACAAACGCAT GTTGCTGGCC GGCTTTTGCG CCGGAAGTAC AATCCTGGCT TTCGGTAGCC AGGCTCAGGC GGCCAAGGCC GCTTCCCCCA AGGTGGACAA AACGAGCGCG GCGATTGCCG CCTACGCGGA TGAGCTTGTC ATTCCCACCT ACAAGACGAT GAGTGATAAC GCTCTCAAAT TCGCCAAGGC CGCTAAAGAG CTGAAGGCTG CTCCTACCGA TGCCAAGGTT GCTGAAGCAG GGAAGCTTCT TCTTGAAACG CGCGTGCCGT GGGAACTTTC AGAATCCTTC CTCTTTGGTC CGGCTGCTTT CGCCAACCTT GACCCGAAGC TGGACTCCTG GCCCCTGGAT ACCACGAACC TTGACGCCGT CGCCAAAAAC GCGGACAGCA AGAGCGTAAC GATTGACGCC GCCTACGTCC GCAATTCCCT CGGCGCGGAA ACGCGCGGTT TCCATGCTGC CGAATACCTC TTGTTCCGAG ATGGGCAACC CCGCAAGGCC GCCGATCTGA CGCCCGGACA GCTTTCCTAC CTTGCCGCTG TGGCCGAGGT GATTGCAGAG GATGCCATCA CGCTTGAAGC CTGGTGGGCG GGTTCTGACA AGATCAGCGA AGAGAAGGCC AAGATTCTTG AAGAAGCTGA AATAGAACCC GGCAAGTCCT ATGCGGGGGA ATTCAAAAAA GCTGGCCAGG CAGGCAGCCG CTACGAGTCA AACTCTGAAG TGCTTGATGA AATCATCGGC GGCAGCAAGG ACATTATTGA CGAAATAGCC GATTCAAAGG TGGGCAAGCC CTACGAAACG GCTGACGCCG CCGACTGTGA ATCCCTTTAC AGCTACACTT CTCTGGTGGA CTCCCGCCAC AACGTACAGA GCGTAGAAAA ATCCTACAAT GCGATTTCCC CTCTCGTAGC CGCCAAGTCC GCGAAGGTTG GCCAGGCTGT GAAAGGTTCT ATCGCCAAGG TATTCAAGAG CCTCGACGCC ATACAGGGCC CCCTGGTGAA AAACCTCGAC AAAAAGGAGC AGCTCAAGGC AATCATCGAC AGTTGCAAGG AACTTTCCGA AAACCTCGAC AAGGTTCAGG AACTTCTTGT GAAGTAA
|
Protein sequence | MNAYKRMLLA GFCAGSTILA FGSQAQAAKA ASPKVDKTSA AIAAYADELV IPTYKTMSDN ALKFAKAAKE LKAAPTDAKV AEAGKLLLET RVPWELSESF LFGPAAFANL DPKLDSWPLD TTNLDAVAKN ADSKSVTIDA AYVRNSLGAE TRGFHAAEYL LFRDGQPRKA ADLTPGQLSY LAAVAEVIAE DAITLEAWWA GSDKISEEKA KILEEAEIEP GKSYAGEFKK AGQAGSRYES NSEVLDEIIG GSKDIIDEIA DSKVGKPYET ADAADCESLY SYTSLVDSRH NVQSVEKSYN AISPLVAAKS AKVGQAVKGS IAKVFKSLDA IQGPLVKNLD KKEQLKAIID SCKELSENLD KVQELLVK
|
| |