Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Amuc_2020 |
Symbol | |
ID | 6274671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Akkermansia muciniphila ATCC BAA-835 |
Kingdom | Bacteria |
Replicon accession | NC_010655 |
Strand | + |
Start bp | 2453657 |
End bp | 2454676 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 642614080 |
Product | LAO/AO transport system ATPase |
Protein accession | YP_001878611 |
Protein GI | 187736499 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1703] Putative periplasmic protein kinase ArgK and related GTPases of G3E family |
TIGRFAM ID | [TIGR00750] LAO/AO transport system ATPase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.00526211 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGCAACA TCAAATTTGC ACGTCCGCAC CGCCCTTCCG TAGAAGAACT GGCCCAGGGA GTACTGGCCG GAAACCGCGC CCTGCTGGGA AGGGCCATTA CACTGATAGA AAGCAATGCC GTCCGGGACC AGGAATCTTC CCGCGCCCTC ATCTCCAGGC TCCTTCCCCA TTCGGGCAAC GCCGTCCGCA TCGGCATTAC GGGCGTTCCG GGCGCCGGGA AATCCTCTTT CATTGAAGCC TTCGGCACTT ACCTGTGCAA AAAAGGGTTC AAGGTGGCTG TGCTGGCTAT TGACCCGTCT TCTTCAGTCT CCCGCGGTTC CATTATGGGA GACAAAACAC GCATGGAGGA ACTCTCCGGA GAGGAAAACG CCTTCATCCG CCCTTCCCCC TCCGGCGGCT CTTTGGGCGG CGTAGCCCGG AAAACGCGTG AAACCATGAT TGCATGCGAA GCTGCGGGCT TTGACATTAT TCTCATTGAA ACCGTGGGAG TCGGCCAGTC GGAAACTACG GTGCGCTCCA TGGTGGACAT TTTCATGCTC CTGCTCATCA CCGGAGCCGG GGACGATCTC CAGGGCATCA AGCGGGGCAT CATGGAACTG GCGGATATCC TAGTAGTTAC CAAAGATGAC GGCGACAACC GCCAGCGCGC CGCAGCCCAC TGCCAGGAAC TGAAAATGGT ACTCCACTAC CTGCAAAGCC CCACTCCCGG CTGGACGCCC TCCGTCCTCA CCTGTTCCTC CCTGGAGGGA CGCGGCCTGG ACACCATTGA AGAGACGCTC TTCCGCTTCC GGGACAGCAT GAAGGAATCC GGATTCTGGT ACAGCCGCCG CCGGAGCCAG TCCCTTTCAT GGGTCCAGTC CCTGGTGCAT GAAGCCCTGC TCACCGCTTT TGAACAGCAC CCCGCCGTAG CGTCCCGCAT GCCCATTCTG GAAAACATGG TGGCGGGGGA CAAAATGGAC CCCGTTTCCG CCGCACATGA CCTGCTGAGC CACTTTACTT ATCCCGCGCC CGGACATTAA
|
Protein sequence | MSNIKFARPH RPSVEELAQG VLAGNRALLG RAITLIESNA VRDQESSRAL ISRLLPHSGN AVRIGITGVP GAGKSSFIEA FGTYLCKKGF KVAVLAIDPS SSVSRGSIMG DKTRMEELSG EENAFIRPSP SGGSLGGVAR KTRETMIACE AAGFDIILIE TVGVGQSETT VRSMVDIFML LLITGAGDDL QGIKRGIMEL ADILVVTKDD GDNRQRAAAH CQELKMVLHY LQSPTPGWTP SVLTCSSLEG RGLDTIEETL FRFRDSMKES GFWYSRRRSQ SLSWVQSLVH EALLTAFEQH PAVASRMPIL ENMVAGDKMD PVSAAHDLLS HFTYPAPGH
|
| |