Gene Amuc_2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2020 
Symbol 
ID6274671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2453657 
End bp2454676 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content59% 
IMG OID642614080 
ProductLAO/AO transport system ATPase 
Protein accessionYP_001878611 
Protein GI187736499 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1703] Putative periplasmic protein kinase ArgK and related GTPases of G3E family 
TIGRFAM ID[TIGR00750] LAO/AO transport system ATPase 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.00526211 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAACA TCAAATTTGC ACGTCCGCAC CGCCCTTCCG TAGAAGAACT GGCCCAGGGA 
GTACTGGCCG GAAACCGCGC CCTGCTGGGA AGGGCCATTA CACTGATAGA AAGCAATGCC
GTCCGGGACC AGGAATCTTC CCGCGCCCTC ATCTCCAGGC TCCTTCCCCA TTCGGGCAAC
GCCGTCCGCA TCGGCATTAC GGGCGTTCCG GGCGCCGGGA AATCCTCTTT CATTGAAGCC
TTCGGCACTT ACCTGTGCAA AAAAGGGTTC AAGGTGGCTG TGCTGGCTAT TGACCCGTCT
TCTTCAGTCT CCCGCGGTTC CATTATGGGA GACAAAACAC GCATGGAGGA ACTCTCCGGA
GAGGAAAACG CCTTCATCCG CCCTTCCCCC TCCGGCGGCT CTTTGGGCGG CGTAGCCCGG
AAAACGCGTG AAACCATGAT TGCATGCGAA GCTGCGGGCT TTGACATTAT TCTCATTGAA
ACCGTGGGAG TCGGCCAGTC GGAAACTACG GTGCGCTCCA TGGTGGACAT TTTCATGCTC
CTGCTCATCA CCGGAGCCGG GGACGATCTC CAGGGCATCA AGCGGGGCAT CATGGAACTG
GCGGATATCC TAGTAGTTAC CAAAGATGAC GGCGACAACC GCCAGCGCGC CGCAGCCCAC
TGCCAGGAAC TGAAAATGGT ACTCCACTAC CTGCAAAGCC CCACTCCCGG CTGGACGCCC
TCCGTCCTCA CCTGTTCCTC CCTGGAGGGA CGCGGCCTGG ACACCATTGA AGAGACGCTC
TTCCGCTTCC GGGACAGCAT GAAGGAATCC GGATTCTGGT ACAGCCGCCG CCGGAGCCAG
TCCCTTTCAT GGGTCCAGTC CCTGGTGCAT GAAGCCCTGC TCACCGCTTT TGAACAGCAC
CCCGCCGTAG CGTCCCGCAT GCCCATTCTG GAAAACATGG TGGCGGGGGA CAAAATGGAC
CCCGTTTCCG CCGCACATGA CCTGCTGAGC CACTTTACTT ATCCCGCGCC CGGACATTAA
 
Protein sequence
MSNIKFARPH RPSVEELAQG VLAGNRALLG RAITLIESNA VRDQESSRAL ISRLLPHSGN 
AVRIGITGVP GAGKSSFIEA FGTYLCKKGF KVAVLAIDPS SSVSRGSIMG DKTRMEELSG
EENAFIRPSP SGGSLGGVAR KTRETMIACE AAGFDIILIE TVGVGQSETT VRSMVDIFML
LLITGAGDDL QGIKRGIMEL ADILVVTKDD GDNRQRAAAH CQELKMVLHY LQSPTPGWTP
SVLTCSSLEG RGLDTIEETL FRFRDSMKES GFWYSRRRSQ SLSWVQSLVH EALLTAFEQH
PAVASRMPIL ENMVAGDKMD PVSAAHDLLS HFTYPAPGH