Gene Amuc_0443 details


Gene Information

Locus tag: Amuc_0443
Symbol:
ID: 6275629
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 526077
End bp: 527204
Gene Length: 1128 bp
Protein Length: 375 aa
Translation table: 11
GC content: 61%
IMG OID: 642612493
Product: DNA protecting protein DprA
Protein accession: YP_001877062
Protein GI: 187734950
COG category: [L] Replication, recombination and repair; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG0758] Predicted Rossmann fold nucleotide-binding protein involved in DNA uptake
TIGRFAM ID: [TIGR00732] DNA protecting protein DprA
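
The start, end, gene length, and protein length reported above are mutually consistent, and the relationship can be checked with a short calculation. The sketch below assumes 1-based inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


length = gene_length_bp(526077, 527204)
print(length, protein_length_aa(length))  # 1128 375
```

Both values match the Gene Length and Protein Length fields of this record.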


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0740883
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 79
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACCCC GGGAAGCAGC CATAGCCCTG AACCTGATAC CGGGCCTGGG CCCTGTCAGA 
ATCATGCGTC TCCTGCAGGT ATTCGCCTCT CCGGAACTGA TTCTGGAATC CCCTTCCTCC
CTGTTGATGG AAATTCCCGG CGTGGGGGCG CAGCTGGCTT GCCGCATTTC CTCTTGGCGG
AGCACCGTCA ACCCGTACAG GGAACTGGAA CTGGCGGATA ACGCGGGGGC CGCGGTAACC
ACGGTTTTTG ACGACTCCTA CCCATCCTCC CTGCGTGCTC TCCCAGACCC GCCCATTGTC
CTCTACTCCT GGGGAAACTG GACCGGAACG GATGCCGAAC GCTCCATTGC CGTCGTCGGC
TCACGGATGG CCACCCATTA CGGCAGGCTT TGCGCCAGAA ACATCTCCCA TGACCTGGCG
GAAGCCGGAA TAACTGTCAT TTCCGGACTG GCGCGCGGCG TGGACACGGA AGCCCATACC
GGGGCCATGG ACGCGGAGGG GCGCACCATC GCCGTTATCG GGGCAGGTCT CAACAAACTG
TATCCTCGGG AAAACAGAAA CTTGGCGCAG CGCATTGCGG ACGGGCATGG AGCGGTAGTC
TCCGAGTTCC CGATGGACCT GCCCCCCTCC CGCACCACTT TTCCCATGCG CAACCGCATC
GTGAGCGGCT GGAGCCGCGC TACGCTGGTG GTGGAGGCGT CCGGACGCAG CGGGGCCCTG
ATCACGGCCC GAACGGCGGC CGAACAGGGT CGAGACGTTT TTTGCATTCC CGGACCGGTT
GACCGGCATT CTTCCGACGG ATGCCATGCC CTCATCCGGG ACGGAGCCAT CCTAGCTACC
GGAGCCTCCG ATATTCTGCA GGACATGAAC TGGGCCGTTC CGGAACAGGG ACTGCCTCTC
TTCTCGCCAT GCTCCCCTGC CGGAGCCTCA ACCCCACCGC TTCCCACTTT GGAAGAAAAG
GAGATTCTCC ACGCCATCAG ACTGGGTTTC AATACTATTG ACACCCTCTG CACCTCTCTG
GGAAAAGCGG CGCATACCAT CACCCCGCTT TTGGCCAAAA TGCAAATTGC AGGGCAAATT
ACTCCGGACG CCGGAGGGTA TTTCTCCATT AACGGCAGGG AACTTTAG
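
The gene sequence is printed in blocks of 10 bases, so it needs to be stripped of whitespace before any computation. As a minimal sketch, the GC fraction of such a block-formatted sequence could be computed as follows. The helper names are hypothetical, and how the database rounds its reported GC content is not stated in this record.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# Small illustrative input, not the full gene: 6 of 8 bases are G or C.
print(gc_fraction("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence text to `gc_fraction` gives a value that can be compared with the GC content field in the Gene Information section.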
 
Protein sequence
MTPREAAIAL NLIPGLGPVR IMRLLQVFAS PELILESPSS LLMEIPGVGA QLACRISSWR 
STVNPYRELE LADNAGAAVT TVFDDSYPSS LRALPDPPIV LYSWGNWTGT DAERSIAVVG
SRMATHYGRL CARNISHDLA EAGITVISGL ARGVDTEAHT GAMDAEGRTI AVIGAGLNKL
YPRENRNLAQ RIADGHGAVV SEFPMDLPPS RTTFPMRNRI VSGWSRATLV VEASGRSGAL
ITARTAAEQG RDVFCIPGPV DRHSSDGCHA LIRDGAILAT GASDILQDMN WAVPEQGLPL
FSPCSPAGAS TPPLPTLEEK EILHAIRLGF NTIDTLCTSL GKAAHTITPL LAKMQIAGQI
TPDAGGYFSI NGREL
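
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this on the first and last stretches of the gene. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the standard
# amino-acid assignments ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


print(translate("ATGACACCCC GGGAAGCAGC CATAGCCCTG"))  # MTPREAAIAL
print(translate("AACGGCAGGG AACTTTAG"))              # NGREL
```

The two outputs match the first ten and last five residues of the protein sequence above.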