Gene Amuc_0467 details


Gene Information

Locus tag: Amuc_0467
Symbol:
ID: 6274782
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 555713
End bp: 557056
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 51%
IMG OID: 642612517
Product: protein of unknown function DUF344
Protein accession: YP_001877086
Protein GI: 187734974
COG category: [L] Replication, recombination and repair
              [R] General function prediction only
              [S] Function unknown
COG ID: [COG0494] NTP pyrophosphohydrolases including oxidative damage repair enzymes
        [COG2326] Uncharacterized conserved protein
TIGRFAM ID:
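
The start, end, and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the gene length includes the stop codon (the gene sequence in this record does end in TGA). The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Number of residues for a CDS whose length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(555713, 557056)
print(length)                     # 1344
print(protein_length_aa(length))  # 447
```

Both results agree with the Gene Length and Protein Length fields listed in the record.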


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 59
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATACCC CCCCCACAGC AAAAAACTGG AGACTCAACG CAGCAGCCAT CATCATGGAC 
GCTGAAGGCT GCGTATTGCT GGGCAAGGAC AGCGGCCGCA ACCCGTACTG GCACTTTCCG
CAGGGAGGCG TCATTAAACA CGAAAGCATT GAACAGACTC TTGCGCGGGA GGTATGGGAG
GAAGTGGGCC TGCGCCCCAC GGAATACACC ATTGTCAGCC GCCTGTCCGG CCTGCGTTAC
AAATACCCTT CCGGCAACCG TAAAGTTACG CGCTGGATAG GCCAGGAACA AACCTACTTT
CTTGTGCGCT GCAAAACCAG GCGCCCTAAA ACGGATTTGC ACCGCAGCCC CGAATTTTCA
AAAACGAAAT GGATTCCCCT CCAGAATCTC AAACTGGAAA TGTTCCCCAA ATTCAAAAGG
AAAGTCATCA AAAACGCCCT TCAGCAATTC TTCGGGCCCG GCTTTCCTTC CAAACACGCC
GCTGTGAAAA CATCTTCCCC CTCCTCACCC TCATCTTCAA CACTAACTTC GCGTACGATG
AACCGTTACC TGGTGCCTCC GGGCAAAAAA CTGCGTTTAA AGGATTATTC TCCGGATGAC
AAATCTCTCT TCTCCGGAAC CAAGGAAGAA TCCCTGATTG AATTCGACAA ACTGAGGGAA
GAACTGCAGG AACTGCAAAA AAAACTTTTT GCTCAGCACA AGCACAAAAT TCTGGTTATT
CTTCAGGCCA TGGATGCAGG AGGCAAGGAC GGCTGCGTCA AGCATGTCTT CTCCCGGGTG
GATCCGCAGG GGCTGCACGT AGTCCCCTTC AAAAAACCCA CTACTGAGGA ACTGGACCAC
GATTTCCTGT GGCGCGTTCA CAAGGAGGTC CCCGCCAAAG GGCAGATCGC CATCTTCAAC
CGTTCCCATT ACGAAGATAT CATTGCCGTC CGCGTGAAAA AAATCTTCCC GGACCCAGTC
TGGAAACGCC GCTACAAGCA CGTCCTCGAC TTTGAAGCCA TGCTTGCGGA AGAAGGCACC
GTCATCATCA AGCTATTCCT GAATATCTCC AAGGCGGAAC AGAAAAAACG GCTGGAATCC
AGACTTCAGG ACCCGGATAA ACTTTGGAAA TTCTGCATGG ATGACCTGGA TGACCGAAAT
CGTTGGGATG AATTCCAGAC AGCCTACCAG GATCTCATTG AAAAAACATC TACTCCGGAA
GCTCCCTGGT ACATTATCCC GGCAGACCGG AAATGGTACA GAAATCTGGT TGTCGCCCGC
CTGATGGTAG AAAAACTGCG CCATCTCCAG CTTTCGCTCC CCACTCCCAA CTTTGATCCA
GCCTCCATCA TCATTCCAGA TTGA
 
Protein sequence
MDTPPTAKNW RLNAAAIIMD AEGCVLLGKD SGRNPYWHFP QGGVIKHESI EQTLAREVWE 
EVGLRPTEYT IVSRLSGLRY KYPSGNRKVT RWIGQEQTYF LVRCKTRRPK TDLHRSPEFS
KTKWIPLQNL KLEMFPKFKR KVIKNALQQF FGPGFPSKHA AVKTSSPSSP SSSTLTSRTM
NRYLVPPGKK LRLKDYSPDD KSLFSGTKEE SLIEFDKLRE ELQELQKKLF AQHKHKILVI
LQAMDAGGKD GCVKHVFSRV DPQGLHVVPF KKPTTEELDH DFLWRVHKEV PAKGQIAIFN
RSHYEDIIAV RVKKIFPDPV WKRRYKHVLD FEAMLAEEGT VIIKLFLNIS KAEQKKRLES
RLQDPDKLWK FCMDDLDDRN RWDEFQTAYQ DLIEKTSTPE APWYIIPADR KWYRNLVVAR
LMVEKLRHLQ LSLPTPNFDP ASIIIPD
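
The gene and protein sequences above can be cross-checked against each other. The sketch below is one way to do that in plain Python, not the method used to produce this record. It assumes the sequence is pasted as shown, in space-separated blocks of ten, and it relies on the fact that translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two tables differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_content` are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_content(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGATACCC CCCCCACAGC AAAAAACTGG"))  # MDTPPTAKNW
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the GC content field.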