Gene Amuc_1934 details

Gene Information

Locus tag: Amuc_1934
Symbol:
ID: 6275248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 2347223
End bp: 2348329
Gene Length: 1107 bp
Protein Length: 368 aa
Translation table: 11
GC content: 56%
IMG OID: 642613994
Product: hypothetical protein
Protein accession: YP_001878528
Protein GI: 187736416
COG category: [R] General function prediction only
COG ID: [COG3489] Predicted periplasmic lipoprotein
TIGRFAM ID:
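
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, although the numbers are consistent with both. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2347223
end_bp = 2348329
gene_length_bp = 1107
protein_length_aa = 368

# Assumption: coordinates are 1-based and inclusive.
span = end_bp - start_bp + 1

# Assumption: one codon per residue plus a single terminal stop codon.
expected_aa = span // 3 - 1

print(span == gene_length_bp)            # gene length matches the coordinates
print(span % 3 == 0)                     # length is a whole number of codons
print(expected_aa == protein_length_aa)  # protein length matches the gene length
```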


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.0548586
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACGCAT ACAAACGCAT GTTGCTGGCC GGCTTTTGCG CCGGAAGTAC AATCCTGGCT 
TTCGGTAGCC AGGCTCAGGC GGCCAAGGCC GCTTCCCCCA AGGTGGACAA AACGAGCGCG
GCGATTGCCG CCTACGCGGA TGAGCTTGTC ATTCCCACCT ACAAGACGAT GAGTGATAAC
GCTCTCAAAT TCGCCAAGGC CGCTAAAGAG CTGAAGGCTG CTCCTACCGA TGCCAAGGTT
GCTGAAGCAG GGAAGCTTCT TCTTGAAACG CGCGTGCCGT GGGAACTTTC AGAATCCTTC
CTCTTTGGTC CGGCTGCTTT CGCCAACCTT GACCCGAAGC TGGACTCCTG GCCCCTGGAT
ACCACGAACC TTGACGCCGT CGCCAAAAAC GCGGACAGCA AGAGCGTAAC GATTGACGCC
GCCTACGTCC GCAATTCCCT CGGCGCGGAA ACGCGCGGTT TCCATGCTGC CGAATACCTC
TTGTTCCGAG ATGGGCAACC CCGCAAGGCC GCCGATCTGA CGCCCGGACA GCTTTCCTAC
CTTGCCGCTG TGGCCGAGGT GATTGCAGAG GATGCCATCA CGCTTGAAGC CTGGTGGGCG
GGTTCTGACA AGATCAGCGA AGAGAAGGCC AAGATTCTTG AAGAAGCTGA AATAGAACCC
GGCAAGTCCT ATGCGGGGGA ATTCAAAAAA GCTGGCCAGG CAGGCAGCCG CTACGAGTCA
AACTCTGAAG TGCTTGATGA AATCATCGGC GGCAGCAAGG ACATTATTGA CGAAATAGCC
GATTCAAAGG TGGGCAAGCC CTACGAAACG GCTGACGCCG CCGACTGTGA ATCCCTTTAC
AGCTACACTT CTCTGGTGGA CTCCCGCCAC AACGTACAGA GCGTAGAAAA ATCCTACAAT
GCGATTTCCC CTCTCGTAGC CGCCAAGTCC GCGAAGGTTG GCCAGGCTGT GAAAGGTTCT
ATCGCCAAGG TATTCAAGAG CCTCGACGCC ATACAGGGCC CCCTGGTGAA AAACCTCGAC
AAAAAGGAGC AGCTCAAGGC AATCATCGAC AGTTGCAAGG AACTTTCCGA AAACCTCGAC
AAGGTTCAGG AACTTCTTGT GAAGTAA
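
As a rough illustration of how the gene sequence relates to the protein sequence and the GC content field, here is a minimal sketch in plain Python. It assumes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard code (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `gc_content`, and `translate` are hypothetical and are not part of any database tool.

```python
from itertools import product

# Codon assignments in TCAG order; shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used to block the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence above (both begin on a codon boundary).
first_row = "ATGAACGCAT ACAAACGCAT GTTGCTGGCC GGCTTTTGCG CCGGAAGTAC AATCCTGGCT"
last_row = "AAGGTTCAGG AACTTCTTGT GAAGTAA"

print(translate(first_row))  # MNAYKRMLLAGFCAGSTILA
print(translate(last_row))   # KVQELLVK
print(round(gc_content(first_row), 1))
```

Passing the full 1107 bp sequence to `translate` and `gc_content` in the same way would allow a comparison against the protein sequence and the GC content reported in the record.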
 
Protein sequence
MNAYKRMLLA GFCAGSTILA FGSQAQAAKA ASPKVDKTSA AIAAYADELV IPTYKTMSDN 
ALKFAKAAKE LKAAPTDAKV AEAGKLLLET RVPWELSESF LFGPAAFANL DPKLDSWPLD
TTNLDAVAKN ADSKSVTIDA AYVRNSLGAE TRGFHAAEYL LFRDGQPRKA ADLTPGQLSY
LAAVAEVIAE DAITLEAWWA GSDKISEEKA KILEEAEIEP GKSYAGEFKK AGQAGSRYES
NSEVLDEIIG GSKDIIDEIA DSKVGKPYET ADAADCESLY SYTSLVDSRH NVQSVEKSYN
AISPLVAAKS AKVGQAVKGS IAKVFKSLDA IQGPLVKNLD KKEQLKAIID SCKELSENLD
KVQELLVK
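
The blocked protein sequence above can be converted into a FASTA record for use with other tools. The following is a minimal sketch; the header layout (accession, product, organism in square brackets) is an assumed convention chosen for illustration, not something the record prescribes.

```python
# Fields copied from the record above.
accession = "YP_001878528"
product_name = "hypothetical protein"
organism = "Akkermansia muciniphila ATCC BAA-835"

# Protein sequence as displayed, in blocks of ten residues.
blocked = """
MNAYKRMLLA GFCAGSTILA FGSQAQAAKA ASPKVDKTSA AIAAYADELV IPTYKTMSDN
ALKFAKAAKE LKAAPTDAKV AEAGKLLLET RVPWELSESF LFGPAAFANL DPKLDSWPLD
TTNLDAVAKN ADSKSVTIDA AYVRNSLGAE TRGFHAAEYL LFRDGQPRKA ADLTPGQLSY
LAAVAEVIAE DAITLEAWWA GSDKISEEKA KILEEAEIEP GKSYAGEFKK AGQAGSRYES
NSEVLDEIIG GSKDIIDEIA DSKVGKPYET ADAADCESLY SYTSLVDSRH NVQSVEKSYN
AISPLVAAKS AKVGQAVKGS IAKVFKSLDA IQGPLVKNLD KKEQLKAIID SCKELSENLD
KVQELLVK
"""

# Strip the block spacing and line breaks to get one contiguous string.
sequence = "".join(blocked.split())

# Assumed header layout; adjust to whatever the downstream tool expects.
header = f">{accession} {product_name} [{organism}]"

# Wrap the residues at 60 characters per line.
lines = [sequence[i:i + 60] for i in range(0, len(sequence), 60)]
fasta = "\n".join([header] + lines)

print(len(sequence))  # 368
print(fasta)
```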