Gene Amuc_1675 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1675 
Symbol 
ID6274481 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2029859 
End bp2030950 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content61% 
IMG OID642613734 
Productintegrase family protein 
Protein accessionYP_001878274 
Protein GI187736162 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTTGA CAAGTTTCGT TATTTTGGGC TTGATGAAGA GTATTATTTA TCCTGTTGCC 
ATGAACACCG ACGCTAGCAC CGCCCTTGCC CTCCTCGCCT CTCTCCCCTT CTCCCTCACC
GATGTTGCTC GTCTCATGCT GGAGCTGGTC GAAGGAAGCG GAGGCTCTTC CGTAAGAAAG
AAAGAAGCTC TGCTCCTTCA CTGCCGCCGC GTCATCGCCT TGGGGTGCGA AGCCGAGCAT
CTCGCCACGC AAACGGTCTC CTTCTCCAAA GCCGTGGCAG AAACCCTGCG CGTCAAGGCC
GACCGTTCGG CGCTCACCCT GCGCGATATC CGCTCCTTCA CGCAGAGGAT GATGCGAGAT
GTGCCCGATC TGGCGGCGCG TCCCATGCGC TCGATGACGA CGGCGGACTG TGCGGCTGTG
TTGGAGAAAG TCTTTCTTTC ACCCTCGCAA CGTCGTCATG CGCGGGCGAT TCTCTCGGGA
GTGTTCACGG TGGCGTGGAA GAAAGGCTGG TGTGCGCATA ATCCCGTGCG GCTGGTGGAT
GTGCCGCGCG TGACGGAGCG GGAGATCGTC CCCCTCCGTA TCGAGGAAGT GCGCCGCCTG
CTGCGGACGG CAGAGAGGGA GGAGTTTTCC CCCTGTGCGG CGGGCGTGGT GATGATGCTC
TACGGCGGGA TTCGTCCGTA TGAGGTGCGG CGTTTGACCT GGGGCGATGT GGATTGGGAG
GAGGGGGAGG TGCGGATTCG CCCGAGGCAG AGCAAGACGG GCGGCGGTCG GCAGGTGCCG
CTGTCTCGCT CCGTGTTGGC ATGGTTGAGG AGTTATTATC CGCAGGGGGC AGAGAGGGAG
TCGGTTTTCA TCTGTCCGCC AGATTGGAAC AGGCGGTGGC GCGCTTTGCG CTCGGCAGCG
GGTTTTCAGA CATGGCGTCA GGACGTCCTG CGGCATACCT TTGCGTCGTA TCATGCGAAG
ATGTTTCACG ATTGGGGGCG TTTGCAGGCG GCGATGGGGC ATCGGGACGG GACGCTGTTG
CAGACGCGCT ATGTGCATAC GCAGGGGATT CGAGGATGCG AGGTGCGAGC ATTTTGGGAG
TTGGCGGCGT GA
 
Protein sequence
MFLTSFVILG LMKSIIYPVA MNTDASTALA LLASLPFSLT DVARLMLELV EGSGGSSVRK 
KEALLLHCRR VIALGCEAEH LATQTVSFSK AVAETLRVKA DRSALTLRDI RSFTQRMMRD
VPDLAARPMR SMTTADCAAV LEKVFLSPSQ RRHARAILSG VFTVAWKKGW CAHNPVRLVD
VPRVTEREIV PLRIEEVRRL LRTAEREEFS PCAAGVVMML YGGIRPYEVR RLTWGDVDWE
EGEVRIRPRQ SKTGGGRQVP LSRSVLAWLR SYYPQGAERE SVFICPPDWN RRWRALRSAA
GFQTWRQDVL RHTFASYHAK MFHDWGRLQA AMGHRDGTLL QTRYVHTQGI RGCEVRAFWE
LAA