Gene Amuc_1378 details


Gene Information

Locus tag: Amuc_1378
Symbol:
ID: 6275784
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1642372
End bp: 1643622
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 52%
IMG OID: 642613434
Product: integrase family protein
Protein accession: YP_001877983
Protein GI: 187735871
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
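
The Start bp, End bp, Gene Length, and Protein Length values above agree with each other, which a short sketch can confirm. It assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon; the function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(1642372, 1643622)
print(length_bp)                  # 1251
print(protein_length(length_bp))  # 416
```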


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.545263
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.00000000645491
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCTCCA TCTATAAAAA GCCGAACAGC CCTTACTGGT ACGCACAATA CCGCGTGAGA 
ACCGCTACAG GCTGGAAACT GGTCCGGCTG TCAACCAAAA TCAAGCATAC CCCCGCCACG
GTAACAAGGG AAGTAAAAGA AGCCGCAGAG TCCATGGGGA AGCAGCTGAA CGTCCTGACC
AGGGAACAGG CTATGACCAA GGCACAACGC CTGGCGGACG CCCTTGAATC AACGGCGCGG
GCAAACCTGC CGGCCTATCA ATTACGCCGG GCCATTTCCG CATTGTCCAC GGAATTGACC
GGAGAATCTA TGGAAATGCC CTCTGTCAAA TTATGGCTTG ATGACCACAT GCGGCGCATT
ACGCGCAATG GGCTTAAACC CGCATCCATA GCGAACTACA AACAAGCCTT TGACAAATTT
CGCGCCTCAA TGGGAGAACG TATCAACCTG CCTCTGGATC GCATTACTCC TCTGATGCTG
GACGATTTCA AAAACCATCT TCTTTCCCGT GTCTCACCAT CTACCGCCAA TATTGCTCTT
ACGCTGGTTT CCGCGGCGTT CCAGGCGGCA GTTGATTATA AAATTATTGA AACCAACCCC
TTTACGGCGA TTACCAAGCC TCACAAGGGG AAAGCCGTCA AACGGCGGAA ATTCGAATTG
GAAGAGCTTG AAAAGGTAAT GGCCGCATGC AATCCGGAAT GGCGCTCCAT GGTGAAAACG
TGCCTCTATA CGGGCGGTCA AAGATTGGGA GACGTGGCAA CGCTCCGGTG GTCCCAGGTT
GACGAGAAAC GAGGCGTTAT CCGGATGACC ACGCAGAAAA AGGGAAAGCC TCTGATGATT
CCGATTTTTC CGGCGCTGAA AAAACACCTG CAGCAACGGA AGAAAGAAGC TCCTGGGGAC
TTCCTGCATC CTGAATGCGC GAATATTTTT GAAAGCAAGG GATCCGGACG CCTGTCAAAT
ATCTTTAGCC ACATCCTGTA CCAGTGTGGC CTTATTGCCA AAGACCCTCT GGCTGCAGGC
AAAAAATACA AAAAGCAGGA AGGAAACGGC ACAGAGACGC GGCGCCACGT CAATGAATTG
TCCTTCCACA GCCTCCGCTA TACGGCAACA ACCATGTTAC ATGACGCCGG TGTTCCCCCT
GCTCTTGTGC AAGCCATTGT GGGGCACGAT TCCCGGGAAG TCCATGAAGG ATACATCGAC
TTTGGAGCCA AGGAGTTTAC ACAAGCCCTT GAAAAGCTTC CCAAATTGTA G
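
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before any calculation. The following is a minimal sketch of doing that and computing a GC percentage comparable to the GC content field. The helper names are hypothetical, and only the first displayed row is used as sample input; pasting the full sequence in its place would give the figure for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Drop the spaces and line breaks used for display and upper-case the bases."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed row of the gene sequence, copied from the record.
first_row = "ATGGCCTCCA TCTATAAAAA GCCGAACAGC CCTTACTGGT ACGCACAATA CCGCGTGAGA"
print(len(clean_sequence(first_row)))
print(gc_percent(first_row))
```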
 
Protein sequence
MASIYKKPNS PYWYAQYRVR TATGWKLVRL STKIKHTPAT VTREVKEAAE SMGKQLNVLT 
REQAMTKAQR LADALESTAR ANLPAYQLRR AISALSTELT GESMEMPSVK LWLDDHMRRI
TRNGLKPASI ANYKQAFDKF RASMGERINL PLDRITPLML DDFKNHLLSR VSPSTANIAL
TLVSAAFQAA VDYKIIETNP FTAITKPHKG KAVKRRKFEL EELEKVMAAC NPEWRSMVKT
CLYTGGQRLG DVATLRWSQV DEKRGVIRMT TQKKGKPLMI PIFPALKKHL QQRKKEAPGD
FLHPECANIF ESKGSGRLSN IFSHILYQCG LIAKDPLAAG KKYKKQEGNG TETRRHVNEL
SFHSLRYTAT TMLHDAGVPP ALVQAIVGHD SREVHEGYID FGAKEFTQAL EKLPKL
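
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table in plain Python; translation table 11 assigns the same amino acid to each codon as the standard code and differs only in which codons may start translation, so the standard assignments are used here. The names CODON_TABLE and translate are illustrative, and only the first displayed row of the gene is translated.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(blocked_dna: str) -> str:
    """Translate a DNA string (display spaces allowed) until the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_row = "ATGGCCTCCA TCTATAAAAA GCCGAACAGC CCTTACTGGT ACGCACAATA CCGCGTGAGA"
print(translate(first_row))  # MASIYKKPNSPYWYAQYRVR
```

The output matches the first twenty residues of the protein sequence listed above.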