Gene Amuc_2078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_2078 
Symbol 
ID6274006 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2527629 
End bp2530031 
Gene Length2403 bp 
Protein Length800 aa 
Translation table11 
GC content52% 
IMG OID642614140 
Productcapsular exopolysaccharide family 
Protein accessionYP_001878668 
Protein GI187736556 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG0489] ATPases involved in chromosome partitioning 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value0.325756 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAATA ATACCGCTCC CACTCCGCCT GTTACGGAAA ACACAGAAGA AACAGCTCTT 
TCCCTGGACA CGGTGCTGAC GATCCTGCGC CGTTACTGGT TCATCATCAT TCTGGCAGCT
CTCGCCGGAG GCACGGCGGC CTATTACCTG GCCGGCAAAC AGAATTATAT TTACCAAAAA
ACGGCCAGCG TCCTGATGCG CGATGCCAAA ACAGGCAGCG ACGCCTCTTC CGAACGCATC
ATGGCGGAAT TGAACATAGA CCCCAACGCC GCTAATCTGG CCAATGAAAG CCTCGTTCTC
AAATCCACCG CATTAATGAA CAAAGTGGTG GAAGACCTCA GCCTCAACAC ATCCTATTGG
CAAAAAAAAG ACTTCAGGGA GCTTGATCTT TACCATGCCA CCCCCTTATT GGTGCACTTT
GAACAGATCG ACAAACAACG AGCCTGCACC CTGAACATCA CGCCGCTGGA TGAAAAACGC
TTCATGCTTG GCCATCCCAA TGATCAGGGG GAACTCATCC TGCTGGAAGG TTTTTACGGA
AAACCGCTTA CGCTTCCCTT TGCCACCATT TCCGTCCATC CCACCTCCCT GATGACCGAC
GCATGGAACG GAAAAACCGT CATCGTAAGA CACTCTCCCG TTCTTGAAAC CGCCAACGCC
CTGCTCCGTG GCCTGACAAT TACCCGTCCA GACTCCAAGG AATCCAGCCT TCTGGAGATG
ACTCTGACAT CCAGCAATCC CCAGAAAGCC GAAGACACGC TCAACCACCT TATCCAGGTT
TACAACCAAA TTTCCAAGGA CGAACGGAAC AAGGCGTCCC TTAAAACGAA AATCTTCATT
AGGGATCGAC TAAAAGAACT TGGAGCCTCC CTGAGCGACG TGGACAAAAA ACTTACCGAA
TTTAAAACGA AGAGTGACAT CGTCAAAGAT GCGGACACAA CCATGAGCGC GGACTTCAGC
ACCTCCCAGG CGCTGGAAAA GGAAATCTTT GATCTTGAAA CCCAAATCAA ACTGGCGTCC
ACCCTTGCTG ACAATCTCAA GGAAAGCGAA CGCAAACATG GGCTGATCTC CGTAGAAACC
GGTCTTCCCG ATTCCGGCAT CGCCCGGCAG ATAGAACATT ACAATGAGGC TTATCTGGAA
TATCAGAAAA TCGCCGGAAG CGCCGGCTCC CAAAACCCGA TTGCCGTGAG CTTGAGGGAC
AGGATGAATT CCACCAGAGC GGCGGCTAAC AAAGCTCTCT CCAACTACCG CAGCAATCTG
GATCTCAAAC TTAACCAGCT TATTAACAAA AGGAATTCCC TGACTGAACG CCTGACGGAA
ACTGCCATCA AGGAACAGGA AATCATTCCG CTTATCCGTG AACACAAGGT TAAGGAAGAA
CTGTACCTGA TGCTGTTGAG CAAGGAACAG GAAAACGCCC TGGCCATGGC GGTAACGGAA
TCCAATGCCC GGGTACTGGA AACCGCCCAT GGCTCCAACC TCCCTATCTC TCCTAAAACC
ATTAAATACG TCGCCGGAGG AACGGCAGGC GGAGCCCTGC TCAGTATCCT GGCCTTCATG
GGAGCGGCCA TGTTGAACAA TAAGGTCAAC AACAAGCATG ACCTCCCCGC TGCAAACAGG
CAGCCGGTCA TTGCCGAACT GCCTCAAATG AGCAAAAAAG AAAGCAAAAA CACCAAGCTT
TTCATTCAGG ACGAACATTC CGTCATCGCG GAATGCTTCC ACATTCTGCG CAATAACGTA
GATTCCATGC TCCCCAGGCC GGAACAGGGA GGACACGTCA TTCTGGTCAC CTCCACCCTC
CCCGGAGAAG GGAAAACCTT CACCTCCGCC AATCTGGCCG CCGCTTTCGC CTATGCCGGC
AAAAAAGTAC TGCTTATTGA CGGGGATTTC CGCAAATCCT CCCTGACCCG GCGTCTCGGC
GGTTCCGGAC GCAAAGGACT CACTTCCATC CTGCTCCAAC AGACCACCGA CACCACCGGC
ATCATTCGCC CCCTGGGAGA AAACTCCCGC GGCATGGATA TCCTTTACAC CGGCCCCATG
GTGCCCAATC CGGTCACCCT GCTCAGCCAT CCCCTGTTGG GCCATATCCT CGGCATCCTG
AAAAAACAGT ATGATGCCGT CATCATCGAC GCTCCGCCCT ACGGCATTCT GGCAGACACC
GCCATTCTGG CATCCCTGAG CGATATTACC CTGTACGCCG TGCGCAGCGG AAAAATCGAC
AAACGGTATC TGCTCCAAAT CCAGCAACTG GCCGATCAGG GAAAACTGCC CAATATGGCG
TACATCATCA ACGGCGTCAA CTTCAAGTCC GCCAGCTACA GCTACTATGG CTATGGCTAC
GGCTACCAGT ATGGCTACGG GACCAAAGAA CCGCAGCAAA CCAGCAGGAA ACAAGATAAA
TAA
 
Protein sequence
MTNNTAPTPP VTENTEETAL SLDTVLTILR RYWFIIILAA LAGGTAAYYL AGKQNYIYQK 
TASVLMRDAK TGSDASSERI MAELNIDPNA ANLANESLVL KSTALMNKVV EDLSLNTSYW
QKKDFRELDL YHATPLLVHF EQIDKQRACT LNITPLDEKR FMLGHPNDQG ELILLEGFYG
KPLTLPFATI SVHPTSLMTD AWNGKTVIVR HSPVLETANA LLRGLTITRP DSKESSLLEM
TLTSSNPQKA EDTLNHLIQV YNQISKDERN KASLKTKIFI RDRLKELGAS LSDVDKKLTE
FKTKSDIVKD ADTTMSADFS TSQALEKEIF DLETQIKLAS TLADNLKESE RKHGLISVET
GLPDSGIARQ IEHYNEAYLE YQKIAGSAGS QNPIAVSLRD RMNSTRAAAN KALSNYRSNL
DLKLNQLINK RNSLTERLTE TAIKEQEIIP LIREHKVKEE LYLMLLSKEQ ENALAMAVTE
SNARVLETAH GSNLPISPKT IKYVAGGTAG GALLSILAFM GAAMLNNKVN NKHDLPAANR
QPVIAELPQM SKKESKNTKL FIQDEHSVIA ECFHILRNNV DSMLPRPEQG GHVILVTSTL
PGEGKTFTSA NLAAAFAYAG KKVLLIDGDF RKSSLTRRLG GSGRKGLTSI LLQQTTDTTG
IIRPLGENSR GMDILYTGPM VPNPVTLLSH PLLGHILGIL KKQYDAVIID APPYGILADT
AILASLSDIT LYAVRSGKID KRYLLQIQQL ADQGKLPNMA YIINGVNFKS ASYSYYGYGY
GYQYGYGTKE PQQTSRKQDK