Gene Amuc_1799 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1799 
Symbol 
ID6274674 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2186319 
End bp2187326 
Gene Length1008 bp 
Protein Length335 aa 
Translation table11 
GC content56% 
IMG OID642613863 
Productuncharacterized Fe-S center protein, putative ferredoxin 
Protein accessionYP_001878398 
Protein GI187736286 
COG category[R] General function prediction only 
COG ID[COG2768] Uncharacterized Fe-S center protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.160059 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.00269064 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGAAAATA AGATGACGAG AAGAACATGG CTCCAGTCTT CCACCGCAGC CCTGGCATGC 
CTGGCTTTGC CGGAGTATGC CCTGGGCGCG GTTGCGGAGA AGGGTGGAGC TTCCAGGGTC
TGGATGACGA AGGAAATTTC TCCGGAGGCT CTCGTGAGGA TTTACGAGGC TCTGGGGCGT
CCGGCCGGGG GAAAGGTAGC TGTCAAGATC AGCACCGGGG AACCGGGAGG CCGCAATTTT
CTGAGTCCGG CCTTAATCAA AGACCTGGTG CGCCGGGTGA ACGGCACCAT TGTGGAATGC
AATACGGCTT ACGGAGGCAA ACGCTCCCGG ACGGAGGACC ATTTGCAGGC TGCGAAGGAT
CACGGTTTTT CCGATATTGC GCGGGTGGAC ATCATGGATG CGGAAGGGGA GTTCACTATC
CCGGTGAGGG ACAGGAAGCA CCTGGAATAC GATATCGTGG GGGATCATTT AAAGAATTAT
GATTTCATGG TCAATCTGGC CCATTTCAAA GGGCATGCCA TGGGCGGCTT CGGCGGTGTG
ATCAAGAACC AGTCCATCGG TGTTGCCTCG GCAGCCGGGA AGGCGTACAT CCATTCCGCC
GGAAAGACGC GGGATGTTTC CTCCGTGTGG AACAATCTGG CCAGTCAGGA TGATTTCCTT
GAGTCCATGG CGGCTTCCGC GCAGGCGGTG GCGGATTACT TCGGGGACAG AATTTTGTAC
ATCAATGTGA TGAATAATCT GTCCATTGAC TGTGATTGCG ATTCCCACCC CCATGCGCCG
GAAATGAAGG ACATCGGTAT TCTGGCCTCC CTTGATCCGG TTGCTCTTGA CCAGGCCTGC
CTGGATCTCG TTTACGCCGT CAGGCCGTCC GAAGGGAATG ACAACAGGCC CCTGGTGGCG
CGTATTGAAA GCCGCCATGG ACGGCATACG GTAGAGTATG CCGAGAAGAT AGGTCTTGGC
AGCAGGAAGT ATGAACTGAA AGAGCTGAAA CCGCAGCAGG CCGTTTAG
 
Protein sequence
MENKMTRRTW LQSSTAALAC LALPEYALGA VAEKGGASRV WMTKEISPEA LVRIYEALGR 
PAGGKVAVKI STGEPGGRNF LSPALIKDLV RRVNGTIVEC NTAYGGKRSR TEDHLQAAKD
HGFSDIARVD IMDAEGEFTI PVRDRKHLEY DIVGDHLKNY DFMVNLAHFK GHAMGGFGGV
IKNQSIGVAS AAGKAYIHSA GKTRDVSSVW NNLASQDDFL ESMAASAQAV ADYFGDRILY
INVMNNLSID CDCDSHPHAP EMKDIGILAS LDPVALDQAC LDLVYAVRPS EGNDNRPLVA
RIESRHGRHT VEYAEKIGLG SRKYELKELK PQQAV