Gene Amuc_1626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1626 
Symbol 
ID6275426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1959219 
End bp1961285 
Gene Length2067 bp 
Protein Length688 aa 
Translation table11 
GC content55% 
IMG OID642613686 
ProductRNA polymerase, sigma 70 subunit, RpoD family 
Protein accessionYP_001878227 
Protein GI187736115 
COG category[K] Transcription 
COG ID[COG0568] DNA-directed RNA polymerase, sigma subunit (sigma70/sigma32) 
TIGRFAM ID[TIGR02393] RNA polymerase sigma factor RpoD, C-terminal domain
[TIGR02937] RNA polymerase sigma factor, sigma-70 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAAAT CCGGTGACCC ATCCCCCTCT TCTCCCAAGA AGACTTCCGC CAACAAAAAA 
GCTTCTGAAT CCAAGGAAAA AATAGCGGCC CGGCCCGCCG CAAAAAATTC CAAAACCGCT
CCGAAGGCGG CTCCTGTCAA AAAAAACTCT GCCAAGGCTG CCGCTCCGGC TGCGGAAAAG
GCCGCCGCCA AGCCCGTTGC GGAAAAAAAG GCCCCGGCAA AAAAAGCGTC TCCGGCTTCT
GCCTCCAAGA AGGCGGCGCC GTCCGCCAAA AAGACTCCGG CAAAGGCGCC TGAAGCTGTC
CCTGCCAAGA AAGCTCCAGC CAAGGCTGCG GAGAAGAAGC CGGCTGAGAA GAAGGCCGCC
AAACCTGCCG CGAAGGAGGA AACGCCCAAA AAATTAAGCG AAATACTGGA AGAGGAACGC
AAGAGGGCTG CCAGCCGCAA GATCACGAAT CCGATTGACG CTCCGGAAAT TCAGGAAAAA
ATCCGTGAAC TTATCAAGCT CGCCAAGGAA CAGGGGTATC TGACTTTTGA CGATATTAAT
GACTCCCTTC CCAACGACAT TGTCGATCAG CAGGATTACG AGGCCATCAT GGACCGCCTG
CGCAGCATGG CGTTTGACAT CATTGACGCG TCTGATGTGG ACAGCTACAC GGACCGCACC
CGCATCAGCA CGGAAGAAGA GGACGAAGAG GAAAAGCTCG AAGCCAAGAT GGACATTCTG
GATGATCCCG TCCGCATGTA CCTGAAACAA ATGGGCCAGG TTTCCCTGCT TACCCGTGAA
GAGGAAGTGG CCATTTCCAA GAGAATTGAG GACGCGGAAC AGAACGTCCA GCGCTGTGTG
CACCGCTTCG GTTTCATCGC GAATGCCTAT CTGGACGTAG CCTATCGCCT GCTCGACAAT
GAGGAACGCT TTGACCGCGT TATCCTTGAC AAAAAGATAG ACTCCCGTGA GCGCTATATG
AAAGGGCTGG CCCAGCTCTG CGCCCAGATA CAGCAGACCC ATCAGGACGC ATCCGGTTCC
TTCCGCAAGC TGTACCGCAG CAAGGAGGCG GCCAAGTCCG TCAAGGCCCG CCAGGCCGAA
TTCGACAAAG TTGCCGGCGC TTTGGTGAAG TTCTTCGGTC GCCTGTATTT CAAGCACAAG
GTGATTGAGG ATTTCTGTTC CATGATTGAC GAGGCCCGCG ACCGCGTGCT CCGCATGCAG
AAGAAGGTTG CCCTGGAACC GGACAACAAG GAACTCAAGG AACATCTGGC AGAGCTGGAG
CTTCGCATGT GGATGACTGC GGATGAGATG GGAGATGCCT ACCAGGAACT CCGCAAATGG
CTTCGCGAAG CCCGCAGGGC CAAGGATGAA ATGGTGGAGG CCAATCTGCG CCTGGTCATT
TCCATTGCCA AAAAATATAC CAACCGCGGC CTCTCCTTCC TGGATCTGAT TCAGGAAGGC
AACATGGGCC TGATGAAAGC GGTGGAAAAA TTTGAATACC GCCGCGGCTA CAAATTCTCC
ACCTATGCCA CCTGGTGGAT TCGCCAGGCC ATCACCCGCT CCATTGCAGA CCAGGCCCGC
ACCATCCGCA TTCCCGTGCA CATGATTGAA ACCATCAATA AATTGATGCG CGTGCAAAAG
CAGTTGGTGC AGGAGTACGG TAGGGAACCC ACTCCGGAAG AAATCGCGGA AGAAATCCAT
CTGCCTGTGG AACGCGTGCG TTCCGTTTTG AAAATGGCCC AGCAGCCCAT TTCCCTGCAA
GCCCCCGTGG GGGATTCCGA CGATACCTCC TTTGGTGACT TTATTGAAGA CAAGGCGGCG
GAAAATCCCA TGGAGGAAGC CTCCTTCTCT TTCCTGAAAG AAAAAATCAA GGATGTTCTG
GACACCCTCA CGGATCGTGA ACGCGAAGTT ATTGAACAGC GCTTCGGCCT CCGGGACGGC
AGTCCCCGCA CTCTGGAGGA AGTGGGCCGC CAGTTCAGCG TCACCCGCGA ACGCATTCGC
CAGATTGAAG CCAAAGCCCT CCGCAAGCTG CGCCATCCCA CGCGCATCAG CAAGATCAAG
GGATTCCTGG AAATGACGGA ATCCTAA
 
Protein sequence
MSKSGDPSPS SPKKTSANKK ASESKEKIAA RPAAKNSKTA PKAAPVKKNS AKAAAPAAEK 
AAAKPVAEKK APAKKASPAS ASKKAAPSAK KTPAKAPEAV PAKKAPAKAA EKKPAEKKAA
KPAAKEETPK KLSEILEEER KRAASRKITN PIDAPEIQEK IRELIKLAKE QGYLTFDDIN
DSLPNDIVDQ QDYEAIMDRL RSMAFDIIDA SDVDSYTDRT RISTEEEDEE EKLEAKMDIL
DDPVRMYLKQ MGQVSLLTRE EEVAISKRIE DAEQNVQRCV HRFGFIANAY LDVAYRLLDN
EERFDRVILD KKIDSRERYM KGLAQLCAQI QQTHQDASGS FRKLYRSKEA AKSVKARQAE
FDKVAGALVK FFGRLYFKHK VIEDFCSMID EARDRVLRMQ KKVALEPDNK ELKEHLAELE
LRMWMTADEM GDAYQELRKW LREARRAKDE MVEANLRLVI SIAKKYTNRG LSFLDLIQEG
NMGLMKAVEK FEYRRGYKFS TYATWWIRQA ITRSIADQAR TIRIPVHMIE TINKLMRVQK
QLVQEYGREP TPEEIAEEIH LPVERVRSVL KMAQQPISLQ APVGDSDDTS FGDFIEDKAA
ENPMEEASFS FLKEKIKDVL DTLTDREREV IEQRFGLRDG SPRTLEEVGR QFSVTRERIR
QIEAKALRKL RHPTRISKIK GFLEMTES