Gene Amuc_0021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0021 
Symbol 
ID6275215 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp26737 
End bp28134 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content62% 
IMG OID642612061 
ProductRNA polymerase, sigma 54 subunit, RpoN 
Protein accessionYP_001876649 
Protein GI187734537 
COG category[K] Transcription 
COG ID[COG1508] DNA-directed RNA polymerase specialized sigma subunit, sigma54 homolog 
TIGRFAM ID[TIGR02395] RNA polymerase sigma-54 factor 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones58 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTGAGG TATCCCTGCA GCATGGAATG ACGCGCCAGA TGGCCGTCTC CCAGTCCATG 
CAGACAGGGA TGCAGATTCT TCAGGCTTCT GCTTTGGAAT TAAGGCAGAT TATCCGTCAG
GCGCTGGAAA CCAATCCCAT GCTTGAGGAG TTGCCGGACG CTTCCCCGGA AGCGCTGGAA
GACGGTCCGG ACGGGGATGC CTGGAATGAG CGGGAGGACG GGTGGAACGA GTTTACGGCG
GAAGGCCGCC TGTCTGGGGA CGCGGCCGCC CGCCGTGATT TCATGTATGA ATCCGTCGTG
GCTCCGGAAT CCCTGAAAAC CCATTTGATG GATCAGGCGC AGCATTCCGC ATTGACGGGC
CGCGCCCGGG ACGCCCTGTT CCTGCTGATT GACGCTTTGG ACGAACGCGG GTTTCTGACG
GAGTCTCCGC AGGAGCTGGA AGAGCAGGGG TGTTTCAGCA TGCGGGACAT GGAGGAAGCT
CTGGCTGCGC TGAGGGAAAT GGACCCGCCT GGAGTGGGGG CGGCGAATTT AAGGGATTCC
CTGCTGATCC AGCTTGAACA GCGCGGTTTG AAGCGTTCGC TTGCTTTCCG CCTGGTCAAG
CGCTGCTGGC GGGAACTGGC GGCGCACAAG TATGAGGAAG CCGCACGCCT GCTGGACGTG
GAACCGGGGG CCGTGGCGGC GGCTCTGGAG GTGATTAGGA GCCTGACTCC TGATCCAGGG
GGGGCCTACG CCCCCGGCGG CAACCCTCAT CTGCTGCCTG ACGTCATTGT GGAGGAGGGG
CCTTCCGGCG TGCTGGAGGT TATCCTCACT TCCGAGTATC TTCCGCGCCT GTCCATGAAT
GAACGGTATA TGGAGTTAAT GGCTGAGGGT TCCGGAAGCC GGGAGCTGAG GCAATATCTC
CGCAGGGCGT TCCGTGAGGG ACAGGAGCTG TTGCGGGCTT TGGATATGAG GCAGGAGACC
GTTTTGCGCC TGGCCCGTGT CATTGTGCGG AGGCAGGAGG ATTTTTTCAG GTCCGGCCCG
TCCCGCCTGA AGGCCATGGG CATGGAAGAA GTGGCGGAGG AGATGGGGGT TCATGTTTCC
ACGGTTTCCC GGGCGTGCCG GGACAAGTAC CTGTTGTGCA GGTGGGGGAT GAAAGAATTG
AGATCCTTTT TCAGTGCGGG AGTTCCGTCT GAAGGCGGCG TTTCCCCGGA CGGGGCGGCT
TCCGGAGCCG TGGCTGCCGG CGCCGTCCAA GAGCTGATGC GGCGGCTGAT TGCGGAGGAA
GATTCTTCCA AACCCCTCAG TGACGCCGGG CTGGCGGCCG CGCTGCGGGA AAAAGGGGTG
AACATCGCCC GCAGGACGGT GGCCAAATAT CGGGAGCAGA TGAAGATACT GCCCGCTTCC
CTGCGCCGGG GAATATGA
 
Protein sequence
MSEVSLQHGM TRQMAVSQSM QTGMQILQAS ALELRQIIRQ ALETNPMLEE LPDASPEALE 
DGPDGDAWNE REDGWNEFTA EGRLSGDAAA RRDFMYESVV APESLKTHLM DQAQHSALTG
RARDALFLLI DALDERGFLT ESPQELEEQG CFSMRDMEEA LAALREMDPP GVGAANLRDS
LLIQLEQRGL KRSLAFRLVK RCWRELAAHK YEEAARLLDV EPGAVAAALE VIRSLTPDPG
GAYAPGGNPH LLPDVIVEEG PSGVLEVILT SEYLPRLSMN ERYMELMAEG SGSRELRQYL
RRAFREGQEL LRALDMRQET VLRLARVIVR RQEDFFRSGP SRLKAMGMEE VAEEMGVHVS
TVSRACRDKY LLCRWGMKEL RSFFSAGVPS EGGVSPDGAA SGAVAAGAVQ ELMRRLIAEE
DSSKPLSDAG LAAALREKGV NIARRTVAKY REQMKILPAS LRRGI