Gene Amuc_0957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_0957 
Symbol 
ID6274206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1142034 
End bp1143329 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content58% 
IMG OID642613011 
ProductNusA antitermination factor 
Protein accessionYP_001877570 
Protein GI187735458 
COG category[K] Transcription 
COG ID[COG0195] Transcription elongation factor 
TIGRFAM ID[TIGR01953] transcription termination factor NusA 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones65 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAAACG ATATTAAAGC TTTGATTGAC TACTACGAGA GGGAAAAAGG GCTCTCCCGT 
GAAAAAATTC TCCTCGCTCT GGAGTCCGCC TTTCTCTCCG CCTACCGCAA AATGGTTCCC
GGTTCCGGCA GCATCAACTA CCTGAGGGCC GAAATCAACG TGGACAAGGG CAAGGTGCGT
ATTTTTGCGG ACCTGGAAGT GGTGCCGGAT GAAGAATATT CCGACAAATT CAACCAGATC
CCCCTTTCCC TGGCCGTCAA GCTGGACAAA AACGCCGTGC TTCACGACCT GCTCCCCACC
AACATCACGC CCAAGGGCTT CGGCCGCATC GCGGTGCAGA CCGCCCGCCA GACCATGCTC
CAGAAACTGC TGGATGCGGA AAAGGAAATG CTCTACGACG AATTCAAGGA CCGCGCCGGA
GATCTGGTAA CGGGCACCAT CCGCCGTTTT GAAAAGGGGG ATATTTTTGT GGACCTCGGC
AAATTCGAAG GCGTCATGAC CTCCCGCGAA CGCGTGCCGA ATGAAGACTA CAGCGTCGGC
GACCGCATGC GCTTCTACGT GGTGGAAGTG CGCACGGAAG CACGCGGCCC GGAAGTCATC
CTTTCCCGCA GCCATCCGAA CCTGGTGCGC CGCCTCTTTG AATCGGAAGT GGTGGAAATA
GGCGACCAGA CCGTGGAAAT CCACGGCATC GCCCGCGAAG CCGGCTACCG CACCAAAGTG
GCCGTCATCA GCCATGACGA CAAAGTAGAT CCGGTAGGGG CATGCGTAGG CATGCGCGGC
GCCCGCGTCA AAAACATCGT CCGGGAGCTC AACAATGAAA AAGTGGACAT CCTGGAATGG
ACGGAAGACC CCGTCACCTT CGTCCGGGAA GCTCTCAGCC CCGTGGAACC GCGGGAAATC
ACCGTGGACG AGGAAGCCAG AAAAATCTTC GTTATCGTCC AGGACGACAA AGACCTCTCC
AAGGCCATCG GCCGCAGGGG CCAGAATGCC CGCCTCACCT CCCGCCTGAT GGGCTGGGAT
GTCCAGGTGC GCGTCTTTGA TGTCCAGGAA GCGGAAAAAC GCCAGAGCCA GGCTGCGGCC
GAAGAAGTCA TGCGCCAATG CCAGGCTGCG GCCAAAACCC TCAGCGAACA ATTGGAAATC
CCGGAAGAAA CCGCCATGGG CCTGGTGACC ATGGGCGGAA CGGACCTGGT GGCCCTCACC
GGATTTGAAG CTTCCGACAT CGCGGAAAGC ATGGGCATTC CCGCAGAGGA AGCCGCCCAA
ATTCTGGACA AGGCCCGGGA CCTTATCTCC CAATAA
 
Protein sequence
MTNDIKALID YYEREKGLSR EKILLALESA FLSAYRKMVP GSGSINYLRA EINVDKGKVR 
IFADLEVVPD EEYSDKFNQI PLSLAVKLDK NAVLHDLLPT NITPKGFGRI AVQTARQTML
QKLLDAEKEM LYDEFKDRAG DLVTGTIRRF EKGDIFVDLG KFEGVMTSRE RVPNEDYSVG
DRMRFYVVEV RTEARGPEVI LSRSHPNLVR RLFESEVVEI GDQTVEIHGI AREAGYRTKV
AVISHDDKVD PVGACVGMRG ARVKNIVREL NNEKVDILEW TEDPVTFVRE ALSPVEPREI
TVDEEARKIF VIVQDDKDLS KAIGRRGQNA RLTSRLMGWD VQVRVFDVQE AEKRQSQAAA
EEVMRQCQAA AKTLSEQLEI PEETAMGLVT MGGTDLVALT GFEASDIAES MGIPAEEAAQ
ILDKARDLIS Q