Gene Amuc_1096 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1096 
Symbol 
ID6274005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp1308450 
End bp1309874 
Gene Length1425 bp 
Protein Length474 aa 
Translation table11 
GC content57% 
IMG OID642613147 
Producttranscription termination factor Rho 
Protein accessionYP_001877703 
Protein GI187735591 
COG category[K] Transcription 
COG ID[COG1158] Transcription termination factor 
TIGRFAM ID[TIGR00767] transcription termination factor Rho 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000220234 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value0.0000128314 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACTGAAG AACTCGACAA CACCCCTTCC CCCGCCGGAG ATATTCCGCA GGAAACCCTC 
CCCAAGCCCC TGCCCGCGCC GGAAGAAACC GCCGGGGAAC AGCCGGCTGC CGCTCCGGAG
GAAAATAGAG GAAACGCAGT CCGGGAGGAG GAGGAAGCAG CCCCTGTTCT GGAGCAGATT
GACATCAATG AACTGCGGAA ACGCCCCCTG AACGATCTTC AGGAAATGGC GGAAGGGCTG
CCCATCCGGA ACGCGGCCTC CCTCACCAAG TCCCAGCTGG TTTTTGAATT GGGGAAACAG
CTTCTGGCAA AGGGCCATGA AGTCGTAGTT TCCGGGGTCA TGGAACAAGC CAAGGATAAT
TACGCCATGC TGAGGGATCC GGTGAAAAGT TTCCGCACCT CTCCGGATGA TATTTATCTG
GGCGGCAATC TCATCAAACC CCTGCATCTC CGCGTAGGCC AGCAGATCAA GGTCAGGCTG
CGCAAATTGC GGCCTCATGA CAAGTACCTT TCTGCAGCCT CCGTCATCAG CGTGGAGGAC
ATCCCTGCGG AAGACTACCG GGCGCGCAGC GATTTTGAAC GCCTCACCCC CCTCTTCCCC
AAGGAACGCC TCCTTCTGGA AAACAAGGGG GTCAATTCCG CCGCCATGCG CGTGCTGGAC
CTCATGACGC CCTTCGGCAA AGGGCAGCGC GGCCTGATTG TGGCCCCCCC GCGCGGAGGA
AAAACCGTTC TTCTGAAAAC AATCGCCCGT TCCATCAGGG CCAATTATCC GGAAGTGGAA
CTGATTGTGC TGTTGCTGGA CGAACGTCCG GAGGAAGTAA CGGATTTTGA AGAAACCGTG
GATGCTCCGG TATTCGCCTC CACTTTTGAC GAACCTTCCC GGCGCCATGC CCAGGTTTCC
GATCTGGTTA TCGAACGGGC CAAACGCCTG GTGGAAATGG GCAGAGACGT CGTCATCCTG
CTGGATTCCC TCACCAGGCT GGCCCGCGGC TACAATGCCA ACCAGACGGG AGGACGCATC
ATGTCCGGCG GCCTGGGGTC CAATGCATTG GAAAAACCGC GCAAATTCTT TTCCGCGGCG
CGCAATGTGG AAGAAGGAGG CAGCCTGACC ATCATCGCCA CATGCCTGGT AGACACGGAA
TCAAGAATGG ACGAAGTGAT TTTTGAAGAA TTCAAGGGAA CGGGCAATCT GGAAATCCGC
CTGGACCGGG AACTTTCCGA ACGGCGCATT TATCCGGCCA TTTCCCTTTC CCAGAGCGGC
ACCCGCAATG ACGACAGGCT GTATAACGAA CAGGAATTCG TCAAAATCAT GCAATTGCGC
CGCCAGCTCG CCATGAAACC GGGCTGGGAA GGCCTTCAGA CTCTCCTGCA AAATATCTCC
AAGACACAGA ATAACGCGGA ACTTCTGCTG ACGGGGCTGC GGTAA
 
Protein sequence
MTEELDNTPS PAGDIPQETL PKPLPAPEET AGEQPAAAPE ENRGNAVREE EEAAPVLEQI 
DINELRKRPL NDLQEMAEGL PIRNAASLTK SQLVFELGKQ LLAKGHEVVV SGVMEQAKDN
YAMLRDPVKS FRTSPDDIYL GGNLIKPLHL RVGQQIKVRL RKLRPHDKYL SAASVISVED
IPAEDYRARS DFERLTPLFP KERLLLENKG VNSAAMRVLD LMTPFGKGQR GLIVAPPRGG
KTVLLKTIAR SIRANYPEVE LIVLLLDERP EEVTDFEETV DAPVFASTFD EPSRRHAQVS
DLVIERAKRL VEMGRDVVIL LDSLTRLARG YNANQTGGRI MSGGLGSNAL EKPRKFFSAA
RNVEEGGSLT IIATCLVDTE SRMDEVIFEE FKGTGNLEIR LDRELSERRI YPAISLSQSG
TRNDDRLYNE QEFVKIMQLR RQLAMKPGWE GLQTLLQNIS KTQNNAELLL TGLR