Gene Amuc_1070 details

Gene Information

Locus tag: Amuc_1070
Symbol:
ID: 6274040
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1277393
End bp: 1278406
Gene Length: 1014 bp
Protein Length: 337 aa
Translation table: 11
GC content: 45%
IMG OID: 642613121
Product: glycosyl transferase family 2
Protein accession: YP_001877677
Protein GI: 187735565
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
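
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 1277393
end_bp = 1278406

# Assumption: coordinates are 1-based and inclusive at both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not part of the protein.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1014 0 337
```

Under these assumptions the result agrees with the listed gene length (1014 bp) and protein length (337 aa).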


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.0299214
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTCAG CCATTCTTCT TTGTTACAAT CAGAAACGTT TTATCAAGGA ACAGTTCCGG 
GCTATTTTAA AACAGGATTA TCTAGGAGAA TGGGAAATAA TTATTTCGGA CGATTTTTCT
CAGGACGGTT CTTTTGAATG CCTGGAAGAA ATGGTGGAAA AAGAGGGAGA AGGACGGCGT
ATTATTCTCC ACCGTAACGA AAGCAACCGG GGAATTGCAG GGAATTTGCA ATGTGCCGTT
CATTTGTCCC GGGGGGAATG GATTATCAAG TTTGACGGCG ATGACATCGC ACGGGAAGAC
AGGATCTCCT CGTTGGCATC TCTGGCGGAA AAATATCCCG GTCATCTGGT TTACTGCCAT
TCTTATAATG AAATCGATGA AGATGGACAA CCCGCTTATG GACGCATGTT GCCAGATTCA
GATTCCGTCG TCGTCAAACC CTACAGGGAA TGTATTTTTG ACATTTCCCA TGTTTACAGC
TGTTTTGGCG GAAATGCCAT GTACCACAGG TCTTTGTTCA GCGACTTCGA ATATTTGCCT
TCAGGGGCCG GCATTGCGGA CGATACAATG CTGTCCATGC GCGCCTATTT GAAAAAATCA
GGCATGGTCG CATCCGGCAA ACGGTGTTCG TACTATCGGA GACACAACAG TAATATCTGC
AATTTTAAGA GCGGGAACCC CAGGACAATC CTGATCAAGA GATCGGAATT CCTGATAACA
ACCTGGATAA TGATCATGAA AGAGGTATAC GGCAAGCATA AGTCCGGGGA AATAACATAC
CAGGCCGCCG ATCGCCTGAT GAGGCTGATT CAGGCGGAAC AGAGAAGGCT CCTGCTCTTC
CCCTATGCTT CATTCGATAA CAGCCTTCTT ACCAAACTCA AATGGTTTTG GAACATTCTG
CAATGCAGGC CGAGGCTCTG GCTGGTCAGC ATTCCAAGAT TGCTCCCGTT TTGCCTGCTG
CAAAGGTATT TGAATATTAA AGACCGGATA AAAAGTTTTC CTTTTTTTCA CTAA
 
Protein sequence
MISAILLCYN QKRFIKEQFR AILKQDYLGE WEIIISDDFS QDGSFECLEE MVEKEGEGRR 
IILHRNESNR GIAGNLQCAV HLSRGEWIIK FDGDDIARED RISSLASLAE KYPGHLVYCH
SYNEIDEDGQ PAYGRMLPDS DSVVVKPYRE CIFDISHVYS CFGGNAMYHR SLFSDFEYLP
SGAGIADDTM LSMRAYLKKS GMVASGKRCS YYRRHNSNIC NFKSGNPRTI LIKRSEFLIT
TWIMIMKEVY GKHKSGEITY QAADRLMRLI QAEQRRLLLF PYASFDNSLL TKLKWFWNIL
QCRPRLWLVS IPRLLPFCLL QRYLNIKDRI KSFPFFH
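
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is the standard amino acid assignment (which translation table 11 shares for internal codons; the two differ in the permitted start codons), and only the first 60 bases of the gene sequence are used as sample input.

```python
from itertools import product

# Standard codon assignments in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-base blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the gene sequence listed above.
first_row = "ATGATTTCAG CCATTCTTCT TTGTTACAAT CAGAAACGTT TTATCAAGGA ACAGTTCCGG"
print(translate(first_row))  # prints: MISAILLCYNQKRFIKEQFR
```

The 20 residues produced from the first 60 bases match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field in the record.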