Gene Amuc_1780 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1780 
Symbol 
ID6274531 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2168373 
End bp2169263 
Gene Length891 bp 
Protein Length296 aa 
Translation table11 
GC content58% 
IMG OID642613843 
Productprotein of unknown function DUF58 
Protein accessionYP_001878379 
Protein GI187736267 
COG category[R] General function prediction only 
COG ID[COG1721] Uncharacterized conserved protein (some members contain a von Willebrand factor type A (vWA) domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones49 
Fosmid unclonability p-value0.226086 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATAAAG CCTCAGACAT TCTCAAGCGC GTACGCCGCA TTGAACTGCG CGCCAGGCAT 
CTGGCCACGG AAAACTTCGC CGGGCAATAC CAGTCCGGCT TCCGCGGACA GGGGCTGGAC
TTTGACGACT TCCGGGAATA CATGCCGGGA GATGACCCCC GCTTCATTGA CTGGAAGGTA
ACGGCCAGGA TGAACTCCCC TTTTGTCCGC CGTTTCCGGG AGGAACGGGA ACAGGCCGTC
ATTCTGGCGG TGGACGTCAG CGGCTCCATG CACTACGCCT CCTCCGCGGC CCGCGTCTCC
AAACTGGACT ATGCGGCGGA AGTAGCGGCA GTGCTCGCCT ACAGCGCTGC CCAGAGCGGA
GACAAATGCG GCCTCCTTAT CTACGGGAAC AGCCACTCCC ATTACATCCC CCCGGCCAAG
GGAGTCAAGC AGACCCTGCG CATCGTCCGC GAAATCGTAG CCAGTAAAAA CGATGGAGCC
GACCAGAACA TTTCCGATGT AGCCCGGCAA CTTGTCCTTT CCCAGAAAAA AGCGGCCATG
GTCATCATGA TCAGTGACTT TTGGGGTGAG AACAATAAAG CCGCCCTGGG GCAGCTCAAC
TTCAAGCATG ACTTCATCCC CATCCGCATC GCAGACCCGA TGGAACTGCA TCTGCCGGAT
GCCGGACGCG TCATCCTGAA AGATCCGGAA ACTGGCAAAA GCATGTTCCT GAACCTTTCC
CGTCAGGATG TCCGGGAAAC CCACGCCAAC GTCGTTCATC TGCACCGCGA GAAATGGACG
CAGGATTTCC GCCGCCTGGG CATTGACTTC CTGGACTTGC AGACCACAGA CAACTTCATG
CCTCCCCTCC GGGCCCTTTT TGCCAGAAGA TCCCGTAAAT TTTCACGCTA A
 
Protein sequence
MDKASDILKR VRRIELRARH LATENFAGQY QSGFRGQGLD FDDFREYMPG DDPRFIDWKV 
TARMNSPFVR RFREEREQAV ILAVDVSGSM HYASSAARVS KLDYAAEVAA VLAYSAAQSG
DKCGLLIYGN SHSHYIPPAK GVKQTLRIVR EIVASKNDGA DQNISDVARQ LVLSQKKAAM
VIMISDFWGE NNKAALGQLN FKHDFIPIRI ADPMELHLPD AGRVILKDPE TGKSMFLNLS
RQDVRETHAN VVHLHREKWT QDFRRLGIDF LDLQTTDNFM PPLRALFARR SRKFSR