Gene Amuc_0051 details


Gene Information

Locus tag: Amuc_0051
Symbol:
ID: 6275121
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 68018
End bp: 69757
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 59%
IMG OID: 642612094
Product: single-stranded-DNA-specific exonuclease RecJ
Protein accession: YP_001876678
Protein GI: 187734566
COG category: [L] Replication, recombination and repair
COG ID: [COG0608] Single-stranded DNA-specific exonuclease
TIGRFAM ID: [TIGR00644] single-stranded-DNA-specific exonuclease RecJ
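
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded, assuming an unspliced CDS with one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Each codon is 3 bp; the final (stop) codon encodes no amino acid.
    return gene_len_bp // 3 - 1


length = gene_length(68018, 69757)
print(length, protein_length(length))  # prints "1740 579"
```

Under these assumptions the computed values agree with the Gene Length (1740 bp) and Protein Length (579 aa) fields of this record.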


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.458674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 46
Fosmid unclonability p-value: 0.205446
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCCGG GTTTTCATTG GACTCTACGC CCTTCCGTGA AGGAGGATGA TCCGGTCTTG 
AAAGCGTTTC CTGCGGAACT GCCCCTGCTG GTCAAGCAGC TCCTGCTGCA GCGCGGCTTC
ACCGGGGGAA TGGAAACGGA ACTCTTTCTG GACCCCAGGC TCTCCCACCT GAGCGACCCT
TTCCTGATGG GGGAAATGAG GGCTGCCGTG GACCGCATTT TCCGGGCCGT GGATGAAGGG
GAAACCGTGT GCATTTATGG GGACTATGAT GTGGACGGTG TCACCTCCGT GGCGCTTCTC
CGGGCCATTT TGATGGCTTA TGACCTGGAT CCCCAGTATT TTATTCCCGT CCGTTCCCGG
GAAGGCTACG GACTCAGCGA GGCCGGCATC AAGCGTTGCC TTTGCGAATG TGCGGACCCT
CCCAGCCTGC TTATTACGGT GGACTGCGGC ACTTCCTCCG TGAAGGAGGT GGACATGCTT
AACAGTCTGG GGATAGACGT AATCATTCTG GATCACCATG AGGCGGGCCC TTTGGGGCGT
CCGGATGCAG CGGCGGTGGT CAATGCCAAA ATTGAGGAAA ACAGCCCATA TACTTATCTG
TGCAGCGCGG GCGTAGTCTT CAAGCTGGCG CATGCCCTCC TGAAGGAGCG GAAACTGAAA
ACCTTTGACC TGAAACTTTA TCTGGACCTG GTGGCCGTGG CAACCGTGGC GGACATTGTT
CCCCTGGTGG CGGAAAACAG GATTCTGGTT CGCCACGGCC TGGGCAGGCT GGCGCACAGC
CGCCATACGG GCCTCAAAAC CCTGACGGAA ATAGCGGGCA TCCGACCTTC CGACTCCGTC
AACCACGCGG GCTTCCTGAA CGCCGCCCAC GTGGGGTTCA GGATAGGCCC GCGCATTAAT
GCCGCCGGGC GCATGGATTC CCCCATGGAT GCGTTGGAAC TCCTGCTGAC CATGGATGCC
AGGAGGGCCG TGCAGCTTGC CCAAATGCTG GATTCCCACA ACCGCAAGCG GCAGGAGGAA
GAGGAGGCCA TCCGGACGGA TGCGGTGGAA ATGCTTCATA ACTCCTTTGA CCCGGAGAGG
GATAACGTCA TTGTGCTGGG TTCCCGCGCG TGGCATCCCG GCGTGGTGGG CATTGTAGCC
TCCCAGCTGA TGAGGCGGTA CCACAAGCCG ACCTTCGTCA TCGCCTTTGA CGAAAGCGGC
GTGGGGAAGG GCTCCGGCCG TTCCATTCCC GGCGTGTCCC TGGTGCAGGC CATTCACCAT
TGTGCGGATA CGCTGGTTTC CGGCGGCGGC CATGACATGG CGGCGGGGCT GGTGATTGAA
GAATCCCGCA TGGACGATTT CCGCCGGGCA TTCAACCGTT ATGTGTCTGA AACCACGACG
GAGGAACAGC GCAGCCCCGT GCTCAACATA GACATGGAAG TGTCCTTCCA GGCGTTGACG
CTGGATTTGC TGGACAGCTA TGAAAAGCTG GAACCTTTCG GCAACTCCAA TCCACAGCCT
CTCTTCATGA GTTCCGACGT TTTTCCCACG GAACCGCCCA AGCGCGTGGG AACCAATCAT
CTGAAGCTTT TCATGCGCCA GGGCATCGTG GAGCGTGACG CTATTTTCTT CAACGGAGCG
GAACGGGAGC TTCCCAATCC TCCCTGGGAT ATCGCTTTCA CGATTGACCG CAACGTGTAC
CGCGGCCGCG CCTCCCTGTC CATTTCCATT CAGGAAATAC GTTCCCACCG GGAAATGTAG
 
Protein sequence
MTPGFHWTLR PSVKEDDPVL KAFPAELPLL VKQLLLQRGF TGGMETELFL DPRLSHLSDP 
FLMGEMRAAV DRIFRAVDEG ETVCIYGDYD VDGVTSVALL RAILMAYDLD PQYFIPVRSR
EGYGLSEAGI KRCLCECADP PSLLITVDCG TSSVKEVDML NSLGIDVIIL DHHEAGPLGR
PDAAAVVNAK IEENSPYTYL CSAGVVFKLA HALLKERKLK TFDLKLYLDL VAVATVADIV
PLVAENRILV RHGLGRLAHS RHTGLKTLTE IAGIRPSDSV NHAGFLNAAH VGFRIGPRIN
AAGRMDSPMD ALELLLTMDA RRAVQLAQML DSHNRKRQEE EEAIRTDAVE MLHNSFDPER
DNVIVLGSRA WHPGVVGIVA SQLMRRYHKP TFVIAFDESG VGKGSGRSIP GVSLVQAIHH
CADTLVSGGG HDMAAGLVIE ESRMDDFRRA FNRYVSETTT EEQRSPVLNI DMEVSFQALT
LDLLDSYEKL EPFGNSNPQP LFMSSDVFPT EPPKRVGTNH LKLFMRQGIV ERDAIFFNGA
ERELPNPPWD IAFTIDRNVY RGRASLSISI QEIRSHREM
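
The sequences above are displayed in space-separated groups of ten characters, so they need to be joined before any downstream use. The following is a minimal sketch of how that could be done, along with a GC percentage calculation of the kind summarized in the GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the sketch assumes the sequence contains only unambiguous A, C, G, and T bases.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence (assumes A/C/G/T only)."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First displayed row of the gene sequence, copied from the record above.
first_row = "ATGACGCCGG GTTTTCATTG GACTCTACGC CCTTCCGTGA AGGAGGATGA TCCGGTCTTG"
seq = clean_sequence(first_row)
print(len(seq))          # prints 60
print(gc_percent(seq))   # GC percentage of this 60-base row only
```

To check the whole gene, paste all rows of the gene sequence into one string and pass it through the same two functions; the cleaned length should then match the Gene Length field.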