Gene Amuc_1298 details

Gene Information

Locus tag: Amuc_1298
Symbol:
ID: 6274110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Akkermansia muciniphila ATCC BAA-835
Kingdom: Bacteria
Replicon accession: NC_010655
Strand:
Start bp: 1573952
End bp: 1575679
Gene Length: 1728 bp
Protein Length: 575 aa
Translation table: 11
GC content: 52%
IMG OID: 642613354
Product: sulfate adenylyltransferase, large subunit
Protein accession: YP_001877903
Protein GI: 187735791
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG2895] GTPases - Sulfate adenylate transferase subunit 1
TIGRFAM ID: [TIGR00231] small GTP-binding protein domain
            [TIGR02034] sulfate adenylyltransferase, large subunit
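
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end values are 1-based inclusive positions on the replicon and that the gene length includes the stop codon; the record states neither convention, and the function names are illustrative only.

```python
# Values copied from the gene information fields above.
START_BP = 1573952
END_BP = 1575679
GENE_LENGTH_BP = 1728
PROTEIN_LENGTH_AA = 575


def gene_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for this record under the stated assumptions.
    print(gene_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1575679 - 1573952 + 1 gives 1728 bp, and 1728 / 3 - 1 gives 575 aa, matching the listed values.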


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.307695
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 58
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGGACATCG ACTCATACCT GAACGAACAC GAAAATAAAA GCCTGCTACG CGTACTTACC 
TGCGGTTCCG TGGACGACGG GAAATCTACA CTCATCGGAC GCCTCCTTTA TGACAGCAAA
CTGATTTTTG ACGACCAGCT GGCAGAGCTG CGCAAAGCCA GTGAAAAAAA TGGAACTGCT
GGAGCAGGTA AAATTGATTA CGCCCTGTTG CTGGACGGCC TTAGAGCGGA ACGGGAACAG
GGAATAACCA TTGATGTAGC CTACCGGTAC TTCACCACCC CACGCCGCAA ATTCATCATT
GCCGACTGCC CCGGACATGA ACAATACACC CGGAACATGG CCACCGGAGC TTCCACGGCA
GATGCCGCCA TCATCCTGAT TGATGCTCGC CATGGAGTAC TCACACAAAC GAAACGGCAT
GCGTTCATCG TCTCTCTTCT GAAAATACGG CACCTCATCG TAGCCGTCAA CAAGATGGAT
CTTTTGAAAT ACTCTGAAGA AAAATTCCGG AAAATTGAAG AAGAATTCGG AAGCTTCACG
CAACAGTTGA ATATCCCGGA TGTTCGTTTC GTTCCCATTT CCGCCATTGA AGGGGAAAAT
GTGACGCAAA CAACAGGAAA AACGCCTTGG TACCAGGGCG ATCATCTGCT TTCCATTCTG
GAAACGCTGG ATGCCAGCGA CAGCAGGAAT CTCCGGGATT TCCGCTTTCC GGTACAGACA
GTCATACGGC CCAATCTCGA TTTCCGGGGG TTCGCCGGCT CCATTACCTC CGGCTCCATC
CGCAGGGGTG ATCCTATCGT GACGTTGCCT TCGTTTCAAA ACAGCCGGAT CAAAAGAATC
GTTACTCCGG ACGGAGAACT GGAAGAAGCA TTCTCTCCCC AAGCCGTCGT ATTGGAACTT
GAGGATGAAA TAGATATCAG CAGCGGTGAC ATGATTGTCA AAAAAGGGAA TCTCCCCCAT
ATAGAGGATC GGCTGGAAGC CCGTGTCATC TGGATGTCTG AAAAACCCCT TCTTCCCAGA
AGCAAATACA TTATGCGCCA TGCGGGCAGA AATATCCAGG GGAGGATAGT AGAACTCCAA
TACGACATAG ACGTCAATAC ACTGGAAAGC CGCCATGCAA CGCAGCTTCC TCTGAATCAT
GTCGGCCGTA TCGTCCTGGA AACCAGTTCC CCCTTGTTCT ATGATTATTA CCGGGATAAC
CGTTCCGGAG GAGCTTTCAT CCTGATTGAC CCGCTGAATA ACGTCACAGC GGGAGCTGGT
ATGCTCCGCC CCCCTCACAG AGATAAGGTT CCCGAAAAAG AAAAGGAACA ACTCCAAACA
TTCGTTTCAA GCGATGAACG CGCTGAAACC TTCGGGCATG GTGGAAAACA AATTTACGTA
GCGGGAGAAG ACAGCGAACT GGCACGCAGC TTCGCCAAAC AGCTGGAACG GGAACTCCAT
CGGCTCAAGG CTCATACCTA CGGTCTGGAT TTCAAGGCAG AAGGCGTATG GGGCAGATCC
GCTAGAGAAA TCGTCAATGC CTCAGGCCTG CTGGCCGAAG CGGGGCTCAT GAGCATTGCA
GTGCTGCCGG GCCTTCCCGT CCTGCCCAGA AAAGCAAAGG GAACCTACTG CATCTGGCTT
GGGAATGTCG TCTCCGCGCC AGAAACGGCA GACCGCATCC TCCCCCCTGC GAAAGCAAAC
GAAAATACTG CGTTTCTTCT GGCGCGCACT CTCTATGTGG AATTTTAA
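
The GC content reported in the gene information can in principle be recomputed from the gene sequence. A minimal sketch, assuming GC content means the percentage of G and C bases among all bases and that the spaces and line breaks in the listing are formatting only; `gc_percent` is a hypothetical helper, not part of any database tool.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide string.

    Whitespace is ignored so the blocked listing can be pasted in directly.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


if __name__ == "__main__":
    # Paste the full gene sequence here to compare with the reported value.
    first_block = "ATGGACATCG"
    print(gc_percent(first_block))
```

Pasting the complete sequence into the call gives a figure that can be compared with the rounded percentage in the record.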
 
Protein sequence
MDIDSYLNEH ENKSLLRVLT CGSVDDGKST LIGRLLYDSK LIFDDQLAEL RKASEKNGTA 
GAGKIDYALL LDGLRAEREQ GITIDVAYRY FTTPRRKFII ADCPGHEQYT RNMATGASTA
DAAIILIDAR HGVLTQTKRH AFIVSLLKIR HLIVAVNKMD LLKYSEEKFR KIEEEFGSFT
QQLNIPDVRF VPISAIEGEN VTQTTGKTPW YQGDHLLSIL ETLDASDSRN LRDFRFPVQT
VIRPNLDFRG FAGSITSGSI RRGDPIVTLP SFQNSRIKRI VTPDGELEEA FSPQAVVLEL
EDEIDISSGD MIVKKGNLPH IEDRLEARVI WMSEKPLLPR SKYIMRHAGR NIQGRIVELQ
YDIDVNTLES RHATQLPLNH VGRIVLETSS PLFYDYYRDN RSGGAFILID PLNNVTAGAG
MLRPPHRDKV PEKEKEQLQT FVSSDERAET FGHGGKQIYV AGEDSELARS FAKQLERELH
RLKAHTYGLD FKAEGVWGRS AREIVNASGL LAEAGLMSIA VLPGLPVLPR KAKGTYCIWL
GNVVSAPETA DRILPPAKAN ENTAFLLART LYVEF
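
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hand-rolled illustration, not the database's own method: it builds the standard codon assignments (which translation table 11 shares with table 1; the two differ only in permitted start codons) and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order T, C, A, G with the first base varying slowest,
# matching the layout of the NCBI genetic code listings.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Whitespace is ignored; trailing bases that do not fill a codon are dropped.
    """
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGGACATCG ACTCATACCT GAACGAACAC"))  # MDIDSYLNEH
```

The first 30 bases translate to MDIDSYLNEH, the first block of the protein sequence, and the final line of the gene sequence translates to ENTAFLLARTLYVEF followed by the TAA stop codon.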