Gene Amuc_1771 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1771 
Symbol 
ID6274798 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2156064 
End bp2157239 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content56% 
IMG OID642613834 
Productaminotransferase class I and II 
Protein accessionYP_001878370 
Protein GI187736258 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000897955 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones51 
Fosmid unclonability p-value0.363557 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCATGA ATTGGCAGAA CAAAATAGCG GAGCAGGTAA GCTCCATACC CCGTTCCGGC 
ATCCGGGAAT TTTTTGACCT GGTCACGGGA CGCACGGATA TCATCTCCCT GGGCGTAGGG
GAGCCGGACT TCGTGACGCC GTGGAATATA CGGGAAGCGG CCATTTACTC CCTGGAAAAG
GGGCACACCT CCTACACTTC CAACTATGGG TTGGAATCCC TGCGCCGTTC CATCGTCAAA
TACGTGGACG GATTCTTCCA TGTCAACTAC GACCCCCTGC GCGAAGTGCT GGTGACGGTA
GGCGTAAGCG AAGCCATAGA TCTCGCTCTC CGTGCCATTC TGAATCCGGG GGACGAGGTT
CTTTATCACG AACCCTGTTA TGTCTCCTAT GCCCCCAGCG TCAATATGGC CTACGGCGTA
GCTACCGCCG TGCCTACAAG CAAAAGGGAT CTTTTCGCCC TGAACCCGGA GTTGCTGGAA
GCGTCCATTA CACCGCGGAC CAAGGTGCTG ATGCTCAACT TCCCGACGAA TCCGACCGGA
GCGGTGGCCC CTGTGGAAAC CCTTCAGGAA ATTGCCCGCA TTTGCATCAG GCACGACCTC
ATCGTGCTGA CGGATGAAAT TTACAGTGAA CTGCGTTATG ACGGCAAGCC GCATGTTTCC
ATAGCTTCTC TGCCGGGGAT GAAGGAACGC ACGCTCCTGC TGCACGGATT TTCCAAGGCA
TTCGCCATGA CGGGGTTCCG GCTGGGGTAT GCCTGCGGTC CGGAACCGCT TATTTCCGCC
ATGATGAAAA TTCATCAGTA TTCCATGCTC TGCGCCCCCA TTACTTCCCA GGAGGCGGCC
ATTGAAGCAT TGGAAAACGG GACATCCGCC ATGTTGAAGA TGCGGGAAAG CTACCGCCAG
CGCCGGGATT ACCTGGTGAA GCGCCTTAAT GAAATCGGCA TGGACTGCCA CCTGCCCGGC
GGCGCGTTCT ATGTCTTCCC GGACATTTCC AGATTTGGCT TGACCAGCAA GGAGTTTGCC
ACCCGGCTGC TGATGGAAAA GCAGGTGGCC GCCGTACCGG GGACCGCCTT CGGCGCAAGC
GGAGAAGGCT TCCTGCGCTG TTGCTATGCG ACCGCCTTTG ACCAGATCAA GGAGGCCTGC
AACCGCATGG AACATTTCGT GGAAACTCTT TCCTGA
 
Protein sequence
MIMNWQNKIA EQVSSIPRSG IREFFDLVTG RTDIISLGVG EPDFVTPWNI REAAIYSLEK 
GHTSYTSNYG LESLRRSIVK YVDGFFHVNY DPLREVLVTV GVSEAIDLAL RAILNPGDEV
LYHEPCYVSY APSVNMAYGV ATAVPTSKRD LFALNPELLE ASITPRTKVL MLNFPTNPTG
AVAPVETLQE IARICIRHDL IVLTDEIYSE LRYDGKPHVS IASLPGMKER TLLLHGFSKA
FAMTGFRLGY ACGPEPLISA MMKIHQYSML CAPITSQEAA IEALENGTSA MLKMRESYRQ
RRDYLVKRLN EIGMDCHLPG GAFYVFPDIS RFGLTSKEFA TRLLMEKQVA AVPGTAFGAS
GEGFLRCCYA TAFDQIKEAC NRMEHFVETL S