Gene Amuc_1930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAmuc_1930 
Symbol 
ID6275270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAkkermansia muciniphila ATCC BAA-835 
KingdomBacteria 
Replicon accessionNC_010655 
Strand
Start bp2342372 
End bp2343382 
Gene Length1011 bp 
Protein Length336 aa 
Translation table11 
GC content55% 
IMG OID642613990 
Producttransport system permease protein 
Protein accessionYP_001878524 
Protein GI187736412 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4605] ABC-type enterochelin transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.276463 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.0688299 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGATT CCATTCAACA TCAAAAAGAT GATGCCTTCC GCAATGCTGC CGGCCGGCGT 
CGCGCGTTCC CGGCCTTCCT GATTCTGGGA CTGCTCTGCG TGGGACTGGC GCTCGTATAT
GTCTTTCAGG GGATGACACC CGAAACGTGG GACTTTAACA TGGCCCGCCG CATTCCCATC
GTTATTGCGT TGGTGCTGGT TGGAACGGCC GTGGGGCTGT CTTCAGTCGT CTTCCAGACG
ATCACAACCA ACTATATCCT CACGCCCAGC GTGATGGGGC TCGATAACCT CTACGTATTA
CTGCAGACGC TCGTGCTCTA CTTCGTGGGC AGCACACAAT TGACGACCAT GCAGAGCCCG
CTCTGTTTCA TGGGAGCCCT GCTGCTGATG GTCTGCGTAT CTACGGGTAT TTTTTTCTAC
ATGTTCCGTG GACAGAATGG CGGCAATATT TATTTTGTGG TGTTGGTAGG CATGATCTTC
GGCATAACCT TCGGAGGCTT GTCGAACTTC ATGCAGGTGC TGATAGATCC GAGCGAATTT
GCCATACTTG AGGGGCGCCT TTTCGCCAGT TTCAACCGCA TCAATGAAGA ACTGCTGCTT
ACGGCAGGGC TTGTGATCGC CGCGGCGGTC ATCTGGCTGG TTTGCGACCT CAGGAAGCTC
GACGTGCTTA CGCTGGGCCG CTCCACGGCC ATTACGCTGG GCGTGAACTA CAAATGGGTG
GTGCTGCGCT CCCTGATAAT CGTCTCTATT CTGGCCTCGG CCTCGACGGT GCTCGTAGGG
CCGGTGACTT TCCTGGGCAT TCTCATCGTA AGCATTGCAC GCTTCATATT CCCGACCTAC
CGCCACATCG TCCTCATGCC CGGCACGGCT CTCGTGGGCG TAGCCGCATT GACTTTCGGC
ATGCTGCTTA CCGAACGGTG GCTCAACTTC TCCGTGCCCC TGAGCGTAAT CATCAATTTC
GTTGGCGGGG TTTACTTTAT CTACCTGATC ATGAAAATTA AACGTATATG A
 
Protein sequence
MPDSIQHQKD DAFRNAAGRR RAFPAFLILG LLCVGLALVY VFQGMTPETW DFNMARRIPI 
VIALVLVGTA VGLSSVVFQT ITTNYILTPS VMGLDNLYVL LQTLVLYFVG STQLTTMQSP
LCFMGALLLM VCVSTGIFFY MFRGQNGGNI YFVVLVGMIF GITFGGLSNF MQVLIDPSEF
AILEGRLFAS FNRINEELLL TAGLVIAAAV IWLVCDLRKL DVLTLGRSTA ITLGVNYKWV
VLRSLIIVSI LASASTVLVG PVTFLGILIV SIARFIFPTY RHIVLMPGTA LVGVAALTFG
MLLTERWLNF SVPLSVIINF VGGVYFIYLI MKIKRI