Gene Mmwyl1_1075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMmwyl1_1075 
Symbol 
ID5368782 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMarinomonas sp. MWYL1 
KingdomBacteria 
Replicon accessionNC_009654 
Strand
Start bp1199487 
End bp1200554 
Gene Length1068 bp 
Protein Length355 aa 
Translation table11 
GC content43% 
IMG OID640803416 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_001339941 
Protein GI152995106 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.347968 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000316838 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCAGTTC TATGGGTATT AGGAGCAGGT CAATTAGGCG CTATGCTAAA ACAAGCAGGA 
ACGCCGCTTG GCATTGATGT GCGTCCAGTA GATATTGAGT CAACCGAAAC CTTAGCGTTA
GCTCCAACTG ACATTGTGAC AGCAGAAAGA GAAGAATGGC CAGAAACCAT CGCCACCAAA
CAACTTGCTA CTCACAGTAA TTTTGTCAAC CTAGCCACCT TTCCACAACT TGCAGATCGT
CTAACCCAGA AACAATGGTT AGATCGTCTT GAGCTAGCGA CAGCACCATG GTTTCCGGTT
GAGATTGACT CTTCTGCAAC ACACTCCTAT GAAACATTAG GTGAACGCGT TCTGATGAAA
CGTCGTCGAG GTGGTTATGA TGGCAAAGGT CAATATTGGT TAAAACAATC TGAGGGCATT
GAGATACCTG AAGATTGGAA AGGCCAAGCT ATTGCAGAAC AAGCTATTAA TTTTGATGAA
GAAGTTTCCT TAGTCGGTGT TCGAGGCAAA AATGGTGAGA CACACTTTTA CCCGCTAACA
TTGAACCTTC ACATTAATGG CATCCTATAC GCATCCATTT CTCCATTAGA GCGCCTAAAG
CCTTTGCAAA GCAAAGCTGA AGCAATGCTT AGCAAGCTTA TGGAGGCTTT AGACTACGTC
GGCGTAATGG CGATGGAGTG TTTCCGTGTA GGTGATGAGC TACTCATTAA TGAGCTTGCT
CCAAGAGTCC ATAACAGCGG CCATTGGACA CAAGCAGGTG CAAGCGTATG TCAATTTGAA
AACCACGTAC GTGCTGTCAC AGGACTTCCA TTAGCGCCAG CTGAAGTCAA GAATCAAAGC
ATGATGGTCA ATTTAATTGG TGTCGATCTA AACTATGACT GGCTAAACGT ACAAGGCTTA
GAACTTTATT GGTACAAAAA GGAAGTTCGC CCTGGAAGAA AAGTTGGCCA TCTGAATTTT
TGTTCTGGTA GCTACTCAGT ATTAGAGTCA GCATTAACAA AACTAGATCT TCCACAACCT
TATCCGGAAG CTTTAGAGTG GTTAGCTAAA AACTTACCCA AGTTATAA
 
Protein sequence
MSVLWVLGAG QLGAMLKQAG TPLGIDVRPV DIESTETLAL APTDIVTAER EEWPETIATK 
QLATHSNFVN LATFPQLADR LTQKQWLDRL ELATAPWFPV EIDSSATHSY ETLGERVLMK
RRRGGYDGKG QYWLKQSEGI EIPEDWKGQA IAEQAINFDE EVSLVGVRGK NGETHFYPLT
LNLHINGILY ASISPLERLK PLQSKAEAML SKLMEALDYV GVMAMECFRV GDELLINELA
PRVHNSGHWT QAGASVCQFE NHVRAVTGLP LAPAEVKNQS MMVNLIGVDL NYDWLNVQGL
ELYWYKKEVR PGRKVGHLNF CSGSYSVLES ALTKLDLPQP YPEALEWLAK NLPKL