Gene Emin_0521 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEmin_0521 
Symbol 
ID6262665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameElusimicrobium minutum Pei191 
KingdomBacteria 
Replicon accessionNC_010644 
Strand
Start bp570975 
End bp571934 
Gene Length960 bp 
Protein Length319 aa 
Translation table11 
GC content42% 
IMG OID642610991 
Productphosphoribosylformylglycinamidine cyclo-ligase 
Protein accessionYP_001875413 
Protein GI187250931 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.343824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value0.588771 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGT ATAAAAAAGC AGGCGTTGAT ATCGTAGCAG GTGATAAGTT TGTTGATTTT 
ATAGCCACAA AAACAAAAGG CATAGGCGGT TTTGCGGGAC TTATGAAAAA CCCCGCCAAC
AAAGAATACA GCATGGTAGC TTCTACGGAC GGTGTAGGAA CAAAGCTTAA ACTTGCTTTT
ATGTTAGGCA AGCATGATAC AATAGGGATT GACCTTGTTG CCATGTGCGT TAACGACCTT
GTGGTTTGCG GCGCTACTCC TTTATTTTTC TTAGATTATT ATGCTACAGG TAAAATAGAT
TTAAAAACAT CCAAACAAAT TATTGAAGGT ATTTTAGAAG GCTGCAGGCA GGCGAACTGC
GCTCTTTTAG GGGGCGAAAC CGCCGAAATG CCGGGCTTTT ATCAAGCGGG CGAATATGAC
CTGGCCGGGT TTTCGGTAGG CATGGTAAAA AATAAAGAAA TTATTGACGG CAAAAAAATT
AAAGAAGGCG ATATTTTGCT TGCTCTTCCT TCAAGCGGCT TTCATTCAAA CGGATATTCT
TTGGTGCGTC ACATTTTTGG CAAAGAGCTT AAAAAATACG CTGAGCAGCT TTTAACGCCT
ACCAAAATTT ACGTTCAGGA AGTTTTAAAA CTTAAAACCG CGCTTGAAAA AGCCAAAATG
CCCATATTGG GAATGGCGCA TATTACGGGC AGCGGCCTTC CGGGCAATGT GCCTAGGTTT
TTGCCCTCGG GTGTTGGCGC TTATTTAGAC ACAAGCAAAT GGCAAGTGCC CGAGATAATG
AATATTTTGC AAAAAAAGGG CAAAATTTCC GCAAAAGAAA TGTATAACAC GTTTAATATG
GGGCTTGGCA TGGTTATTTG CGTTCGCCCG CAGGCTGTTA AAGCCGCTAA AAAAGCGTTG
CCGCAATTAT TAGAAGTAGG CGTTATTGTT AAAGGCGGCG ATAAGGTTAT TTTAGATTAA
 
Protein sequence
MSTYKKAGVD IVAGDKFVDF IATKTKGIGG FAGLMKNPAN KEYSMVASTD GVGTKLKLAF 
MLGKHDTIGI DLVAMCVNDL VVCGATPLFF LDYYATGKID LKTSKQIIEG ILEGCRQANC
ALLGGETAEM PGFYQAGEYD LAGFSVGMVK NKEIIDGKKI KEGDILLALP SSGFHSNGYS
LVRHIFGKEL KKYAEQLLTP TKIYVQEVLK LKTALEKAKM PILGMAHITG SGLPGNVPRF
LPSGVGAYLD TSKWQVPEIM NILQKKGKIS AKEMYNTFNM GLGMVICVRP QAVKAAKKAL
PQLLEVGVIV KGGDKVILD