Gene Msil_1571 details


Gene Information

Locus tag: Msil_1571
Symbol:
ID: 7091421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylocella silvestris BL2
Kingdom: Bacteria
Replicon accession: NC_011666
Strand:
Start bp: 1699004
End bp: 1700596
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 66%
IMG OID: 643464898
Product: phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
Protein accession: YP_002361883
Protein GI: 217977736
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0138] AICAR transformylase/IMP cyclohydrolase PurH (only IMP cyclohydrolase domain in Aful)
TIGRFAM ID: [TIGR00355] phosphoribosylaminoimidazolecarboxamide formyltransferase/IMP cyclohydrolase
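
The length fields in this record are consistent with its coordinates. The following is a minimal sketch of that arithmetic, assuming that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon. The record does not state its coordinate convention, so both points are assumptions, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of a CDS, assuming the final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(1699004, 1700596)
print(length)                     # 1593
print(protein_length_aa(length))  # 530
```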


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCAAG AATTCCGCCG CGCCGCCCGG GCGCTGATCT CCGTTTCCGA CAAGACCGGC 
CTCATCGACT TCGCGCGCGG CCTCACCACG CTCGGGATCG AACTCATCTC GACCGGGGGC
ACCCATGCCG CTCTGGTCGA GGCGGGGCTG AAGGTCGTCG ACGTCGCCGA CGTCACCGGA
TTTCCCGAAT TGATGGACGG AAGGGTCAAA ACGCTGCACC CCAAGGTGCA TGGCGGTCTT
CTCGCCATTC GCGAAAATCC GGAACACGAG GCGGCAATGC TCGCCCATGG CATCGAGCCG
ATCGACCTGC TCGTCGTCAA TCTCTATCCG TTTCAGGCGA CGATCGACGC GGGCGCCGAC
TTCGACCAGT GCATCGAGAA CATCGACATC GGCGGCCCGG CCATGATCCG CGCCGCCTCC
AAGAATTTCA ACGACGTCGC GGTCGTCGTC GCAATCGAGG ATTACGCGCC TCTGCTTGCC
GAACTCGAGG CCAATGGCGG CGCGACGACC CTGGCGCTTC GGCAAAGGCT GGCGCAGAAG
GCGTTCGCGC GCACGGCCGT CTATGACGCC GCGATCTCGA ACTGGTTCGC AGAGCAGATC
GGCGCGGATG CGCCGGATTT CCGCGCCATC GGCGGCAAGC TCGCGCTGAA CTTGCGCTAT
GGCGAGAATC CACATCAGCA GGCGGCCTTC TACGCCACCG GCGAACGGCG CTATGGCGTT
TCGACCGCCC GCCAGCTCCA GGGCAAGCAG CTCTCCTACA ATAATATCGG CGATACGGAC
GCGGCCTACG AACTCGTCGC CGAATTCGAT CCGGAGCGGA CCGCCGCCGT CGCGATCATC
AAGCATTCAA ATCCCTGCGG CGTCGCCGAG GCCGCGACCC TGGAAGAAGC CTATCGGCTG
GCGCTGCGCT GCGATCCGGT CTCCGCCTTC GGCGGCGTTG TGGCGGTGAA CCGCAAGCTC
GACGCCAAAG CGGCGGCCGA AATCGTGCAA ATTTTCACCG AAGTCATCAT CGCGCCGGAC
GCCGATGAAG CGGCGATCGA GATCGTCGCC GCCAAGAAGA ATTTGCGGCT GCTGATCGCA
GGGGGACTGC CGGATCCCCG CGCGACAGGC CTCTTTGTTC GCCCCGTCGC CGGCGGCTTC
CTGGCGCAGG GGCGCGACAA TGCCGTCGTC GACGATATGG ATCTGCGCGT CGTCACCAAG
CGCGAGCCGA CCGAGGCGGA ATGGAGCGAT CTCACTTTCG CGTTTCGCGT CGCGAAACAT
GTGAAGTCGA ATGCGATCGT CTACGCCAGG TCCGGCGCGA CGGTTGGCGT GGGGGCAGGT
CAGATGAGCC GGGTCGATTC GACGCGCATC GCCGCAATCA AAGCCGCGGA AGCGGCGCGG
CAGGCGGGGC TGCCCGAAAG TCTGGCGCTG CGATCGGTCG TCGCCTCGGA CGCTTTCTTT
CCCTTCGCCG ATGGCGTGGA GACGGCGATC GAAGCCGGCG CGACGGCGCT GATCCAGCCG
GGCGGCTCCC TGCGCGACGC CGAGGTCATC GCCGCCGCCG ACGCGGCAGG GGTGGCGATG
GTCTTCACCG GCGTGAGACA TTTCCGTCAT TGA
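
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field of this record. The helper names are hypothetical, and only the first line of the sequence is used as a short demonstration; pasting the full sequence would give the value for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used only as a short demonstration.
first_line = "ATGTCGCAAG AATTCCGCCG CGCCGCCCGG GCGCTGATCT CCGTTTCCGA CAAGACCGGC"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 2))
```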
 
Protein sequence
MSQEFRRAAR ALISVSDKTG LIDFARGLTT LGIELISTGG THAALVEAGL KVVDVADVTG 
FPELMDGRVK TLHPKVHGGL LAIRENPEHE AAMLAHGIEP IDLLVVNLYP FQATIDAGAD
FDQCIENIDI GGPAMIRAAS KNFNDVAVVV AIEDYAPLLA ELEANGGATT LALRQRLAQK
AFARTAVYDA AISNWFAEQI GADAPDFRAI GGKLALNLRY GENPHQQAAF YATGERRYGV
STARQLQGKQ LSYNNIGDTD AAYELVAEFD PERTAAVAII KHSNPCGVAE AATLEEAYRL
ALRCDPVSAF GGVVAVNRKL DAKAAAEIVQ IFTEVIIAPD ADEAAIEIVA AKKNLRLLIA
GGLPDPRATG LFVRPVAGGF LAQGRDNAVV DDMDLRVVTK REPTEAEWSD LTFAFRVAKH
VKSNAIVYAR SGATVGVGAG QMSRVDSTRI AAIKAAEAAR QAGLPESLAL RSVVASDAFF
PFADGVETAI EAGATALIQP GGSLRDAEVI AAADAAGVAM VFTGVRHFRH
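
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds a codon table from the standard amino acid assignments, which translation table 11 shares with the standard code. It is a simplified illustration: it does not handle the alternative start codons that table 11 permits (not needed here, since this gene starts with ATG) or ambiguous bases, and the names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(blocked_dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(blocked_dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGTCGCAAG AATTCCGCCG CGCCGCCCGG"))          # MSQEFRRAAR
# Last line of the gene sequence, which starts in frame and ends with TGA.
print(translate("GTCTTCACCG GCGTGAGACA TTTCCGTCAT TGA"))      # VFTGVRHFRH
```

Both outputs match the first and last ten residues of the protein sequence listed in this record.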