Gene Synpcc7942_1829 details


Gene Information

Locus tag: Synpcc7942_1829
Symbol:
ID: 3774404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus elongatus PCC 7942
Kingdom: Bacteria
Replicon accession: NC_007604
Strand:
Start bp: 1898205
End bp: 1898975
Gene Length: 771 bp
Protein Length: 256 aa
Translation table: 11
GC content: 62%
IMG OID: 637800270
Product: 1-(5-phosphoribosyl)-5-[(5-phosphoribosylamino)methylideneamino]imidazole-4-carboxamide isomerase
Protein accession: YP_400846
Protein GI: 81300638
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0106] Phosphoribosylformimino-5-aminoimidazole carboxamide ribonucleotide (ProFAR) isomerase
TIGRFAM ID: [TIGR00007] phosphoribosylformimino-5-aminoimidazole carboxamide ribotide isomerase
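
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS length includes the stop codon (the gene sequence on this page does end in TAA). The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids encoded by an unspliced CDS that includes its stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1898205, 1898975)
print(length)                  # 771
print(protein_length(length))  # 256
```

Both values match the Gene Length (771 bp) and Protein Length (256 aa) fields of the record.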


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.314161
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTAA TCCCAGCCAT TGACCTCCTC GGAGGGCAGT GCGTACGCCT TTTCCAAGGG 
GATTACGATC AGGCGGAAGT CTACGGCAAA GATCCCGTGG GAATGGCCTT GCGCTGGGCC
GAAGCCGGAG CCCAGCGTCT GCACTTGGTC GACCTAGATG GTGCCAAGGA GGGTTCACCT
GTCAATGCGG AGGCGATCGC AACGATCGCG CAACGGCTCT CAATTCCGGT TCAAGTGGGG
GGCGGCCTGC GCGATCGCGA CACCGTGGCT CGCCTGCTTG ACAGCGGGGT TGAACGGGCC
ATTCTCGGGA CTGTGGCGGT CGAAAGGCCA GCGCTCGTTG AAGCCTTAGC TGGGGAATTT
CCGGGTCAGA TTGCCGTGGG GATCGATGCC CGCAGCGGCA AAGTCGCCAC AAGGGGCTGG
CTGGAAGATT CTGGGCTCAC AGCCGTTGCA CTGGCACAGC AGATGGCAGA CTTGGGCGCT
TGCGCACTGA TCTGCACCGA CATTGGGCGA GATGGCACGC TTCAAGGTCC GAACCTGGAG
GAATTGCGGG CGATCGCGGC TGCAGTCTCC ATTCCGGTCA TTGCGTCGGG TGGTGTCGGA
TCGCTAACTG ATCTCCTCAG CTTGCTGCCC CTCGAGGCCC AAGGGGTGAG CGGCGTGATC
GTTGGCAAAG CTCTCTATAC CGGTGCCGTC GATCTCCAAG AGGCCCTACG GGCGATCGGT
TCAGGACGCT GGCAAGATGT GGCCGTGGAT GATTCCTCCC GCTTGGCTTA A
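
The GC content field in the record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming the sequence is pasted as plain text in the 10-base blocks shown on this page. The helper name `gc_content` is hypothetical.

```python
def gc_content(sequence: str) -> float:
    """Fraction of G and C bases in a DNA sequence, ignoring whitespace."""
    # Remove the spaces and line breaks used for 10-base block formatting.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Usage: paste the full gene sequence between the quotes to check the record's value.
first_line = "ATGGACGTAA TCCCAGCCAT TGACCTCCTC GGAGGGCAGT GCGTACGCCT TTTCCAAGGG"
print(f"{gc_content(first_line):.0%}")
```

The example call covers only the first 60 bases, so its result is not expected to equal the whole-gene figure.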
 
Protein sequence
MDVIPAIDLL GGQCVRLFQG DYDQAEVYGK DPVGMALRWA EAGAQRLHLV DLDGAKEGSP 
VNAEAIATIA QRLSIPVQVG GGLRDRDTVA RLLDSGVERA ILGTVAVERP ALVEALAGEF
PGQIAVGIDA RSGKVATRGW LEDSGLTAVA LAQQMADLGA CALICTDIGR DGTLQGPNLE
ELRAIAAAVS IPVIASGGVG SLTDLLSLLP LEAQGVSGVI VGKALYTGAV DLQEALRAIG
SGRWQDVAVD DSSRLA
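
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below is illustrative only: it builds the standard codon assignments (translation table 11 uses the same codon-to-amino-acid mapping as the standard code and differs in which start codons are permitted) and translates until the first stop codon. The function name `translate` is hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used for 10-base block formatting.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
print(translate("ATGGACGTAA TCCCAGCCAT TGACCTCCTC GGAGGGCAGT GCGTACGCCT TTTCCAAGGG"))
# MDVIPAIDLLGGQCVRLFQG
```

The output matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence should give the full protein in the same way.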