Gene Syncc9902_1916 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_1916 
Symbol 
ID3743796 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1831933 
End bp1832970 
Gene Length1038 bp 
Protein Length345 aa 
Translation table11 
GC content56% 
IMG OID637772111 
Productphosphoribosylaminoimidazole synthetase 
Protein accessionYP_377917 
Protein GI78185482 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0150] Phosphoribosylaminoimidazole (AIR) synthetase 
TIGRFAM ID[TIGR00878] phosphoribosylaminoimidazole synthetase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.344897 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACTACA AGAGTGCAGG TGTTGATGTA GAAGCTGGAC GGGCTTTTGT TCAGCGCATC 
AAAGCTTCGG TAGAGGCCAC CCACCGCCCA GAAGTCGTGG GAGGTCTTGG CGGATTTGGC
GGCCTCATGC GCCTTCCAAC TGGCCTGCGC AAACCTCTTC TCGTGTCCGG AACGGACGGA
GTCGGCACCA AGCTGGAACT CGCCCAAAAT CATCACTGCC ATCACGGGGT GGGCATTGAT
CTTGTTGCGA TGTGCGTCAA CGACGTGATT ACGTCTGGGG CCGCTCCACT GTTTTTCCTC
GACTACATGG CCACAGGCGC CCTAAGCCCA GCCGCCATGG CCGAAGTGGT CGAGGGAATC
GCAGATGGAT GCCGTCAGAG CGGTTGCGCA CTTCTAGGAG GCGAAACAGC AGAAATGCCC
GGGTTTTATC CCCAAGGGAG ATACGACCTC GCCGGCTTCT GCGTTGCCGT CGTCGAGGAA
GACGACCTCA TCGATGGACG ATCCATTTCC CCGGGGGATC AAATCATCGG CATCGCTAGC
AGTGGTGTGC ACAGCAACGG ATTCAGCCTC GTCAGGAAGG TTTTAGAAAA AGCAGGCATC
AACGAAAACA GCCAATACGG ACCAGACAAC AGACGACTCC TCAACGACCT GCTCGCGCCG
ACAACGCTCT ACGCCTCACT TGTTCAAGAA CTGCTCAGCA ACGCCATCAA GATCCATGGC
ATGGCCCACA TCACTGGCGG GGGATTGCCT GAAAATTTGC CCCGCTGTCT GCCGGAGGGA
ATGACGGCCA AAATCGAGGC TGAGGCATGG CCTCGATCTC CTTTATTTCA GTGGCTGCAA
TCCGCAGGAG CGATTCCAGA ACGTGATCTT TGGCATACGT TCAACATGGG AATCGGGTTC
TGCCTCGTCG TTCCAAAAGA AGCGGAACAA ACTGCATTAG ATGTTTGTCA TTTGAACAAC
CATCAGGCAT GGGTCATTGG TGAAGTGCTG AAGACCCCTC CAGGGGAGCA TTCAGCCTTA
CAAGGGCTGC CCAGCTGA
 
Protein sequence
MDYKSAGVDV EAGRAFVQRI KASVEATHRP EVVGGLGGFG GLMRLPTGLR KPLLVSGTDG 
VGTKLELAQN HHCHHGVGID LVAMCVNDVI TSGAAPLFFL DYMATGALSP AAMAEVVEGI
ADGCRQSGCA LLGGETAEMP GFYPQGRYDL AGFCVAVVEE DDLIDGRSIS PGDQIIGIAS
SGVHSNGFSL VRKVLEKAGI NENSQYGPDN RRLLNDLLAP TTLYASLVQE LLSNAIKIHG
MAHITGGGLP ENLPRCLPEG MTAKIEAEAW PRSPLFQWLQ SAGAIPERDL WHTFNMGIGF
CLVVPKEAEQ TALDVCHLNN HQAWVIGEVL KTPPGEHSAL QGLPS