Gene Syncc9902_1052 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_1052 
Symbol 
ID3742475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1016752 
End bp1017951 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content54% 
IMG OID637771227 
Productphosphoribosylaminoimidazole carboxylase ATPase subunit 
Protein accessionYP_377060 
Protein GI78184625 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0026] Phosphoribosylaminoimidazole carboxylase (NCAIR synthetase) 
TIGRFAM ID[TIGR01161] phosphoribosylaminoimidazole carboxylase, PurK protein 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.19177 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAGTG CTGAGCCAGG GATGTCCTCT TCGGACACGA TGATCGGTGT AGTGGGTGGC 
GGTCAGCTGG CGAGAATGCT GGTGCAAGCT GCTTCTGCGC GTTCGATACC GATCGCAGTC
CAAACCATCT CCGCGAGTGA CCCTGCTGCT GAGGCTGCGG CCCGCGTTGT GGAAGCTGAT
CCCCGTGATG TAGCAGGAAC GCGCGAGTTG GTGGAGGGTT GCGAAGGCAT CACTTTCGAA
AATGAGTGGG TCAATATCGA CGCGTTGATT CCCCTCGAAC AGCAGGGCGT TCGCTTTCGT
CCCTCTCTGG CGTCTCTTGC TCCACTGGTC GACAAGTTGT CTCAGCGAAA GTTGTTGGAC
GACTTAGCGA TACCCAGTCC ACCTTGGTGT CCCCTAAGCC TGATTTCTCC AGCGCAACCA
TCTCTACCTC CGGGATGGAA TTTTCCGGTG ATGGCGAAAG CGGCCCGCGG TGGATACGAC
GGAAAGGGAA CGATTGTTCT GAAAAATATT GATGCTCTGG CTCAGTTGCT CCGATCCGTC
GATATTTCCG ATTGGCTTTT AGAAACGTGG GTGCATTACG AGCGTGAGCT GGCCCTTGTG
GTGAGCCGAG ATTCCCAGGG TCGTCTTCGG AGCTTCCCAC TGGTGGAAAC GCACCAACAT
GATCAAGTTT GCAACTGGGT TTTGGCACCA GCAGGAGTGG ACCAGGATGT TGAAGCTCTC
GCTTACAACG TTGCTGCTTC CTTGCTCACC AAATTGAATT ACGTGGGTGT GTTGGCCCTT
GAATTCTTTT ATGGACCTGC CGGTTTACAG GTGAATGAGA TTGCGCCTCG TACCCACAAC
TCTGGTCATT TCTCAATCGA AGCTTGTACC AGCAGCCAGT TTGATCAACA AGTGTGCATT
GCAGCGGGTC TTCCCGTACC TACGCCAGAA CTGAGGAGTG ATGGCGCATT GATGGTGAAT
CTTTTGGGCC TTAATCCAAC CCAAGCTGCC CCGCTCGAGC AGAGATTGAC TGCGCTTCGT
GAAATCCCCA ATGCACATCT TCATTGGTAT GGAAAATCAC CTGAAACCCC AGGCCGCAAA
CTCGGCCACA TCACTGTGTT GTTGAACGCC AGTGATGCAG AACATCGTGA TCGGCAAGCG
AAGGACGTTT TGACTGTTGT GCGAGGAATA TGGCCCGAGT TTCCCTCAGT TCAGGACTAA
 
Protein sequence
MSSAEPGMSS SDTMIGVVGG GQLARMLVQA ASARSIPIAV QTISASDPAA EAAARVVEAD 
PRDVAGTREL VEGCEGITFE NEWVNIDALI PLEQQGVRFR PSLASLAPLV DKLSQRKLLD
DLAIPSPPWC PLSLISPAQP SLPPGWNFPV MAKAARGGYD GKGTIVLKNI DALAQLLRSV
DISDWLLETW VHYERELALV VSRDSQGRLR SFPLVETHQH DQVCNWVLAP AGVDQDVEAL
AYNVAASLLT KLNYVGVLAL EFFYGPAGLQ VNEIAPRTHN SGHFSIEACT SSQFDQQVCI
AAGLPVPTPE LRSDGALMVN LLGLNPTQAA PLEQRLTALR EIPNAHLHWY GKSPETPGRK
LGHITVLLNA SDAEHRDRQA KDVLTVVRGI WPEFPSVQD