Gene Syncc9902_1098 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_1098 
Symbol 
ID3743312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp1058147 
End bp1059718 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content56% 
IMG OID637771274 
Productbifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II/unknown domain fusion protein 
Protein accessionYP_377106 
Protein GI78184671 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0108] 3,4-dihydroxy-2-butanone 4-phosphate synthase 
TIGRFAM ID[TIGR00505] GTP cyclohydrolase II
[TIGR00506] 3,4-dihydroxy-2-butanone 4-phosphate synthase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0688555 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGCCATTC GCAATGGAGC CTGCGTGGTC GTCGTGGACG ACGAGCAACG TGAAAATGAA 
GGAGATCTAA TTTGTGCTGC CCAGTTCGCC ACCCCAGAAG CGATCAACTT CATGGCAACG
GAAGCCAGAG GATTGATCTG TCTCGCTATG GAGGGAGACC GGCTCGATGA ACTGGACCTT
CCACTCATGG TGGATCGCAA TACGGATGCC AATCAAACAG CCTTCACCGT CAGCATCGAC
GCTGGTATTG AACATGGCGT CACCACTGGA ATTTCAGCGG ACGATCGGGC TCGCACCATT
CAAGTTGCTC TCAACCCGTC AACACGCCCT GCAGATCTCC GCCGTCCAGG CCATATCTTC
CCCCTCCGTG CACGCTCCGG GGGCGTCCTA AAGCGTGCAG GTCATACGGA GTCGGCTGTT
GATTTATCCC TGTTGGCTGG CCTGAGCCCA GCTGGTGTCA TTTGTGAAAT TCAGAACACC
GACGGCTCCA TGGCACGGCT GCCAGAGCTC AGGGCCTATG CCGACCAATG GGGCTTAAAA
CTCATCAGCA TTGCCGATCT GATTCGCTAC AGACTTGAAA ACGAGCGTTT CGTCAAGCGG
CTAGCGCACG CCGAACTCCC CAGTCAGTTC GGCGCATTTC AAGCGATCGG CTACAAAAAT
GATCTCGATG GTTCGGAACA CGTTGCCCTG GTGAAAGGAG ATCCAGCGTC TTTGAAAGAA
CCGGTTCTGG TGCGAATGCA CTCGGAATGC CTCACCGGTG ATGCTTTCGG ATCACTGCGC
TGCGACTGTC GTCCCCAACT CGAGGCGGCG CTTCGCCAGA TCGAAGCCGA GGGAGAAGGC
GTCGTGGTTT ATCTGCGACA GGAGGGACGC GGCATCGGCT TGATCAACAA ACTGAGGGCC
TACAGCCTTC AGGACGGTGG ACTGGACACC GTTGAAGCGA ATGAGCGATT GGGTTTTCCC
GCCGACCTGC GCAATTACGG GGTTGGAGCA CAAATTTTGT CCGACCTTGG AATCCACAGG
TTGCGCCTAC TCACCAACAA TCCACGCAAA ATTGCTGGAT TGGGTGGATA CGGACTGCAG
GTGGAAGAAC GCGTCCCCCT TGTGATGGAT GCAGGAGACC ACAATGCCGA TTATCTCGCT
GCCAAGCGAG ACAAACTTGG CCACTTACTT GAGGCAGATA CGCCTTGCAC CGTGTTGGCC
ATGGCGGTTC ACGGGCAACC TGACACCTGG CCACAGGTGC GTCGACAGGT CGAGTCAGTG
GCGCACGAAC ATGGATTTCA AATGGATGCG CTCCATGAAC CAAGGCTGCT CGCCCTTTGG
GACAGACCGC AATTCGTTTG GAAAATCAAG CCTGGTGATC AGGATCCATA CCAGTTAATC
CAAGCGTTGG CGAAGGTATC GAGCACGAAG GCCTTGGGCC TCATGCGCGT TCCCAGCGAG
CGGATGGCAC TTCACCCACC CCAAACATTG GAACGCCTCG ATCGAGACCT CTCAGAATTG
GAGTCGGATC AGAGGGCTGG CCTGATCCAG ACCAGCCCGG TGTTGTTGTT TTGGCGTCAA
GGACAACAAT GA
 
Protein sequence
MAIRNGACVV VVDDEQRENE GDLICAAQFA TPEAINFMAT EARGLICLAM EGDRLDELDL 
PLMVDRNTDA NQTAFTVSID AGIEHGVTTG ISADDRARTI QVALNPSTRP ADLRRPGHIF
PLRARSGGVL KRAGHTESAV DLSLLAGLSP AGVICEIQNT DGSMARLPEL RAYADQWGLK
LISIADLIRY RLENERFVKR LAHAELPSQF GAFQAIGYKN DLDGSEHVAL VKGDPASLKE
PVLVRMHSEC LTGDAFGSLR CDCRPQLEAA LRQIEAEGEG VVVYLRQEGR GIGLINKLRA
YSLQDGGLDT VEANERLGFP ADLRNYGVGA QILSDLGIHR LRLLTNNPRK IAGLGGYGLQ
VEERVPLVMD AGDHNADYLA AKRDKLGHLL EADTPCTVLA MAVHGQPDTW PQVRRQVESV
AHEHGFQMDA LHEPRLLALW DRPQFVWKIK PGDQDPYQLI QALAKVSSTK ALGLMRVPSE
RMALHPPQTL ERLDRDLSEL ESDQRAGLIQ TSPVLLFWRQ GQQ