Gene Syncc9605_0123 details

Gene Information

Locus tag: Syncc9605_0123
Symbol:
ID: 3737248
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Synechococcus sp. CC9605
Kingdom: Bacteria
Replicon accession: NC_007516
Strand:
Start bp: 119212
End bp: 120666
Gene Length: 1455 bp
Protein Length: 484 aa
Translation table: 11
GC content: 61%
IMG OID: 637774702
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_380454
Protein GI: 78211675
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
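
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that the start and end positions are 1-based and inclusive and that the final codon of the gene is a stop codon that is not counted in the protein length. Both are assumptions, not statements made by the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 119212
end_bp = 120666

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1455 484"
```

Under those assumptions the result agrees with the listed gene length (1455 bp) and protein length (484 aa).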


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0039044
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGATCCACG GCCAAACGTC GACCCCTGGG CTGTCCGTCG CACCTCAGAT CATGCGCGCT 
TCCTGGGTTG AGTCCCGCAA GGGTCAGGCC AACGTCTCTC AGATGCACTA CGCCCGTCAG
GGCGTGGTGA CCGAAGAAAT GGCTCATGTG GCGAAGCGGG AGAACCTGCC CGAATCGCTG
GTGATGGAAG AGGTGGCCCG GGGGCGGATG ATCATCCCGG CCAATATCAA CCACACCAAT
CTGGAGCCGA TGGCAATCGG CATCGCCAGC AAGTGCAAGG TGAACGCCAA CATCGGCGCC
TCTCCAAATG CGTCCGATGC CGCTGAGGAG GTGAAAAAGC TCAAGCTGGC GGTGAAGTAC
GGCGCTGACA CCGTGATGGA TCTTTCCACT GGCGGCGTCA ACCTCGATGA GGTGCGCACC
GCAATCATCG GTGCATCTCC CGTGCCGATC GGCACCGTGC CTGTTTATCA GGCTCTTGAG
AGCGTCCACG GATCGATCGA GAAGCTCGAT GAGGACGACT TCCTCCACAT CATTGAGAAG
CACTGTCAGC AGGGCGTCGA CTACCAGACC ATCCACGCTG GCCTGCTGAT TGAGCACCTT
CCCAAGGTGA AGGGCCGCAT CACCGGCATC GTCAGCCGCG GCGGCGGGAT CCTGGCTCAA
TGGATGCTGT ATCACCACCG TCAAAACCCG CTCTACACGC GGTTTGACGA CATCTGCGAG
ATCTTCAAGC GCTACGACTG CACCTTCTCC CTCGGTGACT CGCTGCGCCC CGGTTGCCAG
CACGATGCGT CGGATGCTGC TCAACTGGCT GAATTGCACA CCCTCGGTGA ACTGACCCGT
CGCGCCTGGA AGCACGACGT GCAGGTGATG GTGGAGGGTC CCGGCCACGT TCCCCTCGAT
CAGATCGAGT TCAACGTGAA GAAGCAGATG GAGGAGTGCA GCGAAGCACC CTTCTATGTG
CTCGGCCCCC TGGTCACTGA CATTGCTCCC GGCTACGACC ACATCACTTC GGCCATCGGC
GCGGCGATGG CCGGTTGGCA TGGCACGGCG ATGCTCTGTT ATGTGACGCC GAAGGAGCAC
CTCGGTCTGC CCAACGCTGA TGATGTGCGC GAAGGCCTGA TCGCTTACAA GATCGCTGCC
CATGCGGCAG ATATTGCCCG CCATCGCCCC GGCGCCCGGG ACCGTGACGA CGAGCTCAGC
CGCGCCCGCT ACAACTTCGA TTGGAACAAG CAGTTTGAGC TGTCCTTGGA TCCTGAGCGG
GCCAAGGAGT ATCACGACGA AACCCTGCCG GCTGACATCT ACAAGCAGGC TGAGTTCTGC
TCCATGTGCG GACCGAAGCA CTGCCCGATG CAGACCAAGA TCACTGATGA AGATCTTGAG
GGTCTGGAGA AGGTGCTCGA AGCCAACACC GGCGCTGCAG AGCTGACGCC GGTCAAACTC
GACAAAGCCG ATTGA
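
The GC content reported for this gene can in principle be recomputed from the sequence text. The sketch below assumes GC content means the fraction of G and C bases among all bases, with the spaces used for grouping ignored. The helper name `gc_fraction` is hypothetical. The example input is only the first line of the gene sequence, so its value need not equal the figure reported for the whole gene.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("sequence contains no bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First line of the gene sequence, copied as printed (spaces included).
first_line = "GTGATCCACG GCCAAACGTC GACCCCTGGG CTGTCCGTCG CACCTCAGAT CATGCGCGCT"
print(f"{gc_fraction(first_line):.0%}")
```

Passing the full 1455 bp sequence as one string, with or without the grouping spaces, would give the value to compare against the GC content field.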
 
Protein sequence
MIHGQTSTPG LSVAPQIMRA SWVESRKGQA NVSQMHYARQ GVVTEEMAHV AKRENLPESL 
VMEEVARGRM IIPANINHTN LEPMAIGIAS KCKVNANIGA SPNASDAAEE VKKLKLAVKY
GADTVMDLST GGVNLDEVRT AIIGASPVPI GTVPVYQALE SVHGSIEKLD EDDFLHIIEK
HCQQGVDYQT IHAGLLIEHL PKVKGRITGI VSRGGGILAQ WMLYHHRQNP LYTRFDDICE
IFKRYDCTFS LGDSLRPGCQ HDASDAAQLA ELHTLGELTR RAWKHDVQVM VEGPGHVPLD
QIEFNVKKQM EECSEAPFYV LGPLVTDIAP GYDHITSAIG AAMAGWHGTA MLCYVTPKEH
LGLPNADDVR EGLIAYKIAA HAADIARHRP GARDRDDELS RARYNFDWNK QFELSLDPER
AKEYHDETLP ADIYKQAEFC SMCGPKHCPM QTKITDEDLE GLEKVLEANT GAAELTPVKL
DKAD
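
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it builds the standard codon table (translation table 11 uses the same codon-to-amino-acid assignments), treats an initial ATG, GTG, or TTG as methionine (a subset of the start codons permitted by table 11), and stops at the first stop codon. The names `CODON_TABLE`, `START_CODONS`, and `translate_cds` are hypothetical.

```python
# Standard codon table in NCBI base order (T, C, A, G); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: only these common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(sequence: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    dna = "".join(sequence.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for index in range(0, len(dna), 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            # An alternative start codon is still read as methionine.
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGATCCACG GCCAAACGTC GACCCCTGGG CTGTCCGTCG CACCTCAGAT CATGCGCGCT"
print(translate_cds(first_line))  # prints "MIHGQTSTPGLSVAPQIMRA"
```

The output matches the first 20 residues of the protein sequence above. The gene begins with GTG, which would encode valine inside a gene, so the leading M in the protein depends on the start-codon rule.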