Gene Information |
Locus tag | Synpcc7942_1096 |
Symbol | |
ID | 3775046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Synechococcus elongatus PCC 7942 |
Kingdom | Bacteria |
Replicon accession | NC_007604 |
Strand | - |
Start bp | 1113354 |
End bp | 1114724 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637799522 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_400113 |
Protein GI | 81299905 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
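The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record, but both reproduce the listed values. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon, which encodes no residue


# Values taken from the record for Synpcc7942_1096
length_bp = gene_length(1113354, 1114724)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the record's values this yields 1371 bp and 456 aa, matching the Gene Length and Protein Length fields.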
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.0295127 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0112677 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCGCAGCG ACTGGATCGC ACCCCGCCGA GGCCAAGCCA ACGTCACTCA AATGCACTAC GCCCGCCAAG GCGTGATCAC CGAAGAAATG GACTTCGTGG CGCGGCGCGA AAATCTGCCA GCCGATCTAA TTCGGGATGA AGTGGCACGG GGTCGGATGA TTATCCCCGC CAACATCAAC CACACCAATT TGGAGCCGAT GGCGATCGGC ATTGCCTCCA AGTGCAAGGT CAACGCCAAC ATCGGTGCTT CGCCTAACGC CTCCAACATC GATGAAGAAG TCGAGAAGCT GAAGCTCGCG GTCAAATACG GTGCCGATAC CGTCATGGAC CTCTCGACCG GCGGCGGCAA CCTCGATGAG ATTCGCACCG CGATCATCAA TGCTTCGCCG GTACCGATCG GCACCGTGCC GGTCTACCAA GCCCTGGAAT CCGTTCACGG GCGCATCGAA AAACTCAGCG CCGACGACTT CTTGCATGTG ATCGAAAAGC ACTGCGAACA GGGCGTCGAC TACCAAACCA TCCACGCCGG TCTGCTGATT GAACACCTGC CCAAGGTCAA GAGCCGGATC ACCGGGATTG TTTCGCGGGG CGGCGGCATC ATTGCCCAGT GGATGCTCTA CCACCACAAG CAAAACCCGC TCTATACCCA CTTTCGCGAC ATCATCGAAA TCTTCAAGCG CTACGACTGT AGCTTCAGCT TGGGTGACTC GCTGCGGCCG GGTTGCCTGC ACGATGCTAG CGACGATGCC CAGCTCAGCG AGCTGAAGAC TCTCGGTCAA CTGACGCGGG TTGCTTGGGA ACACGACGTG CAAGTCATGG TCGAAGGGCC AGGCCACGTT CCCATGGACC AGATCGAGTT CAACGTCCGC AAGCAAATGG AAGAGTGCTC AGAAGCTCCC TTCTACGTCT TGGGTCCCCT CGTGACCGAC ATTGCACCGG GCTATGACCA CATCACCAGC GCGATCGGGG CAGCAATGGC GGGCTGGTAT GGCACGGCAA TGCTCTGCTA CGTCACGCCC AAAGAGCACT TGGGTCTGCC CAATGCGGAA GATGTGCGCA ATGGTTTGAT CGCCTACAAA ATTGCGGCTC ATGCAGCAGA TATCGCTCGC CACCGTCCGG GTGCTCGCGA TCGCGATGAT GAACTGAGTC GGGCACGCTA CGCCTTCGAC TGGAACAAGC AATTTGACTT GAGCCTCGAT CCAGAGCGGG CGCGGGAATA CCACGACGAA ACTCTGCCAG CAGATATCTA CAAAACGGCA GAATTCTGTT CGATGTGTGG ACCGAAGCAC TGTCCGATGC AAACCAAGAT CACCGAGGAA GATCTAACCG AGTTGGAAAA ATTCCTCGAG AAAGATAGCG CTCTGGCGTA G
Protein sequence | MRSDWIAPRR GQANVTQMHY ARQGVITEEM DFVARRENLP ADLIRDEVAR GRMIIPANIN HTNLEPMAIG IASKCKVNAN IGASPNASNI DEEVEKLKLA VKYGADTVMD LSTGGGNLDE IRTAIINASP VPIGTVPVYQ ALESVHGRIE KLSADDFLHV IEKHCEQGVD YQTIHAGLLI EHLPKVKSRI TGIVSRGGGI IAQWMLYHHK QNPLYTHFRD IIEIFKRYDC SFSLGDSLRP GCLHDASDDA QLSELKTLGQ LTRVAWEHDV QVMVEGPGHV PMDQIEFNVR KQMEECSEAP FYVLGPLVTD IAPGYDHITS AIGAAMAGWY GTAMLCYVTP KEHLGLPNAE DVRNGLIAYK IAAHAADIAR HRPGARDRDD ELSRARYAFD WNKQFDLSLD PERAREYHDE TLPADIYKTA EFCSMCGPKH CPMQTKITEE DLTELEKFLE KDSALA
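The gene sequence is listed in blocks of ten bases and in coding orientation (it begins with ATG and ends with TAG even though the gene lies on the minus strand). The following sketch shows one way to strip the block spacing and run basic sanity checks such as GC fraction and start/stop codons. The helper names are hypothetical, and the start codon set is a deliberately small subset of those permitted by translation table 11.

```python
# Assumption: a small subset of table 11 start codons; the stops are the standard three.
START_CODONS = {"ATG", "GTG", "TTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace between 10-base blocks and normalise case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the sequence starts and stops on known codons."""
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq[:3] in START_CODONS
        and seq[-3:] in STOP_CODONS
    )


# Usage with the first two blocks of the gene sequence from the record
prefix = clean_sequence("ATGCGCAGCG ACTGGATCGC")
print(prefix, round(gc_fraction(prefix), 2))
```

Passing the full Gene sequence field through `clean_sequence` and then `looks_like_complete_cds` is a quick way to confirm that a copied record was not truncated.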