Gene Syncc9902_0167 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9902_0167 
Symbol 
ID3743680 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9902 
KingdomBacteria 
Replicon accessionNC_007513 
Strand
Start bp172076 
End bp173476 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content50% 
IMG OID637770335 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_376185 
Protein GI78183751 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTACTC AATGGGTTTC TGCTCGCAAG GGCCAAGCGA ATGTGTCTCA GATGCATTAC 
GCCCGCAAGG GTGTGGTCAC TGAAGAGATG GCCTATGTGG CCAAGAGGGA GAACCTGCCC
GAATCGTTAA TCATGGAGGA GGTGGCGCGT GGTCGCATGA TCATCCCCGC CAACATCAAT
CACACGAATT TGGAGCCAAT GGCGATTGGC ATCGCCAGTA GCTGCAAAGT GAACGCGAAT
ATCGGTGCTT CCCCGAATGC TTCCGATGCC GCGGAAGAGG TGAACAAGCT GAAGCTGGCG
GTGAAATATG GCGCTGACAC CGTCATGGAT TTATCCACAG GTGGTGTGAA TCTCGACGAG
GTACGTACCG CAATCATTGG TGCATCTCCC GTTCCGATTG GAACAGTTCC TGTTTATCAA
GCGCTGGAAA GTGTGCATGG TTCGATTGAA AAGTTGGATG AAGATGATTT TCTCCACATT
ATTGAGAAGC ACTGCCAGCA GGGAGTTGAC TATCAGACGA TTCATGCGGG TCTGCTAATT
GAACATCTAC CTAAGGTGAA GGGTCGAATC ACAGGCATTG TGAGTCGTGG TGGTGGGATT
TTGGCGCAGT GGATGTTGTA CCACCACCGT CAAAATCCTT TGTTCACTCG CTTTGACGAT
ATTTGCGAAA TTTTTAAGCG CTACGACTGC ACGTTCTCTT TGGGGGATTC ACTGCGGCCC
GGTTGCCAAC ATGATGCTTC TGATGCTGCG CAACTGGCCG AGCTCAAAAC TCTTGGAGAG
TTAACGCGGC GGGCTTGGAA GCACGATGTA CAGGTCATGG TTGAAGGGCC AGGTCATGTG
CCTTTGGATC AGATTGAATT CAACGTGAAA AAACAAATGG AAGAGTGCAA TGAGGCACCC
TTCTATGTTC TTGGACCATT GGTGACTGAT ATTGCACCTG GTTATGACCA CATCACCTCT
GCGATCGGTG CAGCAATGGC TGGCTGGCAC GGCACTGCGA TGCTCTGTTA TGTGACTCCA
AAAGAGCATC TTGGTCTTCC GAATGCAGAG GATGTTCGTG AGGGACTGAT TGCTTACAAA
ATTGCTGCGC ATGCGGCAGA TATTGCCCGT CACCGCCCTG GTGCTCGTGA TCGCGATGAT
GAATTGAGCC GCGCACGTTA TGCCTTTGAT TGGAACAAAC AATTTGAGTT GTCATTAGAT
CCAGAAAGGG CGAAGGAGTA TCACGACGAG ACGTTGCCTG CAGACATCTA TAAGCAAGCA
GAGTTTTGCT CGATGTGCGG TCCGAAGCAC TGTCCGATGC AAACCAAGAT CACGGATGAA
GATCTCGAAG GGCTTCAAAA AGTTCTTGAG TCACAGGGTG CCGCTGAACT TGCCTCAGTA
AAACTTGATA AGGCTGAATA A
 
Protein sequence
MRTQWVSARK GQANVSQMHY ARKGVVTEEM AYVAKRENLP ESLIMEEVAR GRMIIPANIN 
HTNLEPMAIG IASSCKVNAN IGASPNASDA AEEVNKLKLA VKYGADTVMD LSTGGVNLDE
VRTAIIGASP VPIGTVPVYQ ALESVHGSIE KLDEDDFLHI IEKHCQQGVD YQTIHAGLLI
EHLPKVKGRI TGIVSRGGGI LAQWMLYHHR QNPLFTRFDD ICEIFKRYDC TFSLGDSLRP
GCQHDASDAA QLAELKTLGE LTRRAWKHDV QVMVEGPGHV PLDQIEFNVK KQMEECNEAP
FYVLGPLVTD IAPGYDHITS AIGAAMAGWH GTAMLCYVTP KEHLGLPNAE DVREGLIAYK
IAAHAADIAR HRPGARDRDD ELSRARYAFD WNKQFELSLD PERAKEYHDE TLPADIYKQA
EFCSMCGPKH CPMQTKITDE DLEGLQKVLE SQGAAELASV KLDKAE