Gene Syncc9605_2507 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSyncc9605_2507 
Symbol 
ID3737922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSynechococcus sp. CC9605 
KingdomBacteria 
Replicon accessionNC_007516 
Strand
Start bp2314474 
End bp2315490 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content43% 
IMG OID637777095 
ProductThiF family protein 
Protein accessionYP_382793 
Protein GI78214014 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0476] Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.415604 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.720809 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAATTAC AACTTAAGAG ATCAATCTCA ATTAGACAAG ATCGTAACGG TGGGTTGACC 
TTTGGGATGG CACCACCACG ACACTTTATC CTGGAAAGCC CACCACCCTT TCTTGCCCTT
CTTCTAGAGA TTCTAAATAA GCCACAATCT CTAGAAGAAG TAATTGAAAA ATTAACACAG
GAAAACCAGA ACTGGAAGCC AAGTGAAATC AGCCAGATAT GGCAAGAATT AATTGACCTA
AATATTCTGG AAGAGCCTAG AAAGGCTGGA AGATATGACC GTCATGAATT ATATTATGAT
ATTTTCAATG TCAATCGCGA ACACTATGGC TGTCTGGCAG AAAAGAAGGT TGGCTTAATT
GGAGCCGGCG GGATTGGATC CACCTGTGCA CTGCTCCTTG CCGCCGCAGG TATTGGAATT
CTTTCACTGG CAGATGATGA CCTTTTAGAG GAAACCAATC TACCAAGAGT CGTGCTACTA
GAAGAGCAAG ACATTGGCCT GCCTAAAATA GACCAGATCA AAGAGAGAAT CAATGCCAGA
AATAGCGCCA CAATCATTGA AGCAACTCGG TCAAAAATTA ATGGCCCAGA CGATATTCTG
GCATTCTTCG GAAATTGCGA TGCTTGGATA TTATCAGCAG ATACTCCAAC TCAGCTCATT
CAAGAATGGA CAAATGCAGC CTCTCTAGAT ACCAATACAC CCTATATTTC AGCTGGGTAC
GCTGAAATCA ATGGAATGGT AGGCCCTTTT ATAATTCCCG GCGAAACACC GTGTCATCAG
TGTAGGATTC TCCAGGGGAA TGTTCCATCT GGCCGGCAAA TCAATAAGAA AGTACAAGCA
ACATCTTATG GACCTCTGAA CACGATTGTT TCCGCAATGG CAGTGAATGA GGTAATCAGA
TACCTTCTTG GTCTTGATAT AGCAACAAAA GGTACTCAAA TTATATTGAA TTCGAGCAAC
TACAGCACGA CATTTGAGCC TCTCACATTT GCACCAGATT GCAGATGCCA TGCTTAA
 
Protein sequence
MQLQLKRSIS IRQDRNGGLT FGMAPPRHFI LESPPPFLAL LLEILNKPQS LEEVIEKLTQ 
ENQNWKPSEI SQIWQELIDL NILEEPRKAG RYDRHELYYD IFNVNREHYG CLAEKKVGLI
GAGGIGSTCA LLLAAAGIGI LSLADDDLLE ETNLPRVVLL EEQDIGLPKI DQIKERINAR
NSATIIEATR SKINGPDDIL AFFGNCDAWI LSADTPTQLI QEWTNAASLD TNTPYISAGY
AEINGMVGPF IIPGETPCHQ CRILQGNVPS GRQINKKVQA TSYGPLNTIV SAMAVNEVIR
YLLGLDIATK GTQIILNSSN YSTTFEPLTF APDCRCHA