Gene Cyan8802_3442 details

Gene Information

Locus tag: Cyan8802_3442
Symbol:
ID: 8392778
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 3513603
End bp: 3514985
Gene Length: 1383 bp
Protein Length: 460 aa
Translation table: 11
GC content: 46%
IMG OID: 644981377
Product: thiamine biosynthesis protein ThiC
Protein accession: YP_003139103
Protein GI: 257061215
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0422] Thiamine biosynthesis protein ThiC
TIGRFAM ID: [TIGR00190] thiamine biosynthesis protein ThiC
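
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(3513603, 3514985)
print(length, protein_length_aa(length))  # 1383 460
```

Both values agree with the Gene Length and Protein Length fields of this record.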


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.309449
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATCAC AATGGGTTGC GAAGCGTCGC GGACAGAGCA ATGTATCCCA AATGCACTAT 
GCTCGTCAAG GCATGATCAC CGAGGAAATG GATTATGTTG CCAAACGGGA AAATCTTCCC
CCTGACTTAA TTCGTCAAGA AGTCGCACGG GGACGGATGA TTATTCCCGC CAATATTAAC
CATCTTAACC TAGAACCGAT GGCCATTGGT ATTGCCTCAA AATGCAAGGT TAATGCCAAT
ATTGGGGCAT CTCCTAACTC TTCTAACCTA GAGGAAGAAG TCGCTAAACT CAACCTAGCC
GTCAAATACG GTGCAGATAC CGTGATGGAC TTGTCTACGG GGGGAGGAGA CTTAGACACC
ATTCGGACGG CAATTATTAA CGCTTCTCCT GTCCCTATTG GAACTGTTCC CATTTATCAA
GCCGTGGAAA GCGTCCACGG GAATATCGAA AAGCTAACCC CTGATGATTT CTTGCACATC
ATTGAGAAAC ACGCTCAACA GGGTGTGGAC TACATGACCA TCCATGCGGG ACTGTTAATA
GAATACCTTC CCTTGGTCAG AAGTCGTCTA ACAGGGATTG TCTCTCGCGG CGGTGGTATT
ATTGCTAAGT GGATGCTGCA CCATCACAAG CAAAACCCGC TTTATACCCA TTTTGATGAG
ATTATTGAGA TCTTTAAGAA ATACGACGTT TCTTTTAGTT TAGGAGATTC ATTGCGCCCT
GGTTGTACCC ACGATGCGTC CGATGAAGCT CAACTGTCTG AGTTGAAAAC CCTTGGACAA
TTAACCCGTC GTGCTTGGGA GCATGATGTT CAGGTGATGG TGGAAGGTCC AGGCCATGTT
CCGATGGATC AAATTGAGTT TAATGTCAAA AAACAAATGG AAGAGTGTAG CGAAGCACCT
TTCTATGTTT TGGGTCCATT GGTGACAGAT ATTGCTCCAG GATATGATCA TATTACCTCA
GCGATCGGGG CAGCGATGGC CGGTTGGTAT GGAACGGCAA TGTTATGCTA TGTTACTCCG
AAAGAGCATT TAGGGTTGCC TGATGCGGAG GACGTGCGTA ATGGGTTAAT TGCCTATAAA
ATTGCGGCTC ATGCTGCCGA TATTGCTCGT CAACGTCCAG GAGCACGCGA TCGGGATGAT
GAACTGTCGA AAGCCCGTTA TAATTTTGAC TGGAACCGTC AGTTTGAACT ATCGTTAGAT
CCCGATCGCG CCAGGGAATA TCACGATGAA ACTTTGCCCG CAGATATCTA TAAAACGGCG
GAGTTTTGTT CAATGTGTGG ACCGAAGTTC TGTCCCATGC AAACGAAAGT AGATGCGGAT
GCGTTGACGG AATTGGAGAA ATTCCTAGCC GAACAAAAGA ACAAAGAAGC GATTGCTCAT
TAA
 
Protein sequence
MRSQWVAKRR GQSNVSQMHY ARQGMITEEM DYVAKRENLP PDLIRQEVAR GRMIIPANIN 
HLNLEPMAIG IASKCKVNAN IGASPNSSNL EEEVAKLNLA VKYGADTVMD LSTGGGDLDT
IRTAIINASP VPIGTVPIYQ AVESVHGNIE KLTPDDFLHI IEKHAQQGVD YMTIHAGLLI
EYLPLVRSRL TGIVSRGGGI IAKWMLHHHK QNPLYTHFDE IIEIFKKYDV SFSLGDSLRP
GCTHDASDEA QLSELKTLGQ LTRRAWEHDV QVMVEGPGHV PMDQIEFNVK KQMEECSEAP
FYVLGPLVTD IAPGYDHITS AIGAAMAGWY GTAMLCYVTP KEHLGLPDAE DVRNGLIAYK
IAAHAADIAR QRPGARDRDD ELSKARYNFD WNRQFELSLD PDRAREYHDE TLPADIYKTA
EFCSMCGPKF CPMQTKVDAD ALTELEKFLA EQKNKEAIAH