Gene PCC8801_2662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_2662 
Symbol 
ID7102061 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp2746505 
End bp2747887 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content47% 
IMG OID643475701 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002372820 
Protein GI218247449 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATCAC AATGGGTTGC GAAGCGTCGC GGACAGAGCA ATGTATCCCA AATGCACTAT 
GCTCGTCAAG GCATGATCAC CGAGGAAATG GATTATGTTG CCAAACGGGA AAATCTTCCC
CCTGACTTAA TTCGTCAAGA AGTCGCACGG GGACGGATGA TTATTCCCGC CAATATTAAC
CATCTTAACC TAGAACCGAT GGCCATTGGT ATTGCCTCAA AATGCAAGGT TAATGCCAAT
ATTGGGGCAT CTCCTAACTC TTCTAACCTA GAGGAAGAAG TCGCTAAACT CAACCTAGCC
GTCAAATACG GTGCTGATAC CGTGATGGAC TTGTCCACAG GGGGAGGAGA CTTAGACACC
ATTCGCACCG CCATTATTAA CGCTTCTCCC GTTCCTATTG GAACCGTTCC CATTTATCAA
GCCGTGGAAA GCGTCCACGG GAATATCGAA AAGCTGACCC CTGATGATTT CTTGCACATC
ATTGAGAAAC ACGCTCAACA GGGTGTGGAC TACATGACCA TCCATGCGGG ACTGTTAATA
GAATACCTTC CCTTGGTCAG AAGTCGTCTA ACAGGGATTG TCTCTCGCGG CGGTGGTATT
ATTGCTAAGT GGATGCTGCA CCATCACAAG CAAAACCCGC TTTATACCCA TTTTGATGAG
ATTATTGAGA TCTTTAAGAA ATACGACGTT TCTTTTAGTT TAGGAGATTC ATTGCGCCCT
GGTTGTACCC ACGATGCGTC CGATGAAGCT CAACTGTCTG AGTTGAAAAC CCTTGGACAA
TTAACCCGTC GTGCTTGGGA GCATGATGTT CAGGTGATGG TGGAAGGTCC AGGCCATGTT
CCGATGGATC AAATTGAGTT TAATGTCAAA AAACAAATGG AAGAGTGTAG CGAAGCACCT
TTCTATGTTT TGGGTCCATT GGTGACAGAT ATTGCTCCAG GATATGATCA TATTACCTCA
GCGATCGGGG CAGCGATGGC CGGTTGGTAT GGAACGGCAA TGTTATGCTA TGTTACTCCG
AAAGAGCATT TAGGGTTGCC TGATGCGGAG GACGTGCGTA ATGGGTTAAT TGCCTATAAA
ATTGCGGCTC ATGCTGCCGA TATTGCTCGT CAACGTCCAG GGGCACGAGA CCGGGATGAT
GAACTGTCGA AAGCCCGTTA TAATTTTGAC TGGAACCGTC AGTTTGAACT ATCGTTAGAT
CCCGATCGCG CCAGGGAATA TCACGATGAA ACTTTGCCCG CAGATATCTA TAAAACGGCG
GAGTTTTGTT CAATGTGTGG ACCGAAGTTC TGTCCCATGC AAACGAAAGT AGATGCGGAT
GCGTTGACGG AATTGGAGAA ATTCCTAGCC GAACAAAAGA ACAAAGAAGC GATTGCTCAT
TAA
 
Protein sequence
MRSQWVAKRR GQSNVSQMHY ARQGMITEEM DYVAKRENLP PDLIRQEVAR GRMIIPANIN 
HLNLEPMAIG IASKCKVNAN IGASPNSSNL EEEVAKLNLA VKYGADTVMD LSTGGGDLDT
IRTAIINASP VPIGTVPIYQ AVESVHGNIE KLTPDDFLHI IEKHAQQGVD YMTIHAGLLI
EYLPLVRSRL TGIVSRGGGI IAKWMLHHHK QNPLYTHFDE IIEIFKKYDV SFSLGDSLRP
GCTHDASDEA QLSELKTLGQ LTRRAWEHDV QVMVEGPGHV PMDQIEFNVK KQMEECSEAP
FYVLGPLVTD IAPGYDHITS AIGAAMAGWY GTAMLCYVTP KEHLGLPDAE DVRNGLIAYK
IAAHAADIAR QRPGARDRDD ELSKARYNFD WNRQFELSLD PDRAREYHDE TLPADIYKTA
EFCSMCGPKF CPMQTKVDAD ALTELEKFLA EQKNKEAIAH