Gene PCC8801_3702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3702 
Symbol 
ID7102947 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3893122 
End bp3894318 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content42% 
IMG OID643476711 
Productthioester reductase domain protein 
Protein accessionYP_002373814 
Protein GI218248443 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG3320] Putative dehydrogenase domain of multifunctional non-ribosomal peptide synthetases and related enzymes 
TIGRFAM ID[TIGR01746] thioester reductase domain 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCTATT CTTTAGCGAA GGGTAACTTA AATCGGGAAG CATACCTAGA TCCATCTATC 
CAATATCTAC CCCTATCCCC GTTTAGACAA CGGACTAAAC GGAGAGCATT TTTGACGGGA
GCAACGGGTT TTCTGGGTGC TAACCTACTC CACGATCTGC TCAAACACAC TCTTTTTGAA
GTATATTGCT TAGTACGCGC ATCAAACGCC GATGAAGGGA AAGTCAAACT ACGCCAGGCT
CTTAAAGCGC AAAATCTCTG GATGAAAGCC TTTGAATTTA GGATACATCC AGTTGTTGGC
GACTTAAGCA AACCTCAGTT AGGACTTTCC GATGCTGCCT TTGCAAGTTT GGGTAAACAG
ATAGAAGTCA TCTATCACAA TGCTTCTTGG CTCAATCTCT CCTATCCTTA CTCGACTCTC
AAAGCGACTA ACGTCAAGGG AACAGAAGAA ATTCTTCGAC TCGCAGCGAT TAAACCACAG
ATCGCCGTAC ACTATGTTTC AACGCTTTCT GTCTTCAGTC CTCGCGTTTA CAATAATCAA
TCAGAAATTG CGGAATGCTT CTGGGTACAA GAACCCATTG GATTACAGCA AGGTTATCCT
CAAAGTAAGT GGGTTGCGGA ACAGTTAATC AATATCGGGT CACAACGCGG ACTTTTTGCC
TGGATTTACC GACCTGGAAT GATTACCGGC CATAGTGAAA CGGGTATCTG TAACAATAAG
GATAAATTCT CTATTCTGCT ACGGACTTGC CTTGAGCTTG GTTTAGTGCC AGAATTTAAA
GGAACTGTTT ACATGACTCC TGTTGATTAT GTCAGTCGCT CAATTATCGA GCTTTCAGAA
TTAGTCAGCG AAACTGATCA AGCTTTTCAT CTGATTACAC CCCAACCAAT GTCTTGGACA
AAAGTTGTTA AAACAATGCT TGATACTTAT CCTACTATGA ACTCGATGCC TTATAGTCTT
TGGTTTAAGG AAGTTCAGCA ATCAGCTAGA CAGTCGGTGA GTCAGGAACT GCGGACTTTA
GTTGCCTTAC TTTTTCATCC GACCATTCCA CCCTTTTCGA CTAATCAAGA CGTACAGTTT
AGCTGTGAGA ACACCATGAA GGTGTTATCC ACTAAGTTTA ACATTGAATG GACACAAGAT
AATCCAACCT TATTAAGACG CTACCTCTCT TATTTAGCGG AAGTTCCCTC CTGGTAA
 
Protein sequence
MSYSLAKGNL NREAYLDPSI QYLPLSPFRQ RTKRRAFLTG ATGFLGANLL HDLLKHTLFE 
VYCLVRASNA DEGKVKLRQA LKAQNLWMKA FEFRIHPVVG DLSKPQLGLS DAAFASLGKQ
IEVIYHNASW LNLSYPYSTL KATNVKGTEE ILRLAAIKPQ IAVHYVSTLS VFSPRVYNNQ
SEIAECFWVQ EPIGLQQGYP QSKWVAEQLI NIGSQRGLFA WIYRPGMITG HSETGICNNK
DKFSILLRTC LELGLVPEFK GTVYMTPVDY VSRSIIELSE LVSETDQAFH LITPQPMSWT
KVVKTMLDTY PTMNSMPYSL WFKEVQQSAR QSVSQELRTL VALLFHPTIP PFSTNQDVQF
SCENTMKVLS TKFNIEWTQD NPTLLRRYLS YLAEVPSW