Gene PCC8801_0078 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0078 
Symbol 
ID7103737 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp80859 
End bp82700 
Gene Length1842 bp 
Protein Length613 aa 
Translation table11 
GC content37% 
IMG OID643473193 
ProductHTTM domain protein 
Protein accessionYP_002370340 
Protein GI218244969 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATAATA AATTATTCTC TAAACTAAAG GAAATTTTCG CCTTAGACTT AAGATCATTA 
GGGATTTTTC GGATAGGTTT AGCCTTGGTT GTTATGACTG ATTTAATCAG TCGAGCTAGG
GGATTAACTG ACCATTATAG CGATGCTGGA GTGATGCCTA GAGAGGCATT AACCAAAGAA
TTACTACACC CTTGGTATTG GTCTTTTCAT CTGATAAGTG GGAATTATTT ATTTCAAATT
GTCCTATTTA TTTTAGCTTT TTTGATAGCA ATAGCCATGT TAGTTGGTTA TCGAACTCGA
CTAGCCACTA TTGCTACTTG GGCGTTTAAT ATTTCTGTCC AAAATCGCAA TCCTGCCTTG
ATTTTTGCCG GAGACGATGT CTTACGCGCC ATGCTTTTTT GGGCAATGTT CTTACCATTA
GGAGCCGCTT ATTCCATCGA TAGTGCCCTT AATTCTTCCC AGAAACCTCT CCCCAAACAG
GTCATTAGTG GGGCAACTGT TGCTTTTTTA GTGCAACTGA TTTTTATTTA TACTTGGTCA
GCCGCTTATA AAACCAAAAG TGAAATTTGG TGGCCTAATG GGGAAGCCGT TTATTATTCC
CTAAGCTTTG ATCAATACGT CACAGAATTA GGTCAATTGT TATTAGGATT TCCCTTACCA
TTATTACAAT TTCTTACCTT TTCAGCCCTA ATTTTTGAAT GGGTTGCCCC CTTTATGATC
TTTATCCCTT GGCGAACAAC CTTTTGGCGT TGTTTAGCTA TTATTTCTTT TATTTTATTA
CATATTGGCT TTGAATTATC CTTTTCTATT GGGGTTTTAA GTTACCTTAG TATGGTCAAT
TGGTTAGCCT TAATTCCTAC TCCGGTATGG GATAAAATAG CCCATCAATT AAAGACCCCC
CAACGAGAAG GATTAATCAT TTATTATGAT CAAGATTGCG GGTTTTGTAA AAAAGTCGTT
CATTTAATTC GGACATTTTT AATTTTACCC GGAACCCCTT TATTAGTTGC CCAAGATAAC
GAGTCAATTT ATAGCGATAT GCTTGCCCAA AATTCCTGGG TTATCGTAGA TTGGCAAGGA
AATCGTTATT TCAAATTTGA AGGCATTATT TATGTTTGTA GCTTATCTCC TATTTTTCAA
TTTATCACCC CTATTCTGCG TTGGCAACCC ATCAAAACAG GAGGCACAAA ATTCTATGAA
ACCATTGCCT CTAATCGAAA ATTTGCCGGA AATTTTACAA AACCCTTCCC GTTCCGTCCT
CTGGAAATTA ATAACTCATT ACCCCTAAAT ATTGTCACAC TACTGTTACT GTTTTTAACC
ACCCTATGGA ACTTAAAAAG CTTTGTTGAT CAAACCGTTT ACCGTCGTCC CTTTAAAGAT
GATTGGATTA ACACTACCCA TAAAATCTTT ACCAAAAGAA CCTTTCAAGC AATTAATATC
ATTAGCTATC TAACCCGTTT AGATCAGTCT TGGAGTATTT TTGCCCCCGC CCCCCCTAGG
GATGATGGAT GGCACGTTAT TGTCGGGAAA CTCAATGACG GAACCGAGGT TAATCTCCTC
AATGAAAACA GCCCCATTCG ATGGGAAAAA CCCACCCTAA AACAACGACA AAACCTTTAT
CAAACCATAC AATGGCGGGT TTATTTCATC AATCTCAATC GTGCCATGGG GCAAAAACTG
TATCCCCACT TTGCTGAATA TTTATGTAAT CAGTGGAATA CGAATCATAC AAGAGATAAA
AAATTAAAAA GCCTAGAAAT TTATTTTATG GATGAAAGAA CCGTTCCTGC GGATCAAAAA
CAACCGATTA AAAAAGAACT CCATTTCAAA AAAGAATGCT AA
 
Protein sequence
MNNKLFSKLK EIFALDLRSL GIFRIGLALV VMTDLISRAR GLTDHYSDAG VMPREALTKE 
LLHPWYWSFH LISGNYLFQI VLFILAFLIA IAMLVGYRTR LATIATWAFN ISVQNRNPAL
IFAGDDVLRA MLFWAMFLPL GAAYSIDSAL NSSQKPLPKQ VISGATVAFL VQLIFIYTWS
AAYKTKSEIW WPNGEAVYYS LSFDQYVTEL GQLLLGFPLP LLQFLTFSAL IFEWVAPFMI
FIPWRTTFWR CLAIISFILL HIGFELSFSI GVLSYLSMVN WLALIPTPVW DKIAHQLKTP
QREGLIIYYD QDCGFCKKVV HLIRTFLILP GTPLLVAQDN ESIYSDMLAQ NSWVIVDWQG
NRYFKFEGII YVCSLSPIFQ FITPILRWQP IKTGGTKFYE TIASNRKFAG NFTKPFPFRP
LEINNSLPLN IVTLLLLFLT TLWNLKSFVD QTVYRRPFKD DWINTTHKIF TKRTFQAINI
ISYLTRLDQS WSIFAPAPPR DDGWHVIVGK LNDGTEVNLL NENSPIRWEK PTLKQRQNLY
QTIQWRVYFI NLNRAMGQKL YPHFAEYLCN QWNTNHTRDK KLKSLEIYFM DERTVPADQK
QPIKKELHFK KEC