Gene PCC8801_4034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4034 
Symbol 
ID7104610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4225337 
End bp4226563 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content32% 
IMG OID643477029 
Producttransposase IS204/IS1001/IS1096/IS1165 family protein 
Protein accessionYP_002374129 
Protein GI218248758 
COG category[L] Replication, recombination and repair 
COG ID[COG3464] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCAA ATCCTCAACT AAATTTAATG ACTAACCTGT TACAACTAGA AGGAGTGACA 
GTCATCAATT ATCAGATAAT AAAAGAGATA GGAATAGTTT TATCTGTAGA GAAAATAGAG
CCAAATGCTA CCTGTATTTA CTGTGGTTCA AAAACGAGGA AAGTTCATCA AAATAACGAA
TTAACAATTA GGGATTTACC CTGGGGAGAA AAATCGGTTT ATTTAAAAAT TAATCGTCGG
CAAATGAGAT GTGAGCATTG TCAAAAGAAA TTCACAGAGG AATTGAGTTA TGTGCCCAAA
AAAAGAACTT ATACTGAGAG ATTTAGAAAG AAAATAATTG AAGAAGTTTT AAATAGTGAC
ATCAAGAATG TAGCGAAAAG AAATGGAGTT AGTGAACAAG AAATAGAAAC GATGCTGAAA
GATGTAGGAG AAGACTTAAA CCAAGAAAAA CCGAGGGAAT TAAGGCGATT AGGAATTGAT
GAAATCGCGG TGATTAAAGG ACAAGGAAAT TATTATGTTG TCTTAGTTGA TTTAGAGAGA
GGAGTGATAG TAGGAATTCT AGAAAAACGA ATAGAAGAGG AAGTTTTAAA ATATCTAGAA
GCATGGGGAG AAGAGGTTTT GACGAAGATT GAAGAAGTGA GTATAGATCT TTGGAAACCT
TATAAAAATA TTGTGAATAA ATTAATGCCC CAAGCTGAAG TCGTAGCTGA TAGATTTCAT
GTAATGAAAC AAGTTAATGA GGAATTAGAT GCTCAAAGAA AAACTCTTAA AAGAGAAGCT
AAAGAGCTAA AAGATACTAA TCAAAAAGAA GAAATATTGT CAGGATTAAA TAAGAGTAAA
TATGTTTTAT TGAAAAATGA AGAAGATTTA AACGAAGAGC AAAAAGAAAA ATTAGAGCAA
GTCTATAAAA CGTCAGAAGT CCTATCAAAA ATGCACCAAT TGAAGGAGGA ATTTAGAGAC
ATTTTTGAAA CCCAGTCAGA CTGGGTTTCA GGACTATTTG AATTAGCAGA TTGGTGTCAA
AAGGCTTATT CATTGTACCC GAAAAGTTGT GGAACAATTA GGCGTTGGAT TGGAGAAATT
ATTGCCTATT TTGACCAAGG AACAACTCAA GGAATAGTCG AAGGTATTAA CAATAAATTA
AAGTTGATTA AAAGGAGAGC TTATGGCTTT AGAAATTTTG GTAATTTTCA ACTCAGAAGT
TTCTTAACTT GGCATTTTAC TCGTTAA
 
Protein sequence
MPSNPQLNLM TNLLQLEGVT VINYQIIKEI GIVLSVEKIE PNATCIYCGS KTRKVHQNNE 
LTIRDLPWGE KSVYLKINRR QMRCEHCQKK FTEELSYVPK KRTYTERFRK KIIEEVLNSD
IKNVAKRNGV SEQEIETMLK DVGEDLNQEK PRELRRLGID EIAVIKGQGN YYVVLVDLER
GVIVGILEKR IEEEVLKYLE AWGEEVLTKI EEVSIDLWKP YKNIVNKLMP QAEVVADRFH
VMKQVNEELD AQRKTLKREA KELKDTNQKE EILSGLNKSK YVLLKNEEDL NEEQKEKLEQ
VYKTSEVLSK MHQLKEEFRD IFETQSDWVS GLFELADWCQ KAYSLYPKSC GTIRRWIGEI
IAYFDQGTTQ GIVEGINNKL KLIKRRAYGF RNFGNFQLRS FLTWHFTR