Gene PCC8801_3949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3949 
Symbol 
ID7103892 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4137456 
End bp4139153 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content37% 
IMG OID643476947 
ProductTn7-like transposition protein C 
Protein accessionYP_002374048 
Protein GI218248677 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAAGTA AAGATTTTAC CAATAAATAT GACTATTATA ATTGGATAGA AATTCCTGAT 
GGTACACGCG CTGCTGTGTC TAAATATCGT CAGTTTAAGC AGATAGAATA TAATAATAAT
CCTATTATTG AAGCTTTACC ACCGATATTT TCTCAAGCTG AATTTGTAGA TTTAGCCACA
AGATTACCGT TATATGACCC TTTAGAAAGA CAATTAGAAG CACAAGATAG GTTTCATTGT
ATTGAGCGAT TATCTCGCTA TTTTGACCCC CTTTCTATCA CCATTGATTT ACAACAAACT
ATCTGTGTTT TATTAATGAG TGGTTATATT TCTCGGAATC CTTTACAACC AGAATATGCA
AGACGTTCAA GACAAATTTA TGGGTCAATT CAAGCTAAAG ATGGTCACAA TTTAGAACAA
TATGTGACAG TTCCTACGAC TGCTTCAGGA CTAACTATTA TTGGAGAATC AGGGTTAGGA
AAGTCAACAA ATTTAGCGAA TATTCTTGAT ATTTATCCAC AAGTGATTCT TCATCCTCAA
TATAATGTGA CTCAAATTGT TTGGTTAAAG GTAGATTGTC CTCATGCGGG TTCTTTGAAA
GGGTTATGTA CGGATATTTT TCTCGCTGTT GACAGGTTAT TAGGGACAAA TCACTTCAAA
AAATTCGGTT CTAAGGGTAA TTCTGAGGAT TATATGTTAG CGCAAGTGGC ACAAATTGCC
CATACTCATC ATTTAGGGTT ATTAGTGATT GATGAGATGC AAAATTTAGC TAATGCGAGG
AGAGGACGGG ATGATTTACT GAATTTTTTG GTGAAAATGG ATAATATTAT TGGTATTGCC
GTGATAAGAG TGGGAACCAA TGAAGCAGAA CCGATTTTGA CAGGAAATTT TAGGAATGCG
AGACGGGGAA CAGGAGAAGG TGCAGTACGC TGGAAACGGA TGGAAAATAA CGGAAATTGG
CAGTTTTTTG TCGAGGGAAT GTGGGATTAT CAGTGGACAA AAACTGAGGT TCTCTATTCT
GAGGAAATTA GTGACGCACT CTATGAAGAA ACCCAAGGAA TCATCGATAT TGTGATTAAA
TTATACAAAA TGGTGCAATG GCGAGCTATT TCTCTGGGTG ACGATGAAAT AATCACGGTT
GATTTAATTC AGCAAGTTGC ACAAGAGGGG TTATATCTGG TGAAACCGAT GTTAGATGCG
ATACGTTCGG GGAATCTGGT ACAAATGAAA AAGTACCGAG ATATTGCCCC TGTAGATATT
TCTGACTATC GAGAAAAGTG TTTAAATGAT ATCAATTTTG AGGATTTAGC AGAATTAAGA
CGCATCAGAC GCAATAATAA ACAGTCAGCA ACTCTGTCTC CTCTGCTTAA GCAAGTGATT
GTGGAATTAC TAGAGTTGGA GGTTGAACCC ACTTTAGCGA AACGGTTGGG GGAAAGGATG
GTTAATGAAA ATCCCCAGGA AACGGATATT TCTAAGTTAG TTAATCAAGC GTATAAGATA
GCATTACAAG GGGAAGCATT TAAGGGCAAT AAATCAAGAA AACCGCAGTC CAAAGGTAAA
TTAAATCCGA ATTATGTTGA AAATGACATG AGAAAGATTC TAGAGGAGGC TAAAAATAAT
CAAATTCCTG TTTATGAACC GTTGGTAGAA GCAAAGATTA TTAAAGATAG TCCTGAAGTC
GATTTTTTCT TAATTTAG
 
Protein sequence
MESKDFTNKY DYYNWIEIPD GTRAAVSKYR QFKQIEYNNN PIIEALPPIF SQAEFVDLAT 
RLPLYDPLER QLEAQDRFHC IERLSRYFDP LSITIDLQQT ICVLLMSGYI SRNPLQPEYA
RRSRQIYGSI QAKDGHNLEQ YVTVPTTASG LTIIGESGLG KSTNLANILD IYPQVILHPQ
YNVTQIVWLK VDCPHAGSLK GLCTDIFLAV DRLLGTNHFK KFGSKGNSED YMLAQVAQIA
HTHHLGLLVI DEMQNLANAR RGRDDLLNFL VKMDNIIGIA VIRVGTNEAE PILTGNFRNA
RRGTGEGAVR WKRMENNGNW QFFVEGMWDY QWTKTEVLYS EEISDALYEE TQGIIDIVIK
LYKMVQWRAI SLGDDEIITV DLIQQVAQEG LYLVKPMLDA IRSGNLVQMK KYRDIAPVDI
SDYREKCLND INFEDLAELR RIRRNNKQSA TLSPLLKQVI VELLELEVEP TLAKRLGERM
VNENPQETDI SKLVNQAYKI ALQGEAFKGN KSRKPQSKGK LNPNYVENDM RKILEEAKNN
QIPVYEPLVE AKIIKDSPEV DFFLI