Gene PCC8801_4522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4522 
Symbol 
ID7095903 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011723 
Strand
Start bp14695 
End bp15969 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content45% 
IMG OID643467504 
Productputative transposase IS891/IS1136/IS1341 family 
Protein accessionYP_002364800 
Protein GI218203947 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones75 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.839734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTACAC GCAGAGTTAC GTTTCGGCTA TATCCTTCCA AGTCTCAATC GGCAAAACTG 
TTTGAGGCCA GAAGACTCCA TGCCTATCTG TATAACGCCT GTGTGGAAGA CCGTAAAACC
AGTTATCAGA AATTCGGAAA GTCTGTAAGC TATTTTGACC AACAGGCCGC TCTCGTCCCC
TTTAAAGGAT GTTGGCCTGA ATATAAATCA TTGAATCACG GCTCATTGCA AGCAACTGTT
AAGCGAGTCG ATTTTGCGTT TCAACGCTTC TTTAAGGGAT TGGGTGGCTA TCCTAAATTT
CGTTCGATTC GCCAATACTC AGGTTGGACT TATCCCGATG CCCGTCAAGG GTTTCGAGTT
CATAGTATCG GTGAAAACGG GTACCTAGAG CTTCGAGACT TGGGTATTCA GGTTCAAATG
CGGGGGAAAG CACGTCAATG GGGAACTCCT AGTACCTGCA CGATTGTTTA TCGTCATGGG
AATTGGTATG CCTCCATCAC TGTTAAATGC GAAGAGATCC TTCGTGAAAC AGGAACAGGA
GCCATTGGAA TAGATTTTGG GACTCTCACT GCTATTGCAT TAAGTGACGG GACTAAAATA
GAGAATCCTC GCTTTCTTGC CAATGCTAAG GAGAAAATTA AAAGGGCTTC TAAGCAGAAA
AGACGCAAAA AAGCCCCTGA CCATAAAAAA CGGGTTAGAG GCTCTAACCG ATGGAAGAAA
GCGTCCAAAA AGGTTGCGAA ACTGCAAACA AAAGTAGCTA GTCAACGTCA AGATTGGGCG
CATAAGGTGT CAACACAAAT TGTTAGCTGT AATAGCATGG TTGCCACTGA AAAATTGAAT
ATCAAAGGAA TGACCCGCAA GGCTAAAAAA GGAAAGCGGA AACGCCAGAA ATCTGGATTG
AACCGCTCTA TTTTAGACGT GGGATGGGGA ATAACCCGTG ACATGATTGA GTATAAACTC
TCGGAATGTA ACGGAGTTTT TGTTGAGGTT CCCACTCAAA AAGTAAAACC TTCTCAAACC
TGTCCTAAAT GCGGTCATCA GGAGAAAAAG ACCTTGGAGC AACGCATTCA CGAATGCAAG
CAATGTGGTT ACACCAATGA CAGGGATGTA GCTAGTGCCG AGGTTATGCT GTCATGGGCG
TTAGGAACTA GCGTCCCTAA TCGTGGAGGG GAAAGCTCTA CTGAGAAACC CACAGTTAAA
TCCTGTGGAG GTTTTCAGCA ACTTGCCTCC GTGAAGCGAC AGAAACTCCA ATCTCAGCGT
AGCGGATTGG AGTAG
 
Protein sequence
MITRRVTFRL YPSKSQSAKL FEARRLHAYL YNACVEDRKT SYQKFGKSVS YFDQQAALVP 
FKGCWPEYKS LNHGSLQATV KRVDFAFQRF FKGLGGYPKF RSIRQYSGWT YPDARQGFRV
HSIGENGYLE LRDLGIQVQM RGKARQWGTP STCTIVYRHG NWYASITVKC EEILRETGTG
AIGIDFGTLT AIALSDGTKI ENPRFLANAK EKIKRASKQK RRKKAPDHKK RVRGSNRWKK
ASKKVAKLQT KVASQRQDWA HKVSTQIVSC NSMVATEKLN IKGMTRKAKK GKRKRQKSGL
NRSILDVGWG ITRDMIEYKL SECNGVFVEV PTQKVKPSQT CPKCGHQEKK TLEQRIHECK
QCGYTNDRDV ASAEVMLSWA LGTSVPNRGG ESSTEKPTVK SCGGFQQLAS VKRQKLQSQR
SGLE