Gene PCC8801_3945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3945 
Symbol 
ID7103889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4130731 
End bp4131972 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content36% 
IMG OID643476943 
Producttransposase, IS605 OrfB family 
Protein accessionYP_002374044 
Protein GI218248673 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTGTTT TAGAGTTCAA GGTTAAAGCT AAAACTCAAC AGTATCAAGC GATAGACGAC 
GCTATCCGTA CCGCTCAATT TATCCGTAAC AAATGTGTAA GATTATGGAT GGATACGAAA
GGGACGGGGA AAAACGATTT ATACAAATAC TCAAAAGAAT TAGCCCACAA CTTTAAATTT
GCTGACGAAC TTAATTCAAC TGCTAGACAA GCTAGTGCTG AACGTGCTTG GAGTTCAATT
AGTCGTTTCT ACGATAACTG CAAACGTAAA ATATCAGGAA AAAAAGGTTA TCCTCAATTT
AAGTTTAGTC GCTCTGTTGA GTATAAACAG TCAGGATGGA AACTTCTTAA TCCTAAAACT
ATCAAGTTCA CTGACAAGAA AAACATAGGT ATTCTTAAAT TAGTGGGGAC TTGGGATTTA
GCCTATTTTC AAGAATCCGA TATTAAACGA GTTAGGTTAA TTCGTAGAGC CGATGGATAT
TATTGTCAAT TTGTTCTTTC TTGTGAAGTT AAAGAGGATG TCAAACCATC AGGTAAATGT
ATCGGGTTAG ATGTTGGACT GAGTTCTTTT TATACTGATC AAGAAGGTAA TAAAATTGAT
AATCCTAAGT TCTTGAGAAA GTCTGAGAAG CAATTAAAAA GACTTCAAAG AAGATTATCA
AAAAAGAAAA AGGGGTCATC TAATCGTCAA AAAGCTAGAC AGAGATTGGC TAAAGTTTAT
CTTAAAGTAA GTAGGCAGCG TAAAGACTTT GTTGTTAAAT TAGCAAGGTG CGTAGTTCAC
TCTAACGATG TGATTGCCTA TGAAGATTTA AGAATTAAGA ACTTAGTTAA AAATCATTGT
TTAGCTAAAA GTATTAATGA TGCTGCGTGG TATCAGTTTC GAGAATGGTT AGAATATTTT
GGACAAAAGA TGGGCAAAAT AACTATTGCT GTTGCTCCTC ATTATACGTC TCAAAACTGT
TCTAATTGTG GTGAAACTAT TAAGAAAACT CTATCGACTC GTACTCATGT TTGTAAATGT
GGATGCGAGT TAGACAGAGA TGAAAACGCC GCAATTAATA TTCTTAAAAA AGCGATTCAG
TGCGGTCTTG GCGGTTCCCG TCGAGAGCAA CTGAATCAAG AAGGATTAAG TACCGTAGGG
CATACGGGAT CTAAAGCTTG GGGAGAGAAT GCCTCTACTT TGATAGAAGA AATTCTGTCA
AAGCAAGCAA TCTCTGTGAA CCAAGAATCC ACTCGGCTTT AG
 
Protein sequence
MFVLEFKVKA KTQQYQAIDD AIRTAQFIRN KCVRLWMDTK GTGKNDLYKY SKELAHNFKF 
ADELNSTARQ ASAERAWSSI SRFYDNCKRK ISGKKGYPQF KFSRSVEYKQ SGWKLLNPKT
IKFTDKKNIG ILKLVGTWDL AYFQESDIKR VRLIRRADGY YCQFVLSCEV KEDVKPSGKC
IGLDVGLSSF YTDQEGNKID NPKFLRKSEK QLKRLQRRLS KKKKGSSNRQ KARQRLAKVY
LKVSRQRKDF VVKLARCVVH SNDVIAYEDL RIKNLVKNHC LAKSINDAAW YQFREWLEYF
GQKMGKITIA VAPHYTSQNC SNCGETIKKT LSTRTHVCKC GCELDRDENA AINILKKAIQ
CGLGGSRREQ LNQEGLSTVG HTGSKAWGEN ASTLIEEILS KQAISVNQES TRL