Gene Cyan8802_3994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3994 
Symbol 
ID8393344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4110758 
End bp4111948 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content35% 
IMG OID644981918 
Producttransposase, IS605 OrfB family 
Protein accessionYP_003139632 
Protein GI257061744 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.437884 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.653956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTGTTT TAGAGTTCAA GGTTAAAGCT AAAACTCAAC AGTATCAAGC GATAGACGAC 
GCTATCCGTA CCGCTCAATT TATCCGTAAC AAATGTGTAA GATTATGGAT GGATACGAAA
GGGACGGGGA AAAACGATTT ATACAAATAC TCAAAAGAAT TAGCCCACAA CTTTAAATTT
GCTGACGAAC TTAATTCAAC TGCTAGACAA GCTAGTGCTG AACGTGCTTG GAGTTCAATT
AGTCGTTTCT ACGATAACTG CAAACGTAAA ATATCAGGAA AAAAAGGTTA TCCTCAATTT
AAGTTTAGTC GCTCTGTTGA GTATAAACAG TCAGGATGGA AACTTCTTAA TCCTAAAACT
ATCAAGTTCA CTGACAAGAA AAACATAGGT ATTCTTAAAT TAGTGGGGAC TTGGGATTTA
GCCTATTTTC AAGAATCCGA TATTAAACGA GTTAGGTTAA TTCGTAGAGC CGATGGATAT
TATTGTCAAT TTGTTCTTTC TTGTGAAGTT AAAGAGGATG TCAAACCATC AGGTAAATGT
ATCGGGTTAG ATGTTGGACT GAGTTCTTTT TATACTGATC AAGAAGGTAA TAAAATTGAT
AATCCTAAGT TCTTGAGAAA GTCTGAGAAG CAATTAAAAA GACTTCAAAG AAGATTATCA
AAAAAGAAAA AGGGGTCATC TAATCGTCAA AAAGCTAGAC AGAGATTGGC TAAAGTTTAT
CTTAAAGTAA GTAGGCAGCG TAAAGACTTT GTTGTTAAAT TAGCAAGGTG CGTAGTTCAC
TCTAACGATG TGATTGCCTA TGAAGATTTA AGAATTAAGA ACTTAGTTAA AAATCATTGT
TTAGCTAAAA GTATTAATGA TGCTGCGTGG TATCAGTTTC GAGAATGGTT AGAATATTTT
GGACAAAAGA TGGGCAAAAT AACTATTGCT GTTGCTCCTC ATTATACGTC TCAAAACTGT
TCTAATTGTG GTGAAACTAT TAAGAAAACT CTATCGACTC GTACTCATGT TTGTAAATGT
GGATGCGAGT TAGACAGAGA TGAAAACGCC GCAATTAATA TTCTTAAAAA AGGATTAAGT
ACCGTAGGGC ATACGGGATC TAAAGCTTGG GGAGAGAATG CCTCTACTTT GATAGAAGAA
ATTCTGTCAA AGCAAGCAAT CTCTGTGAAC CAAGAATCCA CTCGGCTTTA G
 
Protein sequence
MFVLEFKVKA KTQQYQAIDD AIRTAQFIRN KCVRLWMDTK GTGKNDLYKY SKELAHNFKF 
ADELNSTARQ ASAERAWSSI SRFYDNCKRK ISGKKGYPQF KFSRSVEYKQ SGWKLLNPKT
IKFTDKKNIG ILKLVGTWDL AYFQESDIKR VRLIRRADGY YCQFVLSCEV KEDVKPSGKC
IGLDVGLSSF YTDQEGNKID NPKFLRKSEK QLKRLQRRLS KKKKGSSNRQ KARQRLAKVY
LKVSRQRKDF VVKLARCVVH SNDVIAYEDL RIKNLVKNHC LAKSINDAAW YQFREWLEYF
GQKMGKITIA VAPHYTSQNC SNCGETIKKT LSTRTHVCKC GCELDRDENA AINILKKGLS
TVGHTGSKAW GENASTLIEE ILSKQAISVN QESTRL