Gene PCC8801_1857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1857 
Symbol 
ID7105546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1948287 
End bp1949471 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content38% 
IMG OID643474923 
Producttransposase, IS605 OrfB family 
Protein accessionYP_002372056 
Protein GI218246685 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTAAAAG TCGTCAAAGT TCGGCTATAC CCAAATACAG AGCAACAGCA GTTACTAGAA 
CAAAGTTTTG GTAATGTTCG TTGGCTGTGG AATTATTGCC TAAATTTGAT GAATCAAACA
TATTTGGATA CTGGAAAAGG ATTATCAGGA TATGAGGTCA AAAAACTGAT TCCTTCTCTT
AAAAAAGAGC ATGAATGGTT AACTTTGACT TATTCTCAGT GCTTGCAACA AACCTGCTTA
AACCTTGGAG TTGCTTTTAA TAACTTTTTT GAGCGTAGAG CAAAGTATCC TAGGTTTAAG
TCAAAACATG GGAAACAATC TATTCAGTAT CCTCAAAATG TCAAGGTATT AGATTGTGGC
TTAAATCTTC CTAAAATTGG GGCAGTAAAA GCAGTAATTC ACCGTCCAAT CGAAGACAAG
ATTAAGACTG TTACCGTCTC TAAAAATAGC TGCAATCAAT ACTTTGCATC CATTTTGTTT
GAAGATGGCA AAGAAACCCC CCTAATAGGG GGGACAGAGG GGGGTGAGGG AAAAGCAGTA
GGAATTGACG TAGGCTTAAC TCATTTTTGC ATTACTTCAG ATGGCTCTAA ATTTGACAAT
CCCCGATTTT TAACCAAGCA CGAAAGGAAT TTAAAACGGA AACAGCAGCA ACTATCTAGA
AAGCAAAAAG GGTCTAATAA TCGTAATAAA GCTAGAAAGA AAGTTGCTAA AGTGCATCGA
AAAATAACTA ACTGTCGTGA AGATTTTCTA CACAAACTAT CTCGTAGGAT AGTAGACGAA
AACCAAGTTA TTGTGACAGA GAATCTTAAC GTTAAGGGCA TGATGAAAAA CCACTTCCTA
GCTAAAGCTA TTGCACAAGT TGGGTGGGGA ATGTTCATGA CTATGCTTAA ATACAAAGCA
GAAAATGATG GAAAAACCTA TCAAGAAGTT GATAGGTTTT TCCCTTCATC TAAAACTTGT
CATGTTTGCT TAAATCAGGT GGGAAGTTTG CCGCTTGATA TCAGACATTG GACTTGTGAA
AACTGCCAAA CAAAACACGA CAGAGATGTT AACGCCGCAA TCAACCTCCG CGATGAGGGA
CTACGAATCT TGACCTGTGG AACGCGGGAC AAAGCTTATC GCCAGACTGT AAGTCGTAGT
AATAGAGGAC GCAAGAAATC TACTACTGCG CTTGTCTCTG GGTAA
 
Protein sequence
MLKVVKVRLY PNTEQQQLLE QSFGNVRWLW NYCLNLMNQT YLDTGKGLSG YEVKKLIPSL 
KKEHEWLTLT YSQCLQQTCL NLGVAFNNFF ERRAKYPRFK SKHGKQSIQY PQNVKVLDCG
LNLPKIGAVK AVIHRPIEDK IKTVTVSKNS CNQYFASILF EDGKETPLIG GTEGGEGKAV
GIDVGLTHFC ITSDGSKFDN PRFLTKHERN LKRKQQQLSR KQKGSNNRNK ARKKVAKVHR
KITNCREDFL HKLSRRIVDE NQVIVTENLN VKGMMKNHFL AKAIAQVGWG MFMTMLKYKA
ENDGKTYQEV DRFFPSSKTC HVCLNQVGSL PLDIRHWTCE NCQTKHDRDV NAAINLRDEG
LRILTCGTRD KAYRQTVSRS NRGRKKSTTA LVSG