Gene PCC8801_3220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3220 
Symbol 
ID7103946 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3363426 
End bp3364640 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content38% 
IMG OID643476241 
Producttransposase, IS605 OrfB family 
Protein accessionYP_002373351 
Protein GI218247980 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID[TIGR01766] transposase, IS605 OrfB family, central region 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGTACAGAA GCGTAAAGGT TAGAATCTAC CCAACATCTG AGCAGTCCCA AAAACTAAGT 
CAAGTTATGG GGTCAGCAAG ATGGTGGTGG AATTATGCCT TGAATCTGTG CAATCAAACT
TATAAGGAAA CGGGTAAAGG ATTGACACAA ATAGCTCTTA ACAAGGTTTT GCCTAAGCTT
AAAAAAGAAG AAGAAACTGC ATGGCTAAAA GACTGTTATT CTCAGGTCTT ACAGTCAACA
ACTCTTAACT TGACTAAAGC TTTCAAAAAC TTTTTTGACA AAAGAGCGAA ATATCCCAGA
TTCAAGTCCT ATCATGGCAA ACAATCCTGT CAATATCCTC AAAACTGTCA AGTTGTTGAA
AAGGGAATAA AGATCCCCCA AGTTGGGGTT ATAAAAGCTT CAATTCATCG ACTTTTTGAT
GGACAACTCA AAACCGTTAC TATTACAAAA ACACCAACCG GAAAATATTA TGCTTCATTG
TTGTTTGACA CTGAACAAGA GATTCCTGGT TTGGTAGTAA CAGGTAAAAC AATTGGGATT
GACTTAGGAC TTACAGACTT TTGTATTACC CATGATGGGC AAAAAACGTC TAAATTTGCC
AATCCTAGAC ACATCAAAAA ACATGAGAAG AATTTAGCCA GAAAACAAAC TAAATTAGCT
CGTAAAAAGA AAGGGAGTAA ATCTAGAGAA AAAGCACGAA AGCTTGTAGC TAGAGTTCAC
GAACGTATTA GTAATGCCCG TCAAGATTTT CTACATAAAT TATCAAGAAA AATTGTCAAT
GATAATCAAG TAGTCGTCGT TGAGAATTTA AACGTCAAGG GTATGGTTCG TAATCACAAC
TTAGCTAAAG CTATTTCTGA TGTCGGATGG GGAATATTTG TCAATTTTCT TGACTATAAA
CTACAACAAA AAGGCGGTTT TTTGGTAGAA ATTGATAGAT GGTTCCCGTC TTCTAAAACT
TGCTCTAATT GTCTACATCA AATGTCAGAA ATGCCATTAG ATGTAAGACA ATGGACTTGT
CCGAGTTGTG GGACACACCA CGATAGAGAT GAAAATGCAG CCAAAAACAT TAGAGCAGAA
GGCATCAGGC AATTATCGGT CTTGGGAACC AGGACTGCTG CTGAAGGAGG AGAAGTAAGA
CCAAAAGGTG GACGTAAGTC TGTCTTGAGG CATTCTCCTG TGAGTTCAGA ACCCCCAACT
ATACCGATAG GTTAG
 
Protein sequence
MYRSVKVRIY PTSEQSQKLS QVMGSARWWW NYALNLCNQT YKETGKGLTQ IALNKVLPKL 
KKEEETAWLK DCYSQVLQST TLNLTKAFKN FFDKRAKYPR FKSYHGKQSC QYPQNCQVVE
KGIKIPQVGV IKASIHRLFD GQLKTVTITK TPTGKYYASL LFDTEQEIPG LVVTGKTIGI
DLGLTDFCIT HDGQKTSKFA NPRHIKKHEK NLARKQTKLA RKKKGSKSRE KARKLVARVH
ERISNARQDF LHKLSRKIVN DNQVVVVENL NVKGMVRNHN LAKAISDVGW GIFVNFLDYK
LQQKGGFLVE IDRWFPSSKT CSNCLHQMSE MPLDVRQWTC PSCGTHHDRD ENAAKNIRAE
GIRQLSVLGT RTAAEGGEVR PKGGRKSVLR HSPVSSEPPT IPIG