Gene PCC8801_4536 details

Gene Information

Locus tag: PCC8801_4536
Symbol:
ID: 7095915
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011723
Strand:
Start bp: 28123
End bp: 29277
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 33%
IMG OID: 643467516
Product: transposase IS4 family protein
Protein accession: YP_002364812
Protein GI: 218203959
COG category:
COG ID:
TIGRFAM ID:
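
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields.
start_bp = 28123
end_bp = 29277

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and yields no amino acid.
stop_codon_nt = 3
protein_length = (gene_length - stop_codon_nt) // 3

print(gene_length)     # 1155
print(protein_length)  # 384
```

Under these assumptions both results agree with the Gene Length and Protein Length fields.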


Plasmid Coverage information

Num covering plasmid clones: 54
Plasmid unclonability p-value: 0.0600651
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.65428
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTTT TGCCTTTCTA TCAGGACTAT TTACAAAACG CATTATCAAA AAGTAAATTT 
TTACTTTTAC GAATATTAAT ATGGCTTTTA CAAGTTCATA AACAAGTTAG AATAGAACGG
TTAGCGGCTT ATCTTCCTCT TCCTATTCTA TACGAAAGTC GTAGAAAGAA GATTCAAAGA
TTTTTAGTCG AACCGTGCTT AAGCCTTGTC TTATTATGGT TTCCTCTGAT AAAATTAATA
GTAGAACGAG AATTTAAACC AGGAAGTCGT TTAACTTTAG TTTTGGATAG GACTCAGTGG
CAGGATAAAA ATGTGTTCAT GATTAGTGTA GTTTGGAGAA AGAGAGCCTT CCCTATTTAC
TGGCAAATTC TAGAGAAAAA AGGAAGCAGC AACGTCAAAG AACAAATCGC TTTAATCCGA
CCGGTCTTGA AATTATTTGC CGACTATGAG TTATTAATTT TAGGGGATAG GGAGTTTCAT
GGGGTAGAAT TATCTTATTG GTTAAAGAAA CGAAACCGAA CGGCTAAAAA TCCCATCTAT
TTTGCTTTTC GAGAAAGGAA AAATGTCTAC ATTAGAAGAA GTAAGAAGAA TCAAAAACGC
TTTCAAGATT TAACCCTGAC CCCAGGAGTC AAAGTTTTTG AAAAAAACAT TTTTATCACC
AAGCAAAAAG GGTTTGGTCG CTTTAATGTA TTGGCTTATC AGAAGAGAAA ATATAGAAAC
CATCAGGAAG AAGAACCTTG GTTTATTATA ACCAATTTAG ATAACCCATC CGAAGTCATA
AAATATTATA AAATCAGAGG TGGAATTGAA GCTATGTTTC GAGATTATAA GAGTGGAGGA
TATAATCTCG AAGGGAGTAA AGCTAATATT CATCGACTTA CTAACTTGAT TTTATTAATA
GCTATTGCTT ATACTTTATC GGCTTTAAAA GGGAAGTCAA TTAAAAATAG AGGATATCAA
AAGTATATAT CTAGACTAAC AGAACCGAAA AGACAAGTCA GAAGACATAG TGAATTTTGG
GTAGGGCTAT ATGGACAAAG TTGGGTCTTA GCCTGGGATT TCTGTTACTT GTTTGTTGAA
CAAATTATGA GAATTAACCT TCACAAAATT AATGAATATA ACCGAGGTTT AAAAGCCTTA
TCTGCTATTA GTTAA
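
A GC content figure such as the one in the Gene Information block can be recomputed from the sequence text. The following is a minimal sketch, not the database's own method: `gc_content` is a hypothetical helper name, and it assumes the sequence contains only A, C, G and T plus the whitespace used to group bases into blocks of ten.

```python
def gc_content(sequence: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    # Drop the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# The first two 10-base blocks of the gene sequence, as a small example.
print(gc_content("ATGGATTTTT TGCCTTTCTA"))  # 30.0
```

Passing the full gene sequence instead of the two-block sample gives the value for the whole gene.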
 
Protein sequence
MDFLPFYQDY LQNALSKSKF LLLRILIWLL QVHKQVRIER LAAYLPLPIL YESRRKKIQR 
FLVEPCLSLV LLWFPLIKLI VEREFKPGSR LTLVLDRTQW QDKNVFMISV VWRKRAFPIY
WQILEKKGSS NVKEQIALIR PVLKLFADYE LLILGDREFH GVELSYWLKK RNRTAKNPIY
FAFRERKNVY IRRSKKNQKR FQDLTLTPGV KVFEKNIFIT KQKGFGRFNV LAYQKRKYRN
HQEEEPWFII TNLDNPSEVI KYYKIRGGIE AMFRDYKSGG YNLEGSKANI HRLTNLILLI
AIAYTLSALK GKSIKNRGYQ KYISRLTEPK RQVRRHSEFW VGLYGQSWVL AWDFCYLFVE
QIMRINLHKI NEYNRGLKAL SAIS
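
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below shows the idea in plain Python, under stated assumptions: `translate` and the table names are hypothetical, the amino acid assignments are those shared by the standard code and translation table 11, alternative start codons are not handled, and any base other than A, C, G or T raises a `KeyError`.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(sequence: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    # Drop the spaces and line breaks that separate the 10-base blocks.
    bases = "".join(sequence.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two 10-base blocks of the gene sequence.
print(translate("ATGGATTTTT TGCCTTTCTA TCAGGACTAT"))  # MDFLPFYQDY
print(translate("TCTGCTATTA GTTAA"))                  # SAIS
```

The two sample outputs match the first ten and last four residues of the protein sequence listed above.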