Gene PCC8801_4514 details

Gene Information

Locus tag: PCC8801_4514
Symbol:
ID: 7095895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011723
Strand:
Start bp: 7330
End bp: 8484
Gene Length: 1155 bp
Protein Length: 384 aa
Translation table: 11
GC content: 33%
IMG OID: 643467496
Product: transposase IS4 family protein
Protein accession: YP_002364792
Protein GI: 218203939
COG category:
COG ID:
TIGRFAM ID:
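
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(7330, 8484)
print(length, protein_length(length))
```

With the values in this record, the computed lengths agree with the listed 1155 bp and 384 aa.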


Plasmid Coverage information

Num covering plasmid clones: 69
Plasmid unclonability p-value: 0.810745
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 45
Fosmid unclonability p-value: 0.887559
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTTTT TGCCTTTCTA TCAGGACTAT TTACAAAACG CATTATCAAA AAGTAAATTT 
TTACTTTTAC GAATATTAAT ATGGCTTTTA CAAGTTCATA AACAAGTTAG AATAGAACGG
TTAGCGGCTT ATCTTCCTCT TCCTATTCTA TACGAAAGTC GTAGAAAGAA GATTCAAAGA
TTTTTAGTCG AACCGTGCTT AAGCCTTGTC TTATTATGGT TTCCTCTGAT AAAATTAATA
GTAGAACGAG AATTTAAACC AGGAAGTCGT TTAACTTTAG TTTTGGATAG GACTCAGTGG
CAGGATAAAA ATGTGTTCAT GATTAGTGTA GTTTGGAGAA AGAGAGCCTT CCCTATTTAC
TGGCAAATTC TAGAGAAAAA AGGAAGCAGC AACGTCAAAG AACAAATCGC TTTAATCCGA
CCGGTCTTGA AATTATTTGC CGACTATGAG TTATTAATTT TAGGGGATAG GGAGTTTCAT
GGGGTAGAAT TATCTTATTG GTTAAAGAAA CGAAACCGAA CGGCTAAAAA TCCCATCTAT
TTTGCTTTTC GAGAAAGGAA AAATGTCTAC ATTAGAAGAA GTAAGAAGAA TCAAAAACGC
TTTCAAGATT TAACCCTGAC CCCAGGAGTC AAAGTTTTTG AAAAAAACAT TTTTATCACC
AAGCAAAAAG GGTTTGGTCG CTTTAATGTA TTGGCTTATC AGAAGAGAAA ATATAGAAAC
CATCAGGAAG AAGAACCTTG GTTTATTATA ACCAATTTAG ATAACCCATC CGAAGTCATA
AAATATTATA AAATCAGAGG TGGAATTGAA GCTATGTTTC GAGATTATAA GAGTGGAGGA
TATAATCTCG AAGGGAGTAA AGCTAATATT CATCGACTTA CTAACTTGAT TTTATTAATA
GCTATTGCTT ATACTTTATC GGCTTTAAAA GGGAAGTCAA TTAAAAATAG AGGATATCAA
AAGTATATAT CTAGACTAAC AGAACCGAAA AGACAAGTCA GAAGACATAG TGAATTTTGG
GTAGGGCTAT ATGGACAAAG TTGGGTCTTA GCCTGGGATT TCTGTTACTT GTTTGTTGAA
CAAATTATGA GAATTAACCT TCACAAAATT AATGAATATA ACCGAGGTTT AAAAGCCTTA
TCTGCTATTA GTTAA
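
The gene sequence above is printed in space-separated groups of ten bases, so it needs to be joined before any length or composition check. The following is a minimal sketch of how the Gene Length and GC content fields could be recomputed from such a dump. The helper names are hypothetical, and the GC fraction here is simply (G + C) divided by the total number of bases, which may differ slightly from how the source database rounds its value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence dump and upper-case it."""
    return "".join(raw.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# Example with only the first two ten-base groups of the gene sequence.
fragment = clean_sequence("ATGGATTTTT TGCCTTTCTA")
print(len(fragment), round(gc_content(fragment) * 100))
```

Passing the full gene sequence through `clean_sequence` and then `len` and `gc_content` gives values that can be compared with the Gene Length and GC content fields.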
 
Protein sequence
MDFLPFYQDY LQNALSKSKF LLLRILIWLL QVHKQVRIER LAAYLPLPIL YESRRKKIQR 
FLVEPCLSLV LLWFPLIKLI VEREFKPGSR LTLVLDRTQW QDKNVFMISV VWRKRAFPIY
WQILEKKGSS NVKEQIALIR PVLKLFADYE LLILGDREFH GVELSYWLKK RNRTAKNPIY
FAFRERKNVY IRRSKKNQKR FQDLTLTPGV KVFEKNIFIT KQKGFGRFNV LAYQKRKYRN
HQEEEPWFII TNLDNPSEVI KYYKIRGGIE AMFRDYKSGG YNLEGSKANI HRLTNLILLI
AIAYTLSALK GKSIKNRGYQ KYISRLTEPK RQVRRHSEFW VGLYGQSWVL AWDFCYLFVE
QIMRINLHKI NEYNRGLKAL SAIS
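
The protein sequence can be related back to the gene sequence by codon-wise translation. The sketch below is one simple way to do that, not the database's own pipeline. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in which codons may act as start codons), so alternative start codons are not specially handled here. That is harmless for this gene, which begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # tolerate grouped, multi-line input
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base groups of the gene sequence.
print(translate("ATGGATTTTT TGCCTTTCTA TCAGGACTAT"))
```

Translating the first 30 bases of the gene gives the first ten residues of the listed protein, and translating the final 15 bases gives its last four residues followed by the stop codon.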