Gene Cyan8802_4301 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_4301 
Symbol 
ID8393653 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4440648 
End bp4441739 
Gene Length1092 bp 
Protein Length363 aa 
Translation table11 
GC content37% 
IMG OID644982211 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_003139922 
Protein GI257062034 
COG category[N] Cell motility
[R] General function prediction only
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF
[COG4783] Putative Zn-dependent protease, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.124533 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTCGCC AAACAACCCT ACCCTGGTTA GTAAGTGTAT TCGTAATGGG TTTAACTTTA 
CCCGCGAATG CTCAACTTCA ACCTCCATTA ATTTTAGCGC AACAATCAAC TGACTCAGAG
GAACTCAAAG AATTGTTGCG TTTAGGTCGA GAATATGTTG ATCTTAAAGA CTATAATAGC
GCGATCGTAA CCTATGAGAA GGCAGCTATT CTTGATGGCA ATAATGCTAA AATTTTCTCA
GGAATCGGTT ATTTGTACGC CCAAAAAGGG AACTTTAGAC AAGCCGTTAA GGCCTATCAA
CAAGCCGTTA CTCTTGATCC TAATAATGCT GATTTTTATT ACGCTCTAGG GTTTAGTTTA
GCGAATATAG GAGATAATGA AAATGCGGCT TCTGCTTATT ATTATGCGAT TCAACTTGCT
CCACGAGTGA CGAAAAATTA TATTGGATTA GGGGTAGTTT TATTACGTCA AAATGATTAT
CAAGGAGCAG CAGAAGCTTA TAAACGAGTG ATTGCCCTTG ATCCCAATAA TTCAGAAGCT
TTTGCTATTA TGGGTTCTTC TTTGATTCAA CAAAAAGAAC TTGATAAAGC CATTCAATAT
CTCAATAATG CGGTTAAAAG ATTTCCTAAT GATCTGGAGT TAAGATTATT ATTAGCAACG
GCTTTTTTAG AACAAGATAA TAACGAACTC GCCTTTAATC AGTTAAAGAG TGCTGAAAGA
ATTAGCCCGG GAAATCCTAA AGTTCAGTTG AAAATTGGCC GCATTTTAGA ACAACAAAAC
AAGTTGGATG ACGCGCTTAA AACCTATCAA CGGATTACTT ATTTATCCCC TAGTTCAACG
GAAGCGCGTG CGGGAGTTGG TAGAATACAA CTAGCTACTA AAGATTATCT AGGTGCAGTT
ATCACTTATC GAGAATTAGC GTCAATGCTT CCTGAAACTC CTGAACCTTA CTATTATTTG
GGATTAGCTT ATAAGGAGCG GGGACGAAAA AAAGAAGCGA CTAAAGCGTT AGAACAAGCA
CGTCAATTGT ATCAAAAACA AGACAATAAT AAGGGCATTG AGGAAGTTGA TAAATTACTT
AAACAATTGT AG
 
Protein sequence
MVRQTTLPWL VSVFVMGLTL PANAQLQPPL ILAQQSTDSE ELKELLRLGR EYVDLKDYNS 
AIVTYEKAAI LDGNNAKIFS GIGYLYAQKG NFRQAVKAYQ QAVTLDPNNA DFYYALGFSL
ANIGDNENAA SAYYYAIQLA PRVTKNYIGL GVVLLRQNDY QGAAEAYKRV IALDPNNSEA
FAIMGSSLIQ QKELDKAIQY LNNAVKRFPN DLELRLLLAT AFLEQDNNEL AFNQLKSAER
ISPGNPKVQL KIGRILEQQN KLDDALKTYQ RITYLSPSST EARAGVGRIQ LATKDYLGAV
ITYRELASML PETPEPYYYL GLAYKERGRK KEATKALEQA RQLYQKQDNN KGIEEVDKLL
KQL