Gene PCC8801_4239 details

Gene Information

Locus tag: PCC8801_4239
Symbol:
ID: 7103792
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4449188
End bp: 4450279
Gene Length: 1092 bp
Protein Length: 363 aa
Translation table: 11
GC content: 37%
IMG OID: 643477220
Product: Tetratricopeptide TPR_2 repeat protein
Protein accession: YP_002374319
Protein GI: 218248948
COG category: [U] Intracellular trafficking, secretion, and vesicular transport; [N] Cell motility
COG ID: [COG3063] Tfp pilus assembly protein PilF
TIGRFAM ID:
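
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is an untranslated stop; neither assumption is stated on the page, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 4449188
end_bp = 4450279
gene_length_bp = 1092
protein_length_aa = 363

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

coordinates_match = span_bp == gene_length_bp
in_frame = gene_length_bp % 3 == 0
protein_matches = expected_protein_aa == protein_length_aa

print(coordinates_match, in_frame, protein_matches)
```

Under these assumptions all three checks agree with the table: the span is 1092 bp, which is 364 codons, leaving 363 residues once the stop codon is excluded.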


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTTCGCC AAACAACCCT ACCCTGGTTA GTAAGTCTAT TCGTAATGGG TTTAACTTTA 
CCCGCGAATG CTCAACTTCA ACCTCTATTA ATTTTAGCGC AACAATCAAC TGACTCAGAG
GAACTCAAAG AATTGTTGCG TTTAGGTCGA GAATATGTTG ATCTTAAAGA CTATAATAGC
GCGATCGTAA CCTATGAGAA GGCAGCTATT CTTGATGGTA ATAATGCTAA AATTTTCTCA
GGAATCGGTT ATTTGTACGC CCAAAAAGGG AACTTTAGAC AAGCCGTTAA GGCCTATCAA
CAAGCCGTTA CTCTTGATCC TAATAATGCT GATTTTTATT ACGCTCTAGG GTTTAGTTTA
GCGAATATAG GAGATAATGA AAATGCCGCT TCTGCTTATT ATTATGCGAT TCAACTTGCT
CCACGAGTGA CGAAAAATTA TATTGGATTA GGGGTGGTTT TATTACGTCA AAATGATTAT
CAAGGAGCAG CAGAAGCTTA TAAACGAGTG ATTGCCCTTG ATCCCAATAA TTCAGAAGCT
TTTGCTATTA TGGGTTCTTC TTTGATTCAA CAAAAAGAAC TTGATAAAGC CATTCAATAT
CTCAATAATG CGGTTAAAAG ATTTCCTAAT GATCTGGAGT TAAGATTATT ATTAGCAACG
GCTTTTTTAG AACAAGATAA TAACGAACTC GCCTTTAATC AGTTAAAGAG TGCTGAAAGA
ATTAGCCCGG GAAATCCTAA AGTTCAGTTG AAAATTGGCC GCATTTTAGA ACAACAAAAC
AAGTTGGATG ACGCGCTTAA AACCTATCAA CGGATTACTT ATTTATCCCC TAGTTCAACG
GAAGCGCGTG CGGGAGTTGG TAGAATACAA CTAGCTACTA AAGATTATCT AGGTGCAGTT
ATCACTTATC GAGAATTAGC GTCAATGCTT CCTGAAACTC CTGAACCTTA CTATTATTTG
GGATTAGCTT ATAAGGAGCG GGGACGAAAA AAAGAAGCGA CTAAAGCGTT AGAACAAGCA
CGTCAATTGT ATCAAAAACA AGACAATAAT AAGGGCATTG AGGAAGTTGA TAAATTACTT
AAACAATTGT AG
 
Protein sequence
MVRQTTLPWL VSLFVMGLTL PANAQLQPLL ILAQQSTDSE ELKELLRLGR EYVDLKDYNS 
AIVTYEKAAI LDGNNAKIFS GIGYLYAQKG NFRQAVKAYQ QAVTLDPNNA DFYYALGFSL
ANIGDNENAA SAYYYAIQLA PRVTKNYIGL GVVLLRQNDY QGAAEAYKRV IALDPNNSEA
FAIMGSSLIQ QKELDKAIQY LNNAVKRFPN DLELRLLLAT AFLEQDNNEL AFNQLKSAER
ISPGNPKVQL KIGRILEQQN KLDDALKTYQ RITYLSPSST EARAGVGRIQ LATKDYLGAV
ITYRELASML PETPEPYYYL GLAYKERGRK KEATKALEQA RQLYQKQDNN KGIEEVDKLL
KQL
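
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first and last lines of the gene sequence. This is a rough sketch rather than the database's own procedure: it relies on the fact that translation table 11 assigns codons to amino acids in the same way as the standard genetic code (the two differ in permitted start codons), and the helper names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (first base varies slowest, order T, C, A, G).
# Table 11 uses the same codon-to-amino-acid mapping as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace between 10-base blocks is ignored. Ambiguous bases such as N
    are not handled and would raise a KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence; both start in frame because
# every full line holds 60 bases, a multiple of three.
first_line = "ATGGTTCGCC AAACAACCCT ACCCTGGTTA GTAAGTCTAT TCGTAATGGG TTTAACTTTA"
last_line = "AAACAATTGT AG"

print(translate(first_line))  # MVRQTTLPWLVSLFVMGLTL
print(translate(last_line))   # KQL
```

The two results correspond to the first 20 residues and the final 3 residues of the protein sequence shown above, with the terminal TAG read as the stop codon.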