Gene PCC8801_4170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4170 
Symbol 
ID7104572 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4372824 
End bp4375055 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content48% 
IMG OID643477157 
Producttype II and III secretion system protein 
Protein accessionYP_002374256 
Protein GI218248885 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4796] Type II secretory pathway, component HofQ 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACAACT ATCGTCATCC CCTGGCTATG AGTGGAGCTA TTTCTTTAAT GCTCCTGATG 
AGTTATCCTG CAATAGCCGT TAATCCTACA GCAACGGAAA ACCCTTTCGA GGAATCTACC
TCAGAGGAGA AACTTTTTTC CAAGGCGGTC GCCATTAACC CCCGTAGCGA ATGGAAATTT
GACCCTCAAA TTGCCCCTTC AAAGGTTAAA GAAGATATTT TTACAGAAAC TTCCCTAGAA
ATTGCCCAAA TCCCTAATTA TTACCCCAAT ACCCCCCCAT CCCCTGCCCC TGCCCCATCG
ACCAATTTTT ACCCCAATAA CCCTAGCCAA CCTTCTGGGT TTTACCCCAA TACCCCCAAC
CCCATCCCCC CATCCCGGTC TTCTGATATC ATGATCCCCA ACCCAGAAGT CCTCATCAAA
TCCAATGGGT CATCTATTCC CGATAGCCTA CAACCCACCA TGCCCATGGC TCCTACCTTA
CCTCGTGCTG TTGCGCCTCC GGTGGGAGAT ATTGCCATCT CTAACATCAA TGCAGCCACT
GATGGGATTG ATTTAGGGAC TTCCATCATC GTTCCCCGTC TAGTTCTGCG TCAGGCTCCA
GCCAGGGAAG TTTTAGCAGT TTTAGCCCGT TATGCCGGAA TGAACTTGGT TTTCACCGAT
GATACCAGAG CAGCCGCTGC CGCTACCGCC GGCCAACCGC CAGGAGCCCA AAGCGCGGGA
GAAGTCACCG TTTCCCTAGA TCTCGAAAAT GAACCGGTCG AGCAAGTCTT TAACTCGGTG
TTGATGATTT CTGGTTTACA AGCTAACCGT CGGGGACGGA CAATTTTTGT CGGTTCTCAA
CTCCCCCAAG CTGCCCAAAA CCTCATCAGT CGGACGATTC GCCTTAACCA AGTTAAATCC
GCCAGTGCAG GAACCTTTTT AGCCTCTCAA GGCGCAGAAT TTCAACGCTT AGTCACCGTT
CGAGAAGATA TTATCGATCC CTTAACCCAA CGAGTTATCG GACAACGCGA ACTCCCTCCA
GAATTAGATC CCCTAACCGC CCAACGCACC GAAGGCTCCA ATAGTCCCCA GTTACTAACC
GGGTTAGCCG TGTCTTCTGA TGATCGCCTG AATACCATTA CCCTAGTGGG AGAACCCCGT
CAGGTTCAGG TAGCCACGTC TTTATTGACC CAATTGGATG CCCGTCGTCG TCAAGTCGCT
GTTAACGTTA AGGTAGTCGA TATTGCCTTA AACAACGATC AATCCTTTAG CAGTAGTTTT
TCCTTTGGTG TTAATGACAG TTTCTTCGTT CAAGATGAAG GAGCAGCCTT TTTGCGGTTT
GGGGGACCAT CTCCTATTGA TAGCGCAGAA TTTAATAGTG CCACGGGACG ACTGGGGGTT
CCTCCGGCCA TTCCTAACCC TTTTGCTGAA GGTGGCAATA TCTTCTTAAA CCAAGGCTCA
TTCCAGTTTC CTGTGTTAAC GGATGGCGGA TTTCCTCAAA TTCCTGGGGC ATCTCTCGGT
GTTTCTAATG ATTTCAGGGC TGTAGGCGTA AGTCCTCCAG AGAATGCTGA TGACCCCTTT
GAGTATCAAC TGCCTTCCTT TTTTGAATTT CCTCGCAAGT TCCTCTCCCA AATTGAAGCG
ACGATTCAAA GTAGCAATGG TAAGGTGCTA ACTGATCCAA CTCTAGTGGT GCAAGAAGGA
CAGCAAGCAA CGGTAAAATT AACCCAAAAA GTCATTGAAA ATATCACAAC CCAGGTTGAT
CCCCTTAGTG GTGTCAGAAC CACAACCCCG GTTTTATCGG ATGCGGGGTT AACTTTGACC
ATTGATATTG ATAAAATTGA CGACAACGGA TTTATCAGTT TAACCGTTAG TCCGACCATT
GCTGCTATTG GAGATGTTCA ACCCTTTGAC AGTGGGGCTG ATGGCGGGTT CAACCAATTA
TTTCTCTTAG CTCGACGGGA ACTGACTTCT GGGTTAATTC GCCTTCGCGA TGGTCAAACT
CTGATTCTCT CAGGGATTAT TAGTGAAACA GACCAGACAA TTACCAATAA AGTCCCCATT
TTAGGGGATA TTCCCCTTTT AGGAGCATTA TTCCGCAGTC AAACGGATAC GAAAAACCGT
TCAGAAGTGA TCGTACTACT GACTCCCCAG ATTCTCCATG ACAACGGACA ATGGGGCTAT
AACTACACCC CTGGACGCGC TTCGGCTGAA GTCTTAAGGC AACAAGGCTT CCCGGTTCAA
ATCGTTCCTT AA
 
Protein sequence
MNNYRHPLAM SGAISLMLLM SYPAIAVNPT ATENPFEEST SEEKLFSKAV AINPRSEWKF 
DPQIAPSKVK EDIFTETSLE IAQIPNYYPN TPPSPAPAPS TNFYPNNPSQ PSGFYPNTPN
PIPPSRSSDI MIPNPEVLIK SNGSSIPDSL QPTMPMAPTL PRAVAPPVGD IAISNINAAT
DGIDLGTSII VPRLVLRQAP AREVLAVLAR YAGMNLVFTD DTRAAAAATA GQPPGAQSAG
EVTVSLDLEN EPVEQVFNSV LMISGLQANR RGRTIFVGSQ LPQAAQNLIS RTIRLNQVKS
ASAGTFLASQ GAEFQRLVTV REDIIDPLTQ RVIGQRELPP ELDPLTAQRT EGSNSPQLLT
GLAVSSDDRL NTITLVGEPR QVQVATSLLT QLDARRRQVA VNVKVVDIAL NNDQSFSSSF
SFGVNDSFFV QDEGAAFLRF GGPSPIDSAE FNSATGRLGV PPAIPNPFAE GGNIFLNQGS
FQFPVLTDGG FPQIPGASLG VSNDFRAVGV SPPENADDPF EYQLPSFFEF PRKFLSQIEA
TIQSSNGKVL TDPTLVVQEG QQATVKLTQK VIENITTQVD PLSGVRTTTP VLSDAGLTLT
IDIDKIDDNG FISLTVSPTI AAIGDVQPFD SGADGGFNQL FLLARRELTS GLIRLRDGQT
LILSGIISET DQTITNKVPI LGDIPLLGAL FRSQTDTKNR SEVIVLLTPQ ILHDNGQWGY
NYTPGRASAE VLRQQGFPVQ IVP