Gene PCC8801_3762 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_3762 
Symbol 
ID7103984 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp3948221 
End bp3949252 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content45% 
IMG OID643476768 
Productpentapeptide repeat protein 
Protein accessionYP_002373869 
Protein GI218248498 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAACCC TTATCGAGAA TTTACTTATT GGCACGATGT CAAACCCAGG TTTGATCAAA 
ACAGCACCTA TGGACGCGCA GGAACTAATC TGGCTATATG GCCAGGGTCA ACGGGACTTT
AGTCGGCAAG ATCTCCAAAG CGAAGATATT ATTCAAGCTA TCCTCACGGA AGCGAATTTA
AGTCGTACTG CCTTAGATTG GGCTAACTTA AGTGGAACGG ATTTAAGTCG TGCGAACCTC
AATCGGGCTG ATTTAATCCA CGCTAAACTC ATTAGTGCCA AATTAGTTGG AGTCGATTTA
ACGGGGGCTG ATCTCAGTCA TGCTGATCTC AGTTGGGTGA ATCTCGAAGG CTCAACCCTG
ATCAGTGCTA ATTTAAGCAA TGCCAATCTT CGACAAACCA ATCTCACCAA TGCTGACTTG
AGAAGTGCCA ACCTCAGTGG AGCTAACCTC AGTGGAGCTA ACCTCAGTGG AGCAAAACTG
AGTCGCGCCG ATCTCAGTGA AGCCGATCTT AGTGGGGTTG ATCTCAGTGG GGCAAATTTG
AGCCGCGCCG ATCTCAGCGA AGCGGATCTG ATGGAAGTGG ATTTAAGTTA CAGCAACCTT
TATAAAGCCG ATCTCAGCGA AAGTAAACTC CGTAACAGCG ATCTAGAAGA GGCATTTCTC
CAAGGAGCCA ATTTTAGCCG TGCTAATTTG AAAGGAGCCG ATCTCTCCAG GGCAGTTTTA
AGGGAAAATA CCCTGAGCCT GCTGACCTTA TCGGAGTTTA ATGTTCAAAG TGTTAATCTC
TCTAATGAAA TTGACTTAAG TTCAGCTAAT CTGCGAGGAT GTAACCTGAG AGGGGCTATT
TTGCGTCATG CCAATTTAGG GTATGGGTTG CTCCACAAAA CCAATTTGAT TGATGCGATC
CTACGGGAAG CTAATATGAT TGATGCTTCG TTACGCGGAG GAGATTTGCG GGGAGCTAAA
TTCCGCAATA GTAATATTAA TGCTATTAAT TTAATGGAAG CGATTATGCC TGACGGCAGT
ATTCATCCCT AA
 
Protein sequence
MSTLIENLLI GTMSNPGLIK TAPMDAQELI WLYGQGQRDF SRQDLQSEDI IQAILTEANL 
SRTALDWANL SGTDLSRANL NRADLIHAKL ISAKLVGVDL TGADLSHADL SWVNLEGSTL
ISANLSNANL RQTNLTNADL RSANLSGANL SGANLSGAKL SRADLSEADL SGVDLSGANL
SRADLSEADL MEVDLSYSNL YKADLSESKL RNSDLEEAFL QGANFSRANL KGADLSRAVL
RENTLSLLTL SEFNVQSVNL SNEIDLSSAN LRGCNLRGAI LRHANLGYGL LHKTNLIDAI
LREANMIDAS LRGGDLRGAK FRNSNINAIN LMEAIMPDGS IHP