Gene PCC8801_4155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_4155 
Symbol 
ID7104559 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp4354700 
End bp4357714 
Gene Length3015 bp 
Protein Length1004 aa 
Translation table11 
GC content40% 
IMG OID643477144 
Producthypothetical protein 
Protein accessionYP_002374243 
Protein GI218248872 
COG category[S] Function unknown 
COG ID[COG1615] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGAGT CTCTACAACT ACGCTCAAAT CATTTAACCA AAATGTTCAA ACCCTTAACC 
AATCGACTGT TCCGAGGAAT GATTGTACTC CTAGGAATCA CATTAACCTT TGAGTTACTC
TCTAACCTAG TGGTTGAAGG GTTATGGTTT GGCGAAGTTG GGTATTTTAG CGTTTTTTTG
AAGCGGTTAT TGTGGAGATT AGCCTTATTA GGGTTAACTA GCAGTTTTTC TTTATGGTTT
CTCTGGGGGA ATTTACGCCA AGCAGAGACT AATAAATGGC ATTCTATCCC AGAAATAGAG
TCTAGCAAAG GGCGTAGACG ACGTGATCAC TCCCTCGGAA AATCGAAACC TACTACCCCG
GAATCTCGTT CCCTGGGGTT ATCCTGGTTA ATGCCCCTGG TGGTTATTTT AGGGGGATTA
ATCGGCCTAA TGTTATTGTA TTACAGCCAA GTCGTTTACA GTGCTTGGAC TCTAGATTTC
GATCTGCCTA AAGTCACTCC TCCCCTTCCT TCTGCTTTTT ACTTGAATTC CTTACCCAAT
CTTGTTGCTC AAATTATCAG CAATCTTTGG AAAGTGCCAC TAATTGTCCT TTTAATCGGT
TTAATTGTGA CTCGGACTAA ATTTTGTTTG AGACTAATGG CGATCGCCTT TAGTATGATC
TTTGGCCTAG TTTTGTCGGG AAACTGGGGG AGAATTGTTC AATATTTTAG CTCAACCCCT
TTCTCAAAAG TTGATCCCCA ATTTAGCCGA GATATTGGTT TTTATGTTTT TGAAGTGCCT
TTTTGGAAAC TGATCAATTT TTGGCTAGCG GGACTCTTTC TTTATGGGTT AATTGCTGTT
AGTTTGATCT ATTTACTGTC AGCTAACAGT CTTTCTCAAG GAAAATTTCC GGGGTTTTCT
CGCCAACAAC TACGCCATTT GTATTTGTTG GGAGGACTAA CGCTATCGAT GATCGGACTG
TATCATTGGC TCAACCGTTA TGAATTATTA TATTCTCCCC GTGGGGTGGT TTATGGGGGA
AGTTATACCG ATGTTCATGT GGTCTTACCC GTTGATACCT TATTATCAAT TGTGTCTAGC
GTGATTGCCT TTTGGTTATT GTCTAAAGGA ATCATGGGAT GGAAAAAAAC TCAACCGCGA
TCGCTTAAAA CTAAACCTTT ACCGCGTTTC CCCTTTTCTC CCCTGCCTTT TATTATTTAT
TTAGGGATTT TACTCATCGG ATTAGTCGCT ACTGAAGTTG TCCAAAATGC TATCGTACAA
CCCAATGAAC TTAGTCGAGA ACGTCCCTAT CTTGAACGAA ATATCGCCTT AACCCGTGCT
GCTTTTGATT TAGATAAAAT TCGAGTAACA ACTCTCGATG GAAGTGGAAA AATAACCGCG
AAAGATCTCC AAAATAATCA TCTGACTATC AATAATATCC GTCTTTGGGA TGCTCGTCCT
TTACTAGAAA CTAACCGTCA ATTGCAACAA CTTCGCCTCT ATTATCGGTT TCCTGATGCC
GATATTGATC GCTATAGTAT CCCCACAGAA AACCAAGATT CTTCTATTAC GATTGCCAAA
CAGCAGGTCT TAATTGCCCC TAGGGAACTT GATTATAAAG AAGTCCCCCA ACAGGCTCAA
ACTTGGGTCA ATCAACACTT AATTTATACC CACGGTTACG GGTTTACCTT ATCACCAGTG
AATCGTGTGG GGCAAGGAGG ATTACCCTCT TATTTTGTCG AAAATATTGG GACAGCTACC
CATGCAGGGG AATTACAAAC CTCAAGTGAT TTAATTCGTC AAAGTATTCC CATTGATAAC
CCCCGTATCT ATTTTGGAGA ATTAACCAAT ACTTACATTA TGACCAATAC GGGAATCCAA
GAATTAGACT ACCCCAGTGG GCAGGATAAT GTTTACAATG TTTACGATGG TCAAGGAGGG
ATTGCAATCG GTTCTCCATG GCGAAGAGTG TTATTTGCTG AGTATCTCAA AGACTGGAGA
ATCCTGTTTA CCCACAATAT TACCCCCGAA ACTCGTTTAT TGTTTCGCCG GGATATTAAT
CGTCGGATTC GAGAAATTGC CCCATTTCTG CGCTTTGATC GAGATCCCTA TTTAGTAACA
GCAAAAGTTC AGTCATCTAA AGAAAAAAAT CCAGGGAGTC TCTACTGGAT GATTGATGCC
TATACCACCA GCGATAGTTA TCCCTATTCT GATGCAGGTA ATCGCAATTT TAATTATATT
CGTAATTCGG TTAAAATTGT CATTGATGCT TACAATGGTG ATGTACAGTT TTATATTGTT
GATCCCAATG ATCCCCTCAT TCAAACTTGG CAAAATATTT TCCCAGAATT ATTTAAACCC
CTAGAGGCGA TGCCAAACAG TCTTAAAGAG CACATTCGCT ACCCTAAAGA TTTATTTCAA
ACCCAAGCGG AACGGCTCTT AAGCTATCAC ATGACTGATC CCCAAGTATT TTATAATCGA
GAAGATCAAT GGCGTGTTCC CCAAGAAATT TATGGAGAAA AACAACAACC CATTGAGCCC
TATTATCTCT TGATGAGTGT CACTGACAAG GCTCAAGAAT TTATTTTAGT GAACTTTTTT
ACCCCCACCA GTCGTAACAA TTTAATTGCT GGATTATTTG CCCGTTCTGA TGATCCCAAT
TATGGAAAGC TTGATTTAAT TCGATTACCT AAACAGCTCG TGATCTACGG ACCCGAACAA
ATCGAAGCAT TAATTAATCA AGATCCCGTT ATTTCTCAAC AAGTTTCCCT TTGGAATCGT
CAAGGATCTC GTGTGATTCA GGGGAATTTA TTAGTCATTC CTTTTCTCAA AGAACAATCG
CTGCTTTATG TGGAACCACT CTATTTAGAA GCTGAACAAA ATAGTTTACC AACCTTAGTC
AGAGTCATTG TTGTTTATCA AAATCAAATT GTTATGGCCG AAACCCTAGA CGGGGCACTG
AAATCGATTT TTCAATCGGA GTCATCTCCC CCTGAAACAA TTATTCGCCA GGTAGAACCA
GACTTCAATA GTTAA
 
Protein sequence
MKESLQLRSN HLTKMFKPLT NRLFRGMIVL LGITLTFELL SNLVVEGLWF GEVGYFSVFL 
KRLLWRLALL GLTSSFSLWF LWGNLRQAET NKWHSIPEIE SSKGRRRRDH SLGKSKPTTP
ESRSLGLSWL MPLVVILGGL IGLMLLYYSQ VVYSAWTLDF DLPKVTPPLP SAFYLNSLPN
LVAQIISNLW KVPLIVLLIG LIVTRTKFCL RLMAIAFSMI FGLVLSGNWG RIVQYFSSTP
FSKVDPQFSR DIGFYVFEVP FWKLINFWLA GLFLYGLIAV SLIYLLSANS LSQGKFPGFS
RQQLRHLYLL GGLTLSMIGL YHWLNRYELL YSPRGVVYGG SYTDVHVVLP VDTLLSIVSS
VIAFWLLSKG IMGWKKTQPR SLKTKPLPRF PFSPLPFIIY LGILLIGLVA TEVVQNAIVQ
PNELSRERPY LERNIALTRA AFDLDKIRVT TLDGSGKITA KDLQNNHLTI NNIRLWDARP
LLETNRQLQQ LRLYYRFPDA DIDRYSIPTE NQDSSITIAK QQVLIAPREL DYKEVPQQAQ
TWVNQHLIYT HGYGFTLSPV NRVGQGGLPS YFVENIGTAT HAGELQTSSD LIRQSIPIDN
PRIYFGELTN TYIMTNTGIQ ELDYPSGQDN VYNVYDGQGG IAIGSPWRRV LFAEYLKDWR
ILFTHNITPE TRLLFRRDIN RRIREIAPFL RFDRDPYLVT AKVQSSKEKN PGSLYWMIDA
YTTSDSYPYS DAGNRNFNYI RNSVKIVIDA YNGDVQFYIV DPNDPLIQTW QNIFPELFKP
LEAMPNSLKE HIRYPKDLFQ TQAERLLSYH MTDPQVFYNR EDQWRVPQEI YGEKQQPIEP
YYLLMSVTDK AQEFILVNFF TPTSRNNLIA GLFARSDDPN YGKLDLIRLP KQLVIYGPEQ
IEALINQDPV ISQQVSLWNR QGSRVIQGNL LVIPFLKEQS LLYVEPLYLE AEQNSLPTLV
RVIVVYQNQI VMAETLDGAL KSIFQSESSP PETIIRQVEP DFNS