Gene Information |
Locus tag | PCC8801_4155 |
Symbol | |
ID | 7104559 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 4354700 |
End bp | 4357714 |
Gene Length | 3015 bp |
Protein Length | 1004 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643477144 |
Product | hypothetical protein |
Protein accession | YP_002374243 |
Protein GI | 218248872 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
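The Start bp, End bp, Gene Length, and Protein Length values above agree with one another if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. The following is a minimal Python sketch of that arithmetic. The function names are hypothetical, and the coordinate convention is an assumption inferred from the numbers, not something the record states.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for PCC8801_4155.
start_bp, end_bp = 4354700, 4357714

length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)

print(length_bp)  # 3015
print(length_aa)  # 1004
```

Both results match the Gene Length (3015 bp) and Protein Length (1004 aa) fields listed in the record.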
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGAGT CTCTACAACT ACGCTCAAAT CATTTAACCA AAATGTTCAA ACCCTTAACC AATCGACTGT TCCGAGGAAT GATTGTACTC CTAGGAATCA CATTAACCTT TGAGTTACTC TCTAACCTAG TGGTTGAAGG GTTATGGTTT GGCGAAGTTG GGTATTTTAG CGTTTTTTTG AAGCGGTTAT TGTGGAGATT AGCCTTATTA GGGTTAACTA GCAGTTTTTC TTTATGGTTT CTCTGGGGGA ATTTACGCCA AGCAGAGACT AATAAATGGC ATTCTATCCC AGAAATAGAG TCTAGCAAAG GGCGTAGACG ACGTGATCAC TCCCTCGGAA AATCGAAACC TACTACCCCG GAATCTCGTT CCCTGGGGTT ATCCTGGTTA ATGCCCCTGG TGGTTATTTT AGGGGGATTA ATCGGCCTAA TGTTATTGTA TTACAGCCAA GTCGTTTACA GTGCTTGGAC TCTAGATTTC GATCTGCCTA AAGTCACTCC TCCCCTTCCT TCTGCTTTTT ACTTGAATTC CTTACCCAAT CTTGTTGCTC AAATTATCAG CAATCTTTGG AAAGTGCCAC TAATTGTCCT TTTAATCGGT TTAATTGTGA CTCGGACTAA ATTTTGTTTG AGACTAATGG CGATCGCCTT TAGTATGATC TTTGGCCTAG TTTTGTCGGG AAACTGGGGG AGAATTGTTC AATATTTTAG CTCAACCCCT TTCTCAAAAG TTGATCCCCA ATTTAGCCGA GATATTGGTT TTTATGTTTT TGAAGTGCCT TTTTGGAAAC TGATCAATTT TTGGCTAGCG GGACTCTTTC TTTATGGGTT AATTGCTGTT AGTTTGATCT ATTTACTGTC AGCTAACAGT CTTTCTCAAG GAAAATTTCC GGGGTTTTCT CGCCAACAAC TACGCCATTT GTATTTGTTG GGAGGACTAA CGCTATCGAT GATCGGACTG TATCATTGGC TCAACCGTTA TGAATTATTA TATTCTCCCC GTGGGGTGGT TTATGGGGGA AGTTATACCG ATGTTCATGT GGTCTTACCC GTTGATACCT TATTATCAAT TGTGTCTAGC GTGATTGCCT TTTGGTTATT GTCTAAAGGA ATCATGGGAT GGAAAAAAAC TCAACCGCGA TCGCTTAAAA CTAAACCTTT ACCGCGTTTC CCCTTTTCTC CCCTGCCTTT TATTATTTAT TTAGGGATTT TACTCATCGG ATTAGTCGCT ACTGAAGTTG TCCAAAATGC TATCGTACAA CCCAATGAAC TTAGTCGAGA ACGTCCCTAT CTTGAACGAA ATATCGCCTT AACCCGTGCT GCTTTTGATT TAGATAAAAT TCGAGTAACA ACTCTCGATG GAAGTGGAAA AATAACCGCG AAAGATCTCC AAAATAATCA TCTGACTATC AATAATATCC GTCTTTGGGA TGCTCGTCCT TTACTAGAAA CTAACCGTCA ATTGCAACAA CTTCGCCTCT ATTATCGGTT TCCTGATGCC GATATTGATC GCTATAGTAT CCCCACAGAA AACCAAGATT CTTCTATTAC GATTGCCAAA CAGCAGGTCT TAATTGCCCC TAGGGAACTT GATTATAAAG AAGTCCCCCA ACAGGCTCAA ACTTGGGTCA ATCAACACTT AATTTATACC CACGGTTACG GGTTTACCTT ATCACCAGTG AATCGTGTGG GGCAAGGAGG ATTACCCTCT TATTTTGTCG AAAATATTGG GACAGCTACC CATGCAGGGG AATTACAAAC CTCAAGTGAT TTAATTCGTC AAAGTATTCC CATTGATAAC 
CCCCGTATCT ATTTTGGAGA ATTAACCAAT ACTTACATTA TGACCAATAC GGGAATCCAA GAATTAGACT ACCCCAGTGG GCAGGATAAT GTTTACAATG TTTACGATGG TCAAGGAGGG ATTGCAATCG GTTCTCCATG GCGAAGAGTG TTATTTGCTG AGTATCTCAA AGACTGGAGA ATCCTGTTTA CCCACAATAT TACCCCCGAA ACTCGTTTAT TGTTTCGCCG GGATATTAAT CGTCGGATTC GAGAAATTGC CCCATTTCTG CGCTTTGATC GAGATCCCTA TTTAGTAACA GCAAAAGTTC AGTCATCTAA AGAAAAAAAT CCAGGGAGTC TCTACTGGAT GATTGATGCC TATACCACCA GCGATAGTTA TCCCTATTCT GATGCAGGTA ATCGCAATTT TAATTATATT CGTAATTCGG TTAAAATTGT CATTGATGCT TACAATGGTG ATGTACAGTT TTATATTGTT GATCCCAATG ATCCCCTCAT TCAAACTTGG CAAAATATTT TCCCAGAATT ATTTAAACCC CTAGAGGCGA TGCCAAACAG TCTTAAAGAG CACATTCGCT ACCCTAAAGA TTTATTTCAA ACCCAAGCGG AACGGCTCTT AAGCTATCAC ATGACTGATC CCCAAGTATT TTATAATCGA GAAGATCAAT GGCGTGTTCC CCAAGAAATT TATGGAGAAA AACAACAACC CATTGAGCCC TATTATCTCT TGATGAGTGT CACTGACAAG GCTCAAGAAT TTATTTTAGT GAACTTTTTT ACCCCCACCA GTCGTAACAA TTTAATTGCT GGATTATTTG CCCGTTCTGA TGATCCCAAT TATGGAAAGC TTGATTTAAT TCGATTACCT AAACAGCTCG TGATCTACGG ACCCGAACAA ATCGAAGCAT TAATTAATCA AGATCCCGTT ATTTCTCAAC AAGTTTCCCT TTGGAATCGT CAAGGATCTC GTGTGATTCA GGGGAATTTA TTAGTCATTC CTTTTCTCAA AGAACAATCG CTGCTTTATG TGGAACCACT CTATTTAGAA GCTGAACAAA ATAGTTTACC AACCTTAGTC AGAGTCATTG TTGTTTATCA AAATCAAATT GTTATGGCCG AAACCCTAGA CGGGGCACTG AAATCGATTT TTCAATCGGA GTCATCTCCC CCTGAAACAA TTATTCGCCA GGTAGAACCA GACTTCAATA GTTAA
|
Protein sequence | MKESLQLRSN HLTKMFKPLT NRLFRGMIVL LGITLTFELL SNLVVEGLWF GEVGYFSVFL KRLLWRLALL GLTSSFSLWF LWGNLRQAET NKWHSIPEIE SSKGRRRRDH SLGKSKPTTP ESRSLGLSWL MPLVVILGGL IGLMLLYYSQ VVYSAWTLDF DLPKVTPPLP SAFYLNSLPN LVAQIISNLW KVPLIVLLIG LIVTRTKFCL RLMAIAFSMI FGLVLSGNWG RIVQYFSSTP FSKVDPQFSR DIGFYVFEVP FWKLINFWLA GLFLYGLIAV SLIYLLSANS LSQGKFPGFS RQQLRHLYLL GGLTLSMIGL YHWLNRYELL YSPRGVVYGG SYTDVHVVLP VDTLLSIVSS VIAFWLLSKG IMGWKKTQPR SLKTKPLPRF PFSPLPFIIY LGILLIGLVA TEVVQNAIVQ PNELSRERPY LERNIALTRA AFDLDKIRVT TLDGSGKITA KDLQNNHLTI NNIRLWDARP LLETNRQLQQ LRLYYRFPDA DIDRYSIPTE NQDSSITIAK QQVLIAPREL DYKEVPQQAQ TWVNQHLIYT HGYGFTLSPV NRVGQGGLPS YFVENIGTAT HAGELQTSSD LIRQSIPIDN PRIYFGELTN TYIMTNTGIQ ELDYPSGQDN VYNVYDGQGG IAIGSPWRRV LFAEYLKDWR ILFTHNITPE TRLLFRRDIN RRIREIAPFL RFDRDPYLVT AKVQSSKEKN PGSLYWMIDA YTTSDSYPYS DAGNRNFNYI RNSVKIVIDA YNGDVQFYIV DPNDPLIQTW QNIFPELFKP LEAMPNSLKE HIRYPKDLFQ TQAERLLSYH MTDPQVFYNR EDQWRVPQEI YGEKQQPIEP YYLLMSVTDK AQEFILVNFF TPTSRNNLIA GLFARSDDPN YGKLDLIRLP KQLVIYGPEQ IEALINQDPV ISQQVSLWNR QGSRVIQGNL LVIPFLKEQS LLYVEPLYLE AEQNSLPTLV RVIVVYQNQI VMAETLDGAL KSIFQSESSP PETIIRQVEP DFNS
|
| |
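The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before any computation. The sketch below, with hypothetical helper names, shows one way to do that, compute a GC percentage, and check that a gene sequence ends in a stop codon. It assumes the three stop codons TAA, TAG, and TGA for translation table 11. It is applied only to the first two and last two blocks of the gene sequence, so the GC figure it prints describes that short fragment and is not expected to equal the 40% reported for the whole gene.

```python
# Stop codons assumed for translation table 11 (same as the standard code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the cleaned sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases of the cleaned sequence form a stop codon."""
    return clean(seq)[-3:] in STOP_CODONS


# First two and last two blocks of the gene sequence, copied from the record.
first_blocks = "ATGAAAGAGT CTCTACAACT"
last_blocks = "GACTTCAATA GTTAA"

print(gc_percent(first_blocks))     # 35.0
print(ends_with_stop(last_blocks))  # True
```

Passing the complete gene sequence to `gc_percent` would give the figure to compare against the GC content field.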