Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_4092 |
Symbol | |
ID | 7101887 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | - |
Start bp | 4286554 |
End bp | 4288800 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643477084 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_002374183 |
Protein GI | 218248812 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTCGTG AGCGTTGGAC AATCTTTAGT TTAGCTTATA TTACAGGATT ATTAGCGACA GGGGTTTGGG GGTTTCCTAA TCCTCATCCT ACCTTACAGC AATGGCTATT TGTGATAGTA ATTTTGGGGT TACTTCCTTT TATTGTTGCT TGGTTCCTTA AACAATTTTC GGTAAGATGT CCTTCGGCTA AATTTTGGTT AGGGGTTAGT TTAGTTGCTA TTTTAGGGGC AATCTATTTT CAGTTTCGGG TTCCTCAACC AAGCGTTAAT GATATTAGTC GAATTTTTTC TAAGAATACT TCTTATTATC AGTTCGTAAC GGTTTCAGGA AAAATTTTAT CAGAACCTAG ATTAACCTCT AATCAAGGAC AAAAATTTTG GCTAAAAGCA CAAAAAGTTG ACCTTAATTC CTCAGATCAT TCTGATGCTA AAAGGGTAAG CGGAAAGTTA TATGTTACCA TTCCTTTGAA GACAAAAAAT CAGCTTTATG CAGGTCAAAC AATAACTATT ACTGGAGGGT TATATAAACC GCGATCGCCT ACTAATCCAG GGGCTTTTGA TTTTCGCGCT TATTTAGCAA GTCAAGGAAC TTATGCGGGG TTAAAAGGAG AGAGAATTAT TCACTTAGGA AACGTCCCTT TTTGGGGATG GTGGAAGTTA CGACAGCCTA TTGTTAATAG TTTTACCCAA GGGTTAGGAT CTCCTGAAGG GTTACTCCTC AGTTCGATGG TTTTGGGACG ACAAGCGGTT GATTTATCGG TGGAAATTCG GGATTTATTT ATGAAGACTG GATTAGCTCA TGTCTTGGCT GCTTCAGGGT TTCATGTGGT TTTATTATTG GGAATTATTC TGCGATTAAC GCGAAATTTA TCTCCTAAAC AACAATTAAT TATCGGAATA AGTAGTTTAT TTTTTTATGG AGGTTTGACA GGACTACAAC CGAGTATTTG TCGGGCTATT TTAATGGGTA CTGCTGTTTT AATTGGACAA ACAGTACAAA GAAAAATTAT TATATTAAAC TCTTTATTGT TGGCAGCAAC TCTGTTATTA TTATGGCAAC CCTTATGGAT TTGGGATCTA GGATTTCAAT TTAGTTTTTT GTCTACCTTT GGCTTAATTG TAACTATTCC ACCGTTAATG AAGCGATTAG ATTGGCTTCC CCCCGCGATC GCCACTTTAA TTGCTGTTCC CTTAGCTGCA ACTATTTGGG TTTTACCCCT ATCTTTATAC AAATTTAGTA TCTTAGCAAC CTATAGTATT CCTACTAATA TTATTACCTC TCCTTTAATT ACAGCCATTA GTATAGGAGG AATGATCAGT AGTGCCATAG CATTTATATC TCCAATTCTG GGTAGTTTTG TTGTTAAAAT ACTTTATTAT CCTGTTTATT TTTTGATTAA ATTATTAGAA ATTATTGTTC ATTTGCCAGG TAGTTATTAT GCTGTAGGTA AGCTTTCATT AGGAGTTATG CTCCTCATTT ATGTTATTTT AGTTTTAATT TGGTTAAATC CCTTTTGGCA ACGCTATTGG AAATTGGCAG CATTGTTCAG TTTAGGATTA ATTATTCTGC CGATTATTTA TCAAAATTTA ACCCTAATCA AAGTCACTAT TTTAGAAGCG AAATCTAATC CAGTTGTCGT TATTCAAGAT CGCGGTAAAG TGAGTTTAAT TAATTTAGGA AATGAAGAAG CTATCAAATA TACTATCTTA CCATTTTTAT CCCAACAAGG CATTAATAAA ATTCATGCTG TTTTAGTCTT TGATTCCCAA AGTATAAGAG ATTGGTTAAT CGTTAATCCT TATATTACAG TGGATCTCTT TTTTCATAAC GTTGGGGAAA CTCAATCTAA TTCTCACCAA CTCTCTAGTG GACAAATTAT CAAATTAGGG TCAACTTTTA TTGAACTTGT AGCCAATCAA CCCCTTTTAG TCAGCTTCAA GCTTAGTAAC CAATCTTGGT TATGGATAAC TCAAAATATC CAAAAGCAAA CTTTGCCACA AAAAGCCTTA AACATACCTA ATTTAGCGGT ATTATGGTCA GGGAAAAGTG TTTCGTTAAA ACAGTTAATA AAACTTAATC CCAAAGTGGC GATCGCTAAC TCTTCTTATA TTCCCAAAAA AGTCCGTCAA GAATTAGAAA CTAGAAACAT AGATTTTTAC TGGACTCGTC AAGATGGGGC TATTCAATGG ACTCCAAAAA AAGGATTTAT AACGACAGGA ATTACGGGAG AACAAAATGA ATTTTAA
|
Protein sequence | MSRERWTIFS LAYITGLLAT GVWGFPNPHP TLQQWLFVIV ILGLLPFIVA WFLKQFSVRC PSAKFWLGVS LVAILGAIYF QFRVPQPSVN DISRIFSKNT SYYQFVTVSG KILSEPRLTS NQGQKFWLKA QKVDLNSSDH SDAKRVSGKL YVTIPLKTKN QLYAGQTITI TGGLYKPRSP TNPGAFDFRA YLASQGTYAG LKGERIIHLG NVPFWGWWKL RQPIVNSFTQ GLGSPEGLLL SSMVLGRQAV DLSVEIRDLF MKTGLAHVLA ASGFHVVLLL GIILRLTRNL SPKQQLIIGI SSLFFYGGLT GLQPSICRAI LMGTAVLIGQ TVQRKIIILN SLLLAATLLL LWQPLWIWDL GFQFSFLSTF GLIVTIPPLM KRLDWLPPAI ATLIAVPLAA TIWVLPLSLY KFSILATYSI PTNIITSPLI TAISIGGMIS SAIAFISPIL GSFVVKILYY PVYFLIKLLE IIVHLPGSYY AVGKLSLGVM LLIYVILVLI WLNPFWQRYW KLAALFSLGL IILPIIYQNL TLIKVTILEA KSNPVVVIQD RGKVSLINLG NEEAIKYTIL PFLSQQGINK IHAVLVFDSQ SIRDWLIVNP YITVDLFFHN VGETQSNSHQ LSSGQIIKLG STFIELVANQ PLLVSFKLSN QSWLWITQNI QKQTLPQKAL NIPNLAVLWS GKSVSLKQLI KLNPKVAIAN SSYIPKKVRQ ELETRNIDFY WTRQDGAIQW TPKKGFITTG ITGEQNEF
|
| |