Gene Information |
Locus tag | Cyan8802_4131 |
Symbol | |
ID | 8393482 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 4257000 |
End bp | 4259246 |
Gene Length | 2247 bp |
Protein Length | 748 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644982047 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_003139759 |
Protein GI | 257061871 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
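The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: the function name `cds_lengths` is hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon
    (both are assumptions that happen to match the table values).
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    protein_length = codons - 1  # drop the stop codon
    return gene_length, protein_length


# Values copied from the Start bp and End bp rows of the table.
gene_bp, protein_aa = cds_lengths(4257000, 4259246)
print(gene_bp, protein_aa)  # 2247 748
```

The strand field (-) does not change these lengths; it only indicates that the gene is read from the reverse complement of the replicon sequence.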
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCGTG AGCGTTGGAC AATCTTTAGT TTAGCTTATA TTACAGGATT ATTAGCGACA GGGGTTTGGG GGTTTCCTAA TCCTCATCCT ACCTTACAGC AATGGCTATT TGTGATAGTA ATTTTGGGGT TACTTCCTTT TATTGTTGCT TGGTTCCTTA AACAATTTTC GGTAAGATGT CCTTCGGCTA AATTTTGGTT AGGGGTTAGT TTAGTTGCTA TTTTAGGGGC AATCTATTTT CAGTTTCGGG TTCCTCAACC AAGCGTTAAT GATATTAGTC GAATTTTTTC TAAGAATACT TCTTATTATC AGTTCGTAAC GGTTTCAGGA AAAATTTTAT CAGAACCTAG ATTAACCTCT GATCAAGGAC AAAAATTTTG GCTAAAAGCA CAAAAAGTTG ACCTTAATTC CTCAGATCAT TCTGATGCTA AAAGGGTGAG CGGAAAGTTA TATGTTACCA TTCCTTTGAA GACAAAAAAT CAGCTTTATC CAGGTCAAAC AATAACTATT ACTGGAGGGT TATATAAACC GCGATCGCCT ACTAATCCAG GGGCTTTTGA TTTTCGCGCT TATTTAGCAA GTCAAGGAAC TTATGCGGGG TTAAAAGGAG AGAGAATTAT TCACTTAGGA AACGTCCCTT TTTGGGGATG GTGGAAGTTA CGACAGCCTA TTGTTAATAG TTTTACCCAA GGGTTAGGAT CTCCTGAAGG GTTACTCCTC AGTTCGATGG TTTTGGGACG ACAAGCGGTT GATTTATCGG TGGAAATTCG GGATTTATTT ATGAAGACTG GATTAGCTCA TGTCTTGGCT GCTTCAGGGT TTCATGTGGT TTTATTATTG GGAATTATTC TGCGATTAAC GCGAAATTTA TCTCCTAAAC AACAATTAAT TATCGGAATA AGTAGTTTAT TTTTTTATGG AGGTTTGACA GGACTACAAC CGAGTATTTG TCGGGCTATT TTAATGGGTA CTGCTGTTTT AATTGGACAA ACAGTACAAA GAAAAATTAT TATATTAAAC TCTTTATTGT TGGCAGCAAC TCTGTTATTA TTATGGCAAC CCTTATGGAT TTGGGATCTA GGATTTCAAT TTAGTTTTTT GTCTACCTTT GGCTTAATTG TAACTATTCC ACCGTTAATG AAGCGATTAG ATTGGCTTCC CCCCGCGATC GCCACTTTAA TTGCTGTTCC CTTAGCTGCA ACTATTTGGG TTTTACCCCT ATCTTTATAC AAATTTAGTA TCTTAGCAAC CTATAGTATT CCTACTAATA TTATTACCTC TCCTTTAATT ACAGCCATTA GTATAGGAGG AATGATCAGT AGTGCCATAG CATTTATATC TCCAATTCTG GGTAGTTTTG TTGTTAAAAT ACTTTATTAT CCTGTTTATT TTTTGATTAA ATCATTAGAA ATTATCGTTC ATTTACCGGG TAGTTATTAT GCTGTAGGTA AGCTTTCATT AGGAGTTATG CTCCTCATTT ATGTTATTTT AGTTTTAATT TGGTTAAATC CCTTTTGGCA ACGCTATTGG AAATTGGCAG CATTGTTCAG TTTAGGATTA ATTATTCTGC CGATTATTTA TCAAAATTTA ACCCTAATCA AAGTCACTAT TTTAGAAGCG AAATCTAATC CAGTTGTCGT TATTCAAGAT CGCGGTAAAG TGAGTTTAAT TAATTTAGGA AATGAAGAAG CTATCAAATA TACTATCTTA CCATTTTTAT CCCAACAAGG CATTAATAAA ATTCATGCCG TTTTAGTCTT TGATTCCCAA AGTATAAGAG ATTGGTTAAT AGTTAATCCT 
TATATTACGG TGGATCGCTT TTTTCATAAT GTTGGGGAAA CTCAATCTAA TTCTCAACAA CTCTCTAGCG GACAAATTCT GAGATTAGGA TCAACTTTTA TCGAATTTTT AGCCAGTCAA CCCCTTTTAG TCAGCTTCAA AGTTAGTCAT CAATCTTGGT TATGGATAAC TGAAAATGTC CAAAAGCAAA CTTTCCTGCA AAAAGCGTTA AACATACCTA ATTCAGTGGT ATTATGGTCA GGGAAAAGTG TTTCATTAAA TCAGTTGATA AAACTCAATC CTAAAGTAGC GATCGCTAAT TCTTCTTATA TCCCAAAAAA AGTCCGTCAA GAATTACAAA CCAGAAACAT AGATTTTTAC TGGACCCGTC AAAATGGGGC GATTCAATGG ACTCCAAACA AAGGATTTAT AACGACAGTA ATTAGCGCAG AACAAAATGA ATTTTAA
|
Protein sequence | MSRERWTIFS LAYITGLLAT GVWGFPNPHP TLQQWLFVIV ILGLLPFIVA WFLKQFSVRC PSAKFWLGVS LVAILGAIYF QFRVPQPSVN DISRIFSKNT SYYQFVTVSG KILSEPRLTS DQGQKFWLKA QKVDLNSSDH SDAKRVSGKL YVTIPLKTKN QLYPGQTITI TGGLYKPRSP TNPGAFDFRA YLASQGTYAG LKGERIIHLG NVPFWGWWKL RQPIVNSFTQ GLGSPEGLLL SSMVLGRQAV DLSVEIRDLF MKTGLAHVLA ASGFHVVLLL GIILRLTRNL SPKQQLIIGI SSLFFYGGLT GLQPSICRAI LMGTAVLIGQ TVQRKIIILN SLLLAATLLL LWQPLWIWDL GFQFSFLSTF GLIVTIPPLM KRLDWLPPAI ATLIAVPLAA TIWVLPLSLY KFSILATYSI PTNIITSPLI TAISIGGMIS SAIAFISPIL GSFVVKILYY PVYFLIKSLE IIVHLPGSYY AVGKLSLGVM LLIYVILVLI WLNPFWQRYW KLAALFSLGL IILPIIYQNL TLIKVTILEA KSNPVVVIQD RGKVSLINLG NEEAIKYTIL PFLSQQGINK IHAVLVFDSQ SIRDWLIVNP YITVDRFFHN VGETQSNSQQ LSSGQILRLG STFIEFLASQ PLLVSFKVSH QSWLWITENV QKQTFLQKAL NIPNSVVLWS GKSVSLNQLI KLNPKVAIAN SSYIPKKVRQ ELQTRNIDFY WTRQNGAIQW TPNKGFITTV ISAEQNEF
|
| |
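The sequences above are printed in space-separated blocks of 10 characters, so they need to be cleaned before any computation. The sketch below shows one way to do that and to compute a GC fraction or a reverse complement. The helper names (`clean_sequence`, `gc_fraction`, `reverse_complement`) are hypothetical, and the example uses only the first two blocks of the gene sequence, so its GC value is not the whole-gene figure reported in the table.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Translation table mapping each base to its complement.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a cleaned DNA sequence (A, C, G, T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


# First two 10-base blocks of the gene sequence shown above.
fragment = clean_sequence("ATGAGTCGTG AGCGTTGGAC")
print(len(fragment))                     # 20
print(gc_fraction(fragment))             # 0.55
print(reverse_complement("ATGAGTCGTG"))  # CACGACTCAT
```

Pasting the full Gene sequence value into `clean_sequence` would let the Gene Length and GC content fields be checked the same way.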