Gene PCC8801_4092 details

Gene Information

Locus tag: PCC8801_4092
Symbol:
ID: 7101887
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8801
Kingdom: Bacteria
Replicon accession: NC_011726
Strand:
Start bp: 4286554
End bp: 4288800
Gene Length: 2247 bp
Protein Length: 748 aa
Translation table: 11
GC content: 35%
IMG OID: 643477084
Product: ComEC/Rec2-related protein
Protein accession: YP_002374183
Protein GI: 218248812
COG category: [R] General function prediction only
COG ID: [COG0658] Predicted membrane metal-binding protein
TIGRFAM ID: [TIGR00360] ComEC/Rec2-related protein
            [TIGR00361] DNA internalization-related competence protein ComEC/Rec2
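
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic; it assumes the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
start_bp = 4286554
end_bp = 4288800


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2247 748
```

Both values agree with the Gene Length and Protein Length fields.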


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTCGTG AGCGTTGGAC AATCTTTAGT TTAGCTTATA TTACAGGATT ATTAGCGACA 
GGGGTTTGGG GGTTTCCTAA TCCTCATCCT ACCTTACAGC AATGGCTATT TGTGATAGTA
ATTTTGGGGT TACTTCCTTT TATTGTTGCT TGGTTCCTTA AACAATTTTC GGTAAGATGT
CCTTCGGCTA AATTTTGGTT AGGGGTTAGT TTAGTTGCTA TTTTAGGGGC AATCTATTTT
CAGTTTCGGG TTCCTCAACC AAGCGTTAAT GATATTAGTC GAATTTTTTC TAAGAATACT
TCTTATTATC AGTTCGTAAC GGTTTCAGGA AAAATTTTAT CAGAACCTAG ATTAACCTCT
AATCAAGGAC AAAAATTTTG GCTAAAAGCA CAAAAAGTTG ACCTTAATTC CTCAGATCAT
TCTGATGCTA AAAGGGTAAG CGGAAAGTTA TATGTTACCA TTCCTTTGAA GACAAAAAAT
CAGCTTTATG CAGGTCAAAC AATAACTATT ACTGGAGGGT TATATAAACC GCGATCGCCT
ACTAATCCAG GGGCTTTTGA TTTTCGCGCT TATTTAGCAA GTCAAGGAAC TTATGCGGGG
TTAAAAGGAG AGAGAATTAT TCACTTAGGA AACGTCCCTT TTTGGGGATG GTGGAAGTTA
CGACAGCCTA TTGTTAATAG TTTTACCCAA GGGTTAGGAT CTCCTGAAGG GTTACTCCTC
AGTTCGATGG TTTTGGGACG ACAAGCGGTT GATTTATCGG TGGAAATTCG GGATTTATTT
ATGAAGACTG GATTAGCTCA TGTCTTGGCT GCTTCAGGGT TTCATGTGGT TTTATTATTG
GGAATTATTC TGCGATTAAC GCGAAATTTA TCTCCTAAAC AACAATTAAT TATCGGAATA
AGTAGTTTAT TTTTTTATGG AGGTTTGACA GGACTACAAC CGAGTATTTG TCGGGCTATT
TTAATGGGTA CTGCTGTTTT AATTGGACAA ACAGTACAAA GAAAAATTAT TATATTAAAC
TCTTTATTGT TGGCAGCAAC TCTGTTATTA TTATGGCAAC CCTTATGGAT TTGGGATCTA
GGATTTCAAT TTAGTTTTTT GTCTACCTTT GGCTTAATTG TAACTATTCC ACCGTTAATG
AAGCGATTAG ATTGGCTTCC CCCCGCGATC GCCACTTTAA TTGCTGTTCC CTTAGCTGCA
ACTATTTGGG TTTTACCCCT ATCTTTATAC AAATTTAGTA TCTTAGCAAC CTATAGTATT
CCTACTAATA TTATTACCTC TCCTTTAATT ACAGCCATTA GTATAGGAGG AATGATCAGT
AGTGCCATAG CATTTATATC TCCAATTCTG GGTAGTTTTG TTGTTAAAAT ACTTTATTAT
CCTGTTTATT TTTTGATTAA ATTATTAGAA ATTATTGTTC ATTTGCCAGG TAGTTATTAT
GCTGTAGGTA AGCTTTCATT AGGAGTTATG CTCCTCATTT ATGTTATTTT AGTTTTAATT
TGGTTAAATC CCTTTTGGCA ACGCTATTGG AAATTGGCAG CATTGTTCAG TTTAGGATTA
ATTATTCTGC CGATTATTTA TCAAAATTTA ACCCTAATCA AAGTCACTAT TTTAGAAGCG
AAATCTAATC CAGTTGTCGT TATTCAAGAT CGCGGTAAAG TGAGTTTAAT TAATTTAGGA
AATGAAGAAG CTATCAAATA TACTATCTTA CCATTTTTAT CCCAACAAGG CATTAATAAA
ATTCATGCTG TTTTAGTCTT TGATTCCCAA AGTATAAGAG ATTGGTTAAT CGTTAATCCT
TATATTACAG TGGATCTCTT TTTTCATAAC GTTGGGGAAA CTCAATCTAA TTCTCACCAA
CTCTCTAGTG GACAAATTAT CAAATTAGGG TCAACTTTTA TTGAACTTGT AGCCAATCAA
CCCCTTTTAG TCAGCTTCAA GCTTAGTAAC CAATCTTGGT TATGGATAAC TCAAAATATC
CAAAAGCAAA CTTTGCCACA AAAAGCCTTA AACATACCTA ATTTAGCGGT ATTATGGTCA
GGGAAAAGTG TTTCGTTAAA ACAGTTAATA AAACTTAATC CCAAAGTGGC GATCGCTAAC
TCTTCTTATA TTCCCAAAAA AGTCCGTCAA GAATTAGAAA CTAGAAACAT AGATTTTTAC
TGGACTCGTC AAGATGGGGC TATTCAATGG ACTCCAAAAA AAGGATTTAT AACGACAGGA
ATTACGGGAG AACAAAATGA ATTTTAA
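
The gene sequence above is printed in space-separated blocks of ten bases. A minimal sketch of how such text could be cleaned, checked for GC content, and translated is given below. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `gc_content`, and `translate` are hypothetical, and only the first and last rows of the sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard amino acid
# assignments, assumed to apply to translation table 11 as well).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq_text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First and last rows of the gene sequence listed above.
first_row = clean("ATGAGTCGTG AGCGTTGGAC AATCTTTAGT TTAGCTTATA TTACAGGATT ATTAGCGACA")
last_row = clean("ATTACGGGAG AACAAAATGA ATTTTAA")

print(translate(first_row))  # MSRERWTIFSLAYITGLLAT
print(translate(last_row))   # ITGEQNEF
```

Every full row holds 60 bases, a multiple of three, so each row starts in frame and can be translated on its own. Passing the whole cleaned sequence to `gc_content` gives a value to compare with the GC content field in the Gene Information section.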
 
Protein sequence
MSRERWTIFS LAYITGLLAT GVWGFPNPHP TLQQWLFVIV ILGLLPFIVA WFLKQFSVRC 
PSAKFWLGVS LVAILGAIYF QFRVPQPSVN DISRIFSKNT SYYQFVTVSG KILSEPRLTS
NQGQKFWLKA QKVDLNSSDH SDAKRVSGKL YVTIPLKTKN QLYAGQTITI TGGLYKPRSP
TNPGAFDFRA YLASQGTYAG LKGERIIHLG NVPFWGWWKL RQPIVNSFTQ GLGSPEGLLL
SSMVLGRQAV DLSVEIRDLF MKTGLAHVLA ASGFHVVLLL GIILRLTRNL SPKQQLIIGI
SSLFFYGGLT GLQPSICRAI LMGTAVLIGQ TVQRKIIILN SLLLAATLLL LWQPLWIWDL
GFQFSFLSTF GLIVTIPPLM KRLDWLPPAI ATLIAVPLAA TIWVLPLSLY KFSILATYSI
PTNIITSPLI TAISIGGMIS SAIAFISPIL GSFVVKILYY PVYFLIKLLE IIVHLPGSYY
AVGKLSLGVM LLIYVILVLI WLNPFWQRYW KLAALFSLGL IILPIIYQNL TLIKVTILEA
KSNPVVVIQD RGKVSLINLG NEEAIKYTIL PFLSQQGINK IHAVLVFDSQ SIRDWLIVNP
YITVDLFFHN VGETQSNSHQ LSSGQIIKLG STFIELVANQ PLLVSFKLSN QSWLWITQNI
QKQTLPQKAL NIPNLAVLWS GKSVSLKQLI KLNPKVAIAN SSYIPKKVRQ ELETRNIDFY
WTRQDGAIQW TPKKGFITTG ITGEQNEF