Gene Cyan8802_4131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_4131 
Symbol 
ID8393482 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp4257000 
End bp4259246 
Gene Length2247 bp 
Protein Length748 aa 
Translation table11 
GC content35% 
IMG OID644982047 
ProductComEC/Rec2-related protein 
Protein accessionYP_003139759 
Protein GI257061871 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCGTG AGCGTTGGAC AATCTTTAGT TTAGCTTATA TTACAGGATT ATTAGCGACA 
GGGGTTTGGG GGTTTCCTAA TCCTCATCCT ACCTTACAGC AATGGCTATT TGTGATAGTA
ATTTTGGGGT TACTTCCTTT TATTGTTGCT TGGTTCCTTA AACAATTTTC GGTAAGATGT
CCTTCGGCTA AATTTTGGTT AGGGGTTAGT TTAGTTGCTA TTTTAGGGGC AATCTATTTT
CAGTTTCGGG TTCCTCAACC AAGCGTTAAT GATATTAGTC GAATTTTTTC TAAGAATACT
TCTTATTATC AGTTCGTAAC GGTTTCAGGA AAAATTTTAT CAGAACCTAG ATTAACCTCT
GATCAAGGAC AAAAATTTTG GCTAAAAGCA CAAAAAGTTG ACCTTAATTC CTCAGATCAT
TCTGATGCTA AAAGGGTGAG CGGAAAGTTA TATGTTACCA TTCCTTTGAA GACAAAAAAT
CAGCTTTATC CAGGTCAAAC AATAACTATT ACTGGAGGGT TATATAAACC GCGATCGCCT
ACTAATCCAG GGGCTTTTGA TTTTCGCGCT TATTTAGCAA GTCAAGGAAC TTATGCGGGG
TTAAAAGGAG AGAGAATTAT TCACTTAGGA AACGTCCCTT TTTGGGGATG GTGGAAGTTA
CGACAGCCTA TTGTTAATAG TTTTACCCAA GGGTTAGGAT CTCCTGAAGG GTTACTCCTC
AGTTCGATGG TTTTGGGACG ACAAGCGGTT GATTTATCGG TGGAAATTCG GGATTTATTT
ATGAAGACTG GATTAGCTCA TGTCTTGGCT GCTTCAGGGT TTCATGTGGT TTTATTATTG
GGAATTATTC TGCGATTAAC GCGAAATTTA TCTCCTAAAC AACAATTAAT TATCGGAATA
AGTAGTTTAT TTTTTTATGG AGGTTTGACA GGACTACAAC CGAGTATTTG TCGGGCTATT
TTAATGGGTA CTGCTGTTTT AATTGGACAA ACAGTACAAA GAAAAATTAT TATATTAAAC
TCTTTATTGT TGGCAGCAAC TCTGTTATTA TTATGGCAAC CCTTATGGAT TTGGGATCTA
GGATTTCAAT TTAGTTTTTT GTCTACCTTT GGCTTAATTG TAACTATTCC ACCGTTAATG
AAGCGATTAG ATTGGCTTCC CCCCGCGATC GCCACTTTAA TTGCTGTTCC CTTAGCTGCA
ACTATTTGGG TTTTACCCCT ATCTTTATAC AAATTTAGTA TCTTAGCAAC CTATAGTATT
CCTACTAATA TTATTACCTC TCCTTTAATT ACAGCCATTA GTATAGGAGG AATGATCAGT
AGTGCCATAG CATTTATATC TCCAATTCTG GGTAGTTTTG TTGTTAAAAT ACTTTATTAT
CCTGTTTATT TTTTGATTAA ATCATTAGAA ATTATCGTTC ATTTACCGGG TAGTTATTAT
GCTGTAGGTA AGCTTTCATT AGGAGTTATG CTCCTCATTT ATGTTATTTT AGTTTTAATT
TGGTTAAATC CCTTTTGGCA ACGCTATTGG AAATTGGCAG CATTGTTCAG TTTAGGATTA
ATTATTCTGC CGATTATTTA TCAAAATTTA ACCCTAATCA AAGTCACTAT TTTAGAAGCG
AAATCTAATC CAGTTGTCGT TATTCAAGAT CGCGGTAAAG TGAGTTTAAT TAATTTAGGA
AATGAAGAAG CTATCAAATA TACTATCTTA CCATTTTTAT CCCAACAAGG CATTAATAAA
ATTCATGCCG TTTTAGTCTT TGATTCCCAA AGTATAAGAG ATTGGTTAAT AGTTAATCCT
TATATTACGG TGGATCGCTT TTTTCATAAT GTTGGGGAAA CTCAATCTAA TTCTCAACAA
CTCTCTAGCG GACAAATTCT GAGATTAGGA TCAACTTTTA TCGAATTTTT AGCCAGTCAA
CCCCTTTTAG TCAGCTTCAA AGTTAGTCAT CAATCTTGGT TATGGATAAC TGAAAATGTC
CAAAAGCAAA CTTTCCTGCA AAAAGCGTTA AACATACCTA ATTCAGTGGT ATTATGGTCA
GGGAAAAGTG TTTCATTAAA TCAGTTGATA AAACTCAATC CTAAAGTAGC GATCGCTAAT
TCTTCTTATA TCCCAAAAAA AGTCCGTCAA GAATTACAAA CCAGAAACAT AGATTTTTAC
TGGACCCGTC AAAATGGGGC GATTCAATGG ACTCCAAACA AAGGATTTAT AACGACAGTA
ATTAGCGCAG AACAAAATGA ATTTTAA
 
Protein sequence
MSRERWTIFS LAYITGLLAT GVWGFPNPHP TLQQWLFVIV ILGLLPFIVA WFLKQFSVRC 
PSAKFWLGVS LVAILGAIYF QFRVPQPSVN DISRIFSKNT SYYQFVTVSG KILSEPRLTS
DQGQKFWLKA QKVDLNSSDH SDAKRVSGKL YVTIPLKTKN QLYPGQTITI TGGLYKPRSP
TNPGAFDFRA YLASQGTYAG LKGERIIHLG NVPFWGWWKL RQPIVNSFTQ GLGSPEGLLL
SSMVLGRQAV DLSVEIRDLF MKTGLAHVLA ASGFHVVLLL GIILRLTRNL SPKQQLIIGI
SSLFFYGGLT GLQPSICRAI LMGTAVLIGQ TVQRKIIILN SLLLAATLLL LWQPLWIWDL
GFQFSFLSTF GLIVTIPPLM KRLDWLPPAI ATLIAVPLAA TIWVLPLSLY KFSILATYSI
PTNIITSPLI TAISIGGMIS SAIAFISPIL GSFVVKILYY PVYFLIKSLE IIVHLPGSYY
AVGKLSLGVM LLIYVILVLI WLNPFWQRYW KLAALFSLGL IILPIIYQNL TLIKVTILEA
KSNPVVVIQD RGKVSLINLG NEEAIKYTIL PFLSQQGINK IHAVLVFDSQ SIRDWLIVNP
YITVDRFFHN VGETQSNSQQ LSSGQILRLG STFIEFLASQ PLLVSFKVSH QSWLWITENV
QKQTFLQKAL NIPNSVVLWS GKSVSLNQLI KLNPKVAIAN SSYIPKKVRQ ELQTRNIDFY
WTRQNGAIQW TPNKGFITTV ISAEQNEF