Gene Cyan8802_3811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCyan8802_3811 
Symbol 
ID8393161 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8802 
KingdomBacteria 
Replicon accessionNC_013161 
Strand
Start bp3925329 
End bp3926360 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content45% 
IMG OID644981736 
Productpentapeptide repeat protein 
Protein accessionYP_003139450 
Protein GI257061562 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCAACCC TTATCGAGAA TTTACTTATT GGCACGATGT CAAACCCAGG TTTGATCAAA 
ACAGCACCTA TGGACGCGCA GGAACTAATC TGGCTATATG GCCAGGGTCA ACGGGACTTT
AGTCGGCAAG ATCTCCAAAG CGAAGATATT ATTCAAGCTA TCCTCACGGA AGCGAATTTA
AGTCGTACTG CCTTAGATTG GGCTAACTTA AGTGGAACGG ATTTAAGTCG TGCGAACCTC
AATCGGGCTG ATTTAATCCA CGCTAAACTC ATTAGTGCCA AATTAGTTGG AGTCGATTTA
ACGGGGGCTG ATCTCAGTCA TGCTGATCTC AGTTGGGTGA ATCTCGAAGG CTCAACCCTG
ATCAGTGCTA ATTTAAGCAA TGCCAATCTT CGACAAACCA ATCTCACCAA TGCTGACTTG
AGAAGTGCCA ACCTCAGTGG AGCTAACCTC AGTGGAGCTA ACCTCAGTGG AGCAAAACTG
AGTCGCGCCG ATCTCAGTGA AGCCGATCTT AGTGGGGTTG ATCTCAGTGG GGCAAATTTG
AGCCGCGCCG ATCTCAGCGA AGCGGATCTG ATGGAAGTGG ATTTAAGTTA CAGCAACCTT
TATAAAGCCG ATCTCAGCGA AAGTAAACTC CGTAACAGCG ATCTAGAAGA GGCATTTCTC
CAAGGAGCCA ATTTTAGCCG TGCTAATTTG AAAGGAGCCG ATCTCTCCAG GGCAGTTTTA
AGGGAAAATA CCCTGAGCCT GCTGACCTTA TCGGAGTTTA ATGTTCAAAG TGTTAATCTC
TCTAATGAAA TTGACTTAAG TTCAGCTAAT CTGCGAGGAT GTAACCTGAG AGGGGCTATT
TTGCGTCATG CCAATTTAGG GTATGGGTTG CTCCACAAAA CCAATTTGAT TGATGCGATC
CTACGGGAAG CTAATATGAT TGATGCTTCG TTACGCGGAG GAGATTTGCG GGGAGCTAAA
TTCCGCAATA GTAATATTAA TGCTATTAAT TTAATGGAAG CGATTATGCC TGACGGCAGT
ATTCATCCCT AA
 
Protein sequence
MSTLIENLLI GTMSNPGLIK TAPMDAQELI WLYGQGQRDF SRQDLQSEDI IQAILTEANL 
SRTALDWANL SGTDLSRANL NRADLIHAKL ISAKLVGVDL TGADLSHADL SWVNLEGSTL
ISANLSNANL RQTNLTNADL RSANLSGANL SGANLSGAKL SRADLSEADL SGVDLSGANL
SRADLSEADL MEVDLSYSNL YKADLSESKL RNSDLEEAFL QGANFSRANL KGADLSRAVL
RENTLSLLTL SEFNVQSVNL SNEIDLSSAN LRGCNLRGAI LRHANLGYGL LHKTNLIDAI
LREANMIDAS LRGGDLRGAK FRNSNINAIN LMEAIMPDGS IHP