Gene Cyan8802_4104 details


Gene Information

Locus tag: Cyan8802_4104
Symbol:
ID: 8393455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 4230129
End bp: 4231379
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 45%
IMG OID: 644982021
Product: S-layer domain protein
Protein accession: YP_003139733
Protein GI: 257061845
COG category:
COG ID:
TIGRFAM ID:
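
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single untranslated stop codon; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4230129
end_bp = 4231379
gene_length_bp = 1251
protein_length_aa = 416

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the final stop codon
# is not translated, so it is subtracted from the codon count.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1251 0 416
```

Under these assumptions the span matches the stated gene length, and the codon count minus the stop codon matches the stated protein length.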


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAAATTA ACTTACCCCT GCGACTAGCT ACCCTAGCCA CAGTGACCTT TAGTAGCTTC 
ATTCCCTTTA ATACCCCCGT TAAAGCCGCT GGGTTTGAAG AACAAGCGAT TGATCAAGAG
GGAGTTATTG CGATCGCTAG ACCCTACGGA CAAAATAAGT ATGACCTGTT AGTTATTGAG
CAAATTCCTG GTAAACAAGC TTGTTGGGCA GAAAATGGTT CTAATCCCGT TTTGGTTGAC
CCTCTATTAC TTAATTTTGA TTTTACGGGA ATTTGCCGCC GAGCTACCGA TAGTAACGGC
TACTCCATTC GGGTTGATGG GACAGACTTG GGGTTAGATT ACCTCTTGCG TCTGGTTCCT
CGCAATGGTG AATTAGTCTT AGTGGGAACC CCCCGCGTGG GAAGTTACAG CGAAATCGTT
CTCGGTAGCA CCAAAGGACT GGCCTCTGGG TTTATGAAAA TCATTCTCAA TCCAGGGTGG
CAATTTTCCA AACGCGCCTA TCAAGGGAAA GCATTGGGTC ACTTCTACAT TAGCGGTTCT
AAAGCTGCGA TCGCGGGTGA CGTGGCAGCT ACCCCCAGTA ACCCCTCTGT CACCTTGAAT
CCTGACACCA ATACCACAAC AACGGCTGGT TTAAAGGATA TTAGTAATAA TCCCTATAAA
ACCGAGATTG AAAAAGCGGT TGCTCTGGGA ATTGTGTCTG GGTTTGAAGA TAACACTTTC
CGTCCCCAAG ACTCCGTCAC CCGTGAACAA TTTGTCTCTA TGGCCGTCGA TGCGATCGCA
ACTGTCTACA AAGTTGATTT AGCGACCCAA CCCCAACGGG ATATGATCCC CTTTAAGGAT
GTGGAGAGTA GTCGGTGGAG TGCTAATAAA ATTAAATGGG CTCAATGGAA TTTCTTGAAC
GTGGGTAATC CTAATAATAC CTTTCAACCC ACAGAATCCA TTACTCGCGC TGAATTAATC
GACACTGCGC GTCGTATGGC TATCCATCTG AAAAATCAAC TCGATTTACC ACGAGAAATT
CAACAAACCC AAGAACCGGC CAAATTTTCT GATATTTCGG GTAGTTGGGC TCAAACCGTT
ATTACTGAAA TGTCTGGTTA TTGTGGTGTA GCTACCCCCA TCAATGAAAC GGGAACCGAG
TTTGCCCCTG ACCGTAAAGC GACCCGCGAC TATACAACAG CCGTTCTTAA ACGGATCGTT
GAATGTGTTA AATCAGAAGC GCAACAAGCC AATAATAAGA CAAATCAATA A
 
Protein sequence
MKINLPLRLA TLATVTFSSF IPFNTPVKAA GFEEQAIDQE GVIAIARPYG QNKYDLLVIE 
QIPGKQACWA ENGSNPVLVD PLLLNFDFTG ICRRATDSNG YSIRVDGTDL GLDYLLRLVP
RNGELVLVGT PRVGSYSEIV LGSTKGLASG FMKIILNPGW QFSKRAYQGK ALGHFYISGS
KAAIAGDVAA TPSNPSVTLN PDTNTTTTAG LKDISNNPYK TEIEKAVALG IVSGFEDNTF
RPQDSVTREQ FVSMAVDAIA TVYKVDLATQ PQRDMIPFKD VESSRWSANK IKWAQWNFLN
VGNPNNTFQP TESITRAELI DTARRMAIHL KNQLDLPREI QQTQEPAKFS DISGSWAQTV
ITEMSGYCGV ATPINETGTE FAPDRKATRD YTTAVLKRIV ECVKSEAQQA NNKTNQ
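
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a sketch under stated assumptions, not the method used to produce the record: it relies on the fact that translation table 11 shares its codon-to-amino-acid assignments with the standard genetic code (the two differ in which start codons are permitted), and it does not handle alternative start codons. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the gene sequence above.
first_block = "ATGAAAATTA ACTTACCCCT GCGACTAGCT"
last_block = "GAATGTGTTA AATCAGAAGC GCAACAAGCC AATAATAAGA CAAATCAATA A"

print(translate(first_block))  # MKINLPLRLA
print(translate(last_block))   # ECVKSEAQQANNKTNQ
```

Both outputs agree with the start and end of the protein sequence listed above. The final line of the gene sequence begins at base 1201, so it is already in frame and can be translated on its own.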