Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cyan8802_0530 |
Symbol | |
ID | 8389836 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8802 |
Kingdom | Bacteria |
Replicon accession | NC_013161 |
Strand | - |
Start bp | 525326 |
End bp | 526723 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 644978557 |
Product | Mammalian cell entry related domain protein |
Protein accession | YP_003136313 |
Protein GI | 257058425 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component |
TIGRFAM ID | [TIGR00996] virulence factor Mce family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0794795 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.00420823 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCTGCGAT CAAGAACCCT TCAGGAAGGT ACAGTGGGCT TATTTGCCCT CATAGGACTC GTCTTGTTTG GAGGATTGGT GATTTGGCTA CGCGGAGGCG TATTAGGTCA AAAACCCTAT CAAATTCAGG CTAATTTTCA AGATGTTAGC GGGTTACAAA TCGGTGCGCC TGTTAACTTT CGTGGGGTTG CTGTGGGGAA AATCACGGCA CTGCAAGCGA GTAGCAATGG GGTAACGGTA TTAATTGAAG TGTCTTCGCG GGAGTTACGC ATTCCCATCG GTTCTACCAT TCAAATTAAC CGTTATGGAT TGATTGGGGA AGCTTCGGTG GATATTACCC CATCAGAAAA ACTCTCTGAT CAAGCGTTGG CGGTTGATCC GACGAGTGAG GAGTGTCCTG ACAAACAACT GATCATTTGT GATAATGATA CCCTTGATGG TGAAACGGGT TCTCAATTGG TACAAGCTTT AACTCGTCTG AGTAATGCCT ATAGTGATCC CGAATTTGTT AAGGAATTGA AAGGGGCTTT TACCAGTGTT GCCCAAGCAG GAACTAAAAT TGGTAAATTG AGTGACGAAG CGGCTATTTT TTCTAAAACA GCGCGTCGAG AAATTCAAGG GACTTCTCAA ACCATTGCTC AAATTAACCA AGCTGCGCGG GATGCTTCCC AATTAATGCG AAATGTGAAT ACGGTTGTCT CGGAAAATCG AGAAAGCCTC AATCGGGCGG TTAATAATGC AGCGAGTTTA GTCAATAATT TGAATGGATT AGTCTCGGAA AATCGAGGTA ATGTTATTAA TACGTTGAAT AGTTTAGAAC GTACCAGTGA TGAGGTGCGA ATGGTGGCTA TTGGCTTAGG AAAAACCGTT AATAAAGTGA ATAGCGGCAT TGATGAAGTG AATATTAAAA AAATTGCTAG GGATTTAGAA ATTTTAATGG CTAATGCGGC GGAAACTTCA GCCAATTTGC GAGATATTTC TCAATCTTTT AATGATCCTA CTGTGATTTT AACGGTGCAA AAAACCTTGG ATTCTGCGCG AGCAACCTTT GAAAATGCCC AGAAAATCAC CTCGGATGTA GAGGAATTAA CGGGTGATCC CGCTTTTAGG GATAATGTTC GTAAATTGAT TAATGGCTTG AGTAATTTAT TGTCTTATAC TAATCAACTA GAACAACAGA TTTATACGGC TCAATTAATG GAGTCAGTCA CCGAACAGTT AGAATATCAA GTTGCCGTAC AACAGCGTTT TCTTGAACAA GAAAATGCGA ATCAAACAAC GCTTTCTAGG GATAGTTCTA TCCCTCCCCA AGTTCCCGTT AAAGAAACCC CTAAACCTGT TCGAGTCATT GCTCCTGAGT GGGTACTAGA AAGTGAAAAA AACAATCAAA TTAGATAA
|
Protein sequence | MLRSRTLQEG TVGLFALIGL VLFGGLVIWL RGGVLGQKPY QIQANFQDVS GLQIGAPVNF RGVAVGKITA LQASSNGVTV LIEVSSRELR IPIGSTIQIN RYGLIGEASV DITPSEKLSD QALAVDPTSE ECPDKQLIIC DNDTLDGETG SQLVQALTRL SNAYSDPEFV KELKGAFTSV AQAGTKIGKL SDEAAIFSKT ARREIQGTSQ TIAQINQAAR DASQLMRNVN TVVSENRESL NRAVNNAASL VNNLNGLVSE NRGNVINTLN SLERTSDEVR MVAIGLGKTV NKVNSGIDEV NIKKIARDLE ILMANAAETS ANLRDISQSF NDPTVILTVQ KTLDSARATF ENAQKITSDV EELTGDPAFR DNVRKLINGL SNLLSYTNQL EQQIYTAQLM ESVTEQLEYQ VAVQQRFLEQ ENANQTTLSR DSSIPPQVPV KETPKPVRVI APEWVLESEK NNQIR
|
| |