Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_3234 |
Symbol | |
ID | 7107711 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | - |
Start bp | 3619974 |
End bp | 3621314 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 643481478 |
Product | NHL repeat containing protein |
Protein accession | YP_002378502 |
Protein GI | 218440173 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAT CCTCAATTAT TTTATTATCT CTTTTAAGTT TAATGAGTGC CTGTCAAAAC CCTGGCCTTG AGACAACCTC AACCCCGTCT CTAGAAAGTT CTATTAATAG GAGTAAGGCG AGTATTCCTC CTCAAAAGAT CATTCCTACC GAAACTTTAA CCCCCCAACC TATCCGCATC ACGATTGAAA GTTTACCCCA ACCTTACGCC ACTAATAGCG CTTCTCAACC GCCTCAAGTT GTGGATATTC CGGATAATCC CACCTTAAAA GTTCCCCCAG GTTTTGGAGT TAATATATTT GCGGAAAATT TAGATCAACC TCGTTGGTTA GCGTTAACTC CGGATGGAGA TGTATTAGTC ACAGAAACTC GTCAAAATCG CATTCGGTTA TTAAAAGATA CTAATCAAGA TGGGGTAGTT GATGAGTATA AAACTTTTGC GACGGCTGAA AATGGATTAG AAATTCCTTT TGGTATGGAT TTTGCCAAAG GATACTTTTT TTTAGGCAAT CATAACGAAG TCCGACGCTA TCCTTATACT CAAGGACAGG AACAATTACA AGGAACGGGA GAAAAAATTA CCGATTTACC AGGAGGAGGA TACCGTCAAC ATTGGACTCG AAATGTAATC GTTTCTCCGG ATGAACAAAA ATTATATGTT TCTATTGGTT CTGAATCGAA TGCTGACACA GAACCTCTTC CTCGCGCTTC TGTACAGGTG ATGAATTTAG AGGGTTCTAA TCGAGAAACT TTTGCTTATG GATTACGAAA TCCTGTCGGT CTTGATTTTC ATCCCCTAAC AGGAGAGTTA TACACAACAG TCAATGAACG AGATAAATTA GGAGATGATT TAGTTCCCGA TTATCTCACT CAGATAGAAA AAGGGGAATT TTATGGCTGG CCTTACGCTT ATTTTACCCC CAATTTACTC GATCCCCGTC ATGTTAAAAA TGGTCAAAGT GTTAAACCTG AATTAGTGGC GCAGACTGTC ATGCCTGATG TGTTATTTCA ATCTCATTCA GCCGCTTTAG GGTTACAATT TTATGATAAA ACGACTTTTC CTCAAAAGTA TCATAATGGC GCGTTTGTGG CGTTTCGAGG GTCTTGGAAT CGGAATCAGG GAACGGGTTA TAAAATCGTT TTTGTTCCGT TTAATGACCA AGGAAAACCT TTAGGATATT ACGAAGATTT TTTAAGCGGA TTTCTCCTCG ATCCTTCTAT TCCTACTACT TGGGGAAGAC CGGTAGGATT ATTAGTATTA CCCGATGGAA GTTTATTAGT CACCGAAGAA GCAAACGGGA GAATTTATCG GATTTTTAGG AAAGGGGAAA CACAGGGATA A
|
Protein sequence | MKLSSIILLS LLSLMSACQN PGLETTSTPS LESSINRSKA SIPPQKIIPT ETLTPQPIRI TIESLPQPYA TNSASQPPQV VDIPDNPTLK VPPGFGVNIF AENLDQPRWL ALTPDGDVLV TETRQNRIRL LKDTNQDGVV DEYKTFATAE NGLEIPFGMD FAKGYFFLGN HNEVRRYPYT QGQEQLQGTG EKITDLPGGG YRQHWTRNVI VSPDEQKLYV SIGSESNADT EPLPRASVQV MNLEGSNRET FAYGLRNPVG LDFHPLTGEL YTTVNERDKL GDDLVPDYLT QIEKGEFYGW PYAYFTPNLL DPRHVKNGQS VKPELVAQTV MPDVLFQSHS AALGLQFYDK TTFPQKYHNG AFVAFRGSWN RNQGTGYKIV FVPFNDQGKP LGYYEDFLSG FLLDPSIPTT WGRPVGLLVL PDGSLLVTEE ANGRIYRIFR KGETQG
|
| |