Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC7424_2399 |
Symbol | |
ID | 7110710 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | - |
Start bp | 2673083 |
End bp | 2673973 |
Gene Length | 891 bp |
Protein Length | 296 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643480656 |
Product | hypothetical protein |
Protein accession | YP_002377688 |
Protein GI | 218439359 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 0.735883 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAAAA TTTGGCAATT GATTGGTAAA AGTGGATTAA TAGCTTTAAT CGTGTCCTGT CAACCGGCTG AAATCCCTTC TTTTAAACCC GAAGCATCTC CCGTTGTTAA ACCCACCCTA CAATACCAAG TTTATGAATT GCCTTACACT ACAGTTCATA CCCTACGAAT TCCGGCACAG GGCGAGTTTA CTGTCAAAGT AGCTGTTTGG GAGAGTGTAA ATACCTTAGA TACTTTTGCT CACAAATATG GGGCGATCGC TGTTTTAAAT GGTGGCTTTT TTGATCCCAG TAATGGGAAG ACGACTTCCT ACATAATTGA AGATGGCCAA ATCGTAGCTG ACCCCCAAGA AAATCCCTCG TTGATGAATA ATCCCACATT AGTCCCCTAT TTGGAGAAGA TTTTCAACCG TAGCGAGTTT AGACGTTACT TGTGTGGGAC AACGCTCAAA TACGCGATCG CTTCTCGTAA CCAACCAATA CCAGAAAATT GTCAATTAAT CGATGCCCTT GGAGGTGGGC CAAGTTTATT ACCAGAGATT ACAGCCAAAG CGGAGGGCTT TTTTACACAG GAGGGGGAAA AAATTATCCG CGATCCTTTG GGGATGAAAC AAGCTAATGC CCGAAGTGCC CTTGGAATTA CTGGTCAAGG AGATTTAATT TGGATTATGG TAGCCCAAAA GCCTAATTCA TCTGGGGCTT CTGGGCTATC CTTGCCAGAG TTAGCTAAGT TTTTGAAGAG TTTAGGAGTT GAGGAAGCCA TCAATTTAGA TGGCGGCAGT TCCTCTTCTT TTTTCTATCA GGGAAAAACC TATTACGGCA AGGTGGATCG GCAGGGAAAT CGAATTCAAC GTCCGGTTAA ATCGGTTTTA ATTTTACAAA AACAGCAATA A
|
Protein sequence | MKKIWQLIGK SGLIALIVSC QPAEIPSFKP EASPVVKPTL QYQVYELPYT TVHTLRIPAQ GEFTVKVAVW ESVNTLDTFA HKYGAIAVLN GGFFDPSNGK TTSYIIEDGQ IVADPQENPS LMNNPTLVPY LEKIFNRSEF RRYLCGTTLK YAIASRNQPI PENCQLIDAL GGGPSLLPEI TAKAEGFFTQ EGEKIIRDPL GMKQANARSA LGITGQGDLI WIMVAQKPNS SGASGLSLPE LAKFLKSLGV EEAINLDGGS SSSFFYQGKT YYGKVDRQGN RIQRPVKSVL ILQKQQ
|
| |