Gene Information |
Locus tag | PCC7424_0303 |
Symbol | |
ID | 7108191 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 7424 |
Kingdom | Bacteria |
Replicon accession | NC_011729 |
Strand | + |
Start bp | 329899 |
End bp | 331833 |
Gene Length | 1935 bp |
Protein Length | 644 aa |
Translation table | 11 |
GC content | 41% |
IMG OID | 643478578 |
Product | hypothetical protein |
Protein accession | YP_002375639 |
Protein GI | 218437310 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4632] Exopolysaccharide biosynthesis protein related to N-acetylglucosamine-1-phosphodiester alpha-N-acetylglucosaminidase |
TIGRFAM ID | |
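The coordinate and length fields in the table above can be cross-checked against one another. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not state its coordinate convention) and that the final codon of the CDS is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 329899
end_bp = 331833
gene_length_bp = 1935
protein_length_aa = 644

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the last codon is assumed to be the stop codon.
codons, remainder = divmod(gene_length_bp, 3)

print(span)        # 1935, matches Gene Length
print(remainder)   # 0, the length is a multiple of three
print(codons - 1)  # 644, matches Protein Length
```

Under these assumptions, the listed span, gene length, and protein length are mutually consistent.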
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 46 |
Fosmid unclonability p-value | 0.139753 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGAAAAG ATAATTGGTA TTCCTTGGGT ATATTTTCCT TTTCCTTGGT GACGGTAGCT ATTCTCTATA ATGTCAATAA TAATAGCCCC AACCATTCCC AAAAACTTAA CCCTCACGAA ACTTCATTTA ACTCGCAAAC TCTCTTAACT AGCACAATTT CTAATTTTAA AACACAACCC GTCTTAGCTC AAGGTAATTT TTCTGGACTT CAACAAGGAA ATGAAATTAC AATTAATGGA CGTAAATTTC CTCTAGCTTG GAGTCAATGG ACAGAAGGAA CAGCCACTAG AATTGGGATT AGTGATACGG GGGTAATGAA TATTTTCGGA CTTCATTTAT TGAGTACCCC TAACCCTAAC ATTCAACCCG TTGTCTGGTT TAATAATGAT CCCAATCAAC CGACACTTTT AAAAGCCAAA TTTATTGCCC CTTATCGTTA TTTAGATGTC ACGGATATTT TACGGGATGC CGGCACACAA ATTCAAATGG TTGGTAATAG TTTAAATTTA ACGATTCCTC CGGCAAAAAT TGAAAACATT CGTCAAGGTA ATCAAGAGTG GGGTAAACGA ATTGTCATTG ATCTCGATCG TCCTACCTTT TGGTTAGTTA GTCAAGCCAA AAATGAAGGA GCAGTCATTT TTGAAGGGGT AGCGAGTCCA GAATTAATGG CTCAATATCA ACCCCCATCT ACAGCGTCAA CCCAAGGAAC AACCAATACT GATGAAGATG ATCTAGGAAG TGGATCTAAT ATTAAATTTG AGAGTCAACT GTTTTCTTTA GCAACCGAAA AAACGACGAC AAAATTATTA TTAAATTTAC CCACCGCTTA CGGAATTCGG GCGTTTAGTT TATCTAACCC TCCTCGTTTA GTGGTTGATG TTCGTCCCGA TGAAAAAGTA GAACGAGAAA TAACATGGAC ACCTGGACTC ATTTGGCGAC AAAAAATTAT TCCGCTTAAA GGAGATTCAT TTCCCGTTAC TTGGCTAGAC ATTGATTTAA AATCCCCTAA TATTTTCCTC AAACCGGTGA CATCTAATCC CGATACTCTA GAAGGAACTG AACCCATTGT TACTATAGGA AGAAACACAA CCGCCTCTGC GGCTATTAAT GGGGGATTTT TTAACCGGAA TAATCGCTTA CCTTTAGGGG CAATTCGGAC TAATAACCGT TGGGTTTCTG GGCCAATTCT CAACCGAGGG GCGATCGCTT GGAACGATCG AGGACAGGTG AAAATGGGGC GTTTACGTCT GCAAGAAACG GTCATTACTA ATGGGGGAAA TCGGCTGCCG GTATTGTATC TTAATAGTGG ATATGTTCAA TCTGGAATGG CACGTTATAC CCGTGACTGG GGCGCAACTT ATACCCCTTT AAGTGATGAT GAATTAATCA TTACGGTTCA AAATAATCAA GTCATTAGTC AACGACAAGG GGGAAAAGCC GGTCAAAATG TGATCCCTAT TCCTAATGAC GGGTATCTGT TAGCTATTCG TAAAAATAGT GTGCCGGCTT CTGCGTTAAC CATCGGGACT TCACTTAATT TAGAAAGTGG TACAATTCCG GCAGATTTTA ATAATTATCC CCATATTTTA GGGGCAGGGC CTTTATTATT GCTTAATGGT CAGATAGTTC TTGATGTTGC CTCAGAACAG TTTAGCAAAG GGTTTCAAAA CCAAAAAGCC TCCCGAAGTG CGATCGCTAC CACGAGAGAC GGAAAATTAA TGGTAGTGGC GGTTCATAAC CGAGTCGGGG GATCAGGGGC AAGTTTACCT GAATTAGCCC AAATTTTACA GAGTTTAGGG 
GCTGTAGATG CCTTAAATTT GGATGGGGGA AGTTCTACGT CTTTGGCGTT GGGAGGTCAA TTAATCGATC GTTCTCCGGT GACTGCGGCT AAAGTTCATA ATGGTATCGG AATTTTTATC GCCCCTAACC CTTAA
|
Protein sequence | MRKDNWYSLG IFSFSLVTVA ILYNVNNNSP NHSQKLNPHE TSFNSQTLLT STISNFKTQP VLAQGNFSGL QQGNEITING RKFPLAWSQW TEGTATRIGI SDTGVMNIFG LHLLSTPNPN IQPVVWFNND PNQPTLLKAK FIAPYRYLDV TDILRDAGTQ IQMVGNSLNL TIPPAKIENI RQGNQEWGKR IVIDLDRPTF WLVSQAKNEG AVIFEGVASP ELMAQYQPPS TASTQGTTNT DEDDLGSGSN IKFESQLFSL ATEKTTTKLL LNLPTAYGIR AFSLSNPPRL VVDVRPDEKV EREITWTPGL IWRQKIIPLK GDSFPVTWLD IDLKSPNIFL KPVTSNPDTL EGTEPIVTIG RNTTASAAIN GGFFNRNNRL PLGAIRTNNR WVSGPILNRG AIAWNDRGQV KMGRLRLQET VITNGGNRLP VLYLNSGYVQ SGMARYTRDW GATYTPLSDD ELIITVQNNQ VISQRQGGKA GQNVIPIPND GYLLAIRKNS VPASALTIGT SLNLESGTIP ADFNNYPHIL GAGPLLLLNG QIVLDVASEQ FSKGFQNQKA SRSAIATTRD GKLMVVAVHN RVGGSGASLP ELAQILQSLG AVDALNLDGG SSTSLALGGQ LIDRSPVTAA KVHNGIGIFI APNP
|
| |
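The gene and protein sequences above are related through translation table 11, and the GC content field is a simple base count. The sketch below shows one way to check both, assuming the space-separated 10-base groups are purely cosmetic. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tooling. The codon assignments of table 11 are the same as those of the standard code; the two tables differ only in which codons may act as start codons.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order for each position.
# Table 11 uses the same codon assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Remove the whitespace that separates the 10-base groups and uppercase."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 15 bases of the gene sequence listed above.
first_codons = "ATGAGAAAAG ATAATTGGTA TTCCTTGGGT"
last_codons = "GCCCCTAACC CTTAA"

print(translate(first_codons))  # MRKDNWYSLG
print(translate(last_codons))   # APNP
```

The two printed fragments match the first ten and last four residues of the protein sequence. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.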