Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PCC8801_1783 |
Symbol | |
ID | 7105555 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cyanothece sp. PCC 8801 |
Kingdom | Bacteria |
Replicon accession | NC_011726 |
Strand | + |
Start bp | 1868160 |
End bp | 1869365 |
Gene Length | 1206 bp |
Protein Length | 401 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 643474851 |
Product | cysteine desulfurase NifS |
Protein accession | YP_002371985 |
Protein GI | 218246614 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes |
TIGRFAM ID | [TIGR03402] cysteine desulfurase NifS |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAGACT GCATTTATCT TGATAATAAT GCAACCACTC AAGTTGATGA GGAAGTATTA GGCGCAATGT TGCCTTACCT CACTCTCTAT TACGGAAACC CTTCGAGTAT GCACACTTTT GGCGGACAAG TTGGCAGTGC CATTAAAACC GCAAGAGAAC AGGTGGCTGC TTTATTAGGG GCCGAACCCT CGGAAATTGT CTTTACCAGT TGTGGAACGG AAGGGGATAA TGCAGCTATT CGCGCTGCGT TGGCTGCCCA ACCTAATAAA CGGCACATTA TTACCACAGA AGTCGAACAT CCGGCGATTT TGAATCTCTG CAAAAATTTA GAACGCCAGG GTTACACCGT TACCTATCTG TCGGTGAATA ACCAAGGACA GCTTGATCTC AGTGAACTAG AAGCGTCCTT AACCGGAAAT ACTGCCGTTG TCTCCATCAT GTATGCCAAC AACGAAACGG GGGTGATCTT CCCGGTGGAA CAGGTGGGAC AAATGGCTAA GGAATATGGG GCTCTGTTCC ATGTGGATGC AGTGCAAGCG GTGGGTAAAG TGCCTTTAAA TATGGCCGAA AGTACCATCG ATATGTTAAC CCTATCCGGC CATAAAATTC ATGCCCCCAA AGGGATTGGT GCATTATATG TCCGTCGTAA TACTCGTTTT CGTCCTTTGT TGATTGGCGG ACATCAAGAA CGGGGTCGTC GTGCCGGAAC CGAAAATGTG CCAGGGATCG TTGCGTTAGG AAAAGCCGCC GAATTGGCAG CCTATCACCT ACAATACGGG ACTTCTGAAC GGGAATTACG GGATTATTTA GAACAGACAA TTCTCACCAT TATTCCCGAT ACGGTATTAA ATGGTCATCC CGTGCAACGA TTACCGAATA CCTCAAATAT TGGCTTTAAA TTTATTGAAG GGGAAGCTAT TCTTTTATCC CTGAATCAAT ACGGAATCTG TGCTTCTTCG GGGTCAGCTT GTACCTCTGG ATCGCTAGAA CCGTCCCATA TTTTACGCGC AATGGGTCTT CCTTATAGTG TTTTACACGG CTCAATTCGC TTTAGTTTAT CGCGCTTTAC GACCCAAGAG CAAATCCAAA AAGTCCTCGA AGTCTTACCC GGAATTATTG ACCGACTCAG AGCATTATCG CCGTTTAACA GCGATGAAGC AGGTTGGTTA GTTGAACAAG AAAAAGCCGC CTTAGCTAAG TCATAA
|
Protein sequence | MKDCIYLDNN ATTQVDEEVL GAMLPYLTLY YGNPSSMHTF GGQVGSAIKT AREQVAALLG AEPSEIVFTS CGTEGDNAAI RAALAAQPNK RHIITTEVEH PAILNLCKNL ERQGYTVTYL SVNNQGQLDL SELEASLTGN TAVVSIMYAN NETGVIFPVE QVGQMAKEYG ALFHVDAVQA VGKVPLNMAE STIDMLTLSG HKIHAPKGIG ALYVRRNTRF RPLLIGGHQE RGRRAGTENV PGIVALGKAA ELAAYHLQYG TSERELRDYL EQTILTIIPD TVLNGHPVQR LPNTSNIGFK FIEGEAILLS LNQYGICASS GSACTSGSLE PSHILRAMGL PYSVLHGSIR FSLSRFTTQE QIQKVLEVLP GIIDRLRALS PFNSDEAGWL VEQEKAALAK S
|
| |