Gene PCC8801_1783 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_1783 
Symbol 
ID7105555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp1868160 
End bp1869365 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content46% 
IMG OID643474851 
Productcysteine desulfurase NifS 
Protein accessionYP_002371985 
Protein GI218246614 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGACT GCATTTATCT TGATAATAAT GCAACCACTC AAGTTGATGA GGAAGTATTA 
GGCGCAATGT TGCCTTACCT CACTCTCTAT TACGGAAACC CTTCGAGTAT GCACACTTTT
GGCGGACAAG TTGGCAGTGC CATTAAAACC GCAAGAGAAC AGGTGGCTGC TTTATTAGGG
GCCGAACCCT CGGAAATTGT CTTTACCAGT TGTGGAACGG AAGGGGATAA TGCAGCTATT
CGCGCTGCGT TGGCTGCCCA ACCTAATAAA CGGCACATTA TTACCACAGA AGTCGAACAT
CCGGCGATTT TGAATCTCTG CAAAAATTTA GAACGCCAGG GTTACACCGT TACCTATCTG
TCGGTGAATA ACCAAGGACA GCTTGATCTC AGTGAACTAG AAGCGTCCTT AACCGGAAAT
ACTGCCGTTG TCTCCATCAT GTATGCCAAC AACGAAACGG GGGTGATCTT CCCGGTGGAA
CAGGTGGGAC AAATGGCTAA GGAATATGGG GCTCTGTTCC ATGTGGATGC AGTGCAAGCG
GTGGGTAAAG TGCCTTTAAA TATGGCCGAA AGTACCATCG ATATGTTAAC CCTATCCGGC
CATAAAATTC ATGCCCCCAA AGGGATTGGT GCATTATATG TCCGTCGTAA TACTCGTTTT
CGTCCTTTGT TGATTGGCGG ACATCAAGAA CGGGGTCGTC GTGCCGGAAC CGAAAATGTG
CCAGGGATCG TTGCGTTAGG AAAAGCCGCC GAATTGGCAG CCTATCACCT ACAATACGGG
ACTTCTGAAC GGGAATTACG GGATTATTTA GAACAGACAA TTCTCACCAT TATTCCCGAT
ACGGTATTAA ATGGTCATCC CGTGCAACGA TTACCGAATA CCTCAAATAT TGGCTTTAAA
TTTATTGAAG GGGAAGCTAT TCTTTTATCC CTGAATCAAT ACGGAATCTG TGCTTCTTCG
GGGTCAGCTT GTACCTCTGG ATCGCTAGAA CCGTCCCATA TTTTACGCGC AATGGGTCTT
CCTTATAGTG TTTTACACGG CTCAATTCGC TTTAGTTTAT CGCGCTTTAC GACCCAAGAG
CAAATCCAAA AAGTCCTCGA AGTCTTACCC GGAATTATTG ACCGACTCAG AGCATTATCG
CCGTTTAACA GCGATGAAGC AGGTTGGTTA GTTGAACAAG AAAAAGCCGC CTTAGCTAAG
TCATAA
 
Protein sequence
MKDCIYLDNN ATTQVDEEVL GAMLPYLTLY YGNPSSMHTF GGQVGSAIKT AREQVAALLG 
AEPSEIVFTS CGTEGDNAAI RAALAAQPNK RHIITTEVEH PAILNLCKNL ERQGYTVTYL
SVNNQGQLDL SELEASLTGN TAVVSIMYAN NETGVIFPVE QVGQMAKEYG ALFHVDAVQA
VGKVPLNMAE STIDMLTLSG HKIHAPKGIG ALYVRRNTRF RPLLIGGHQE RGRRAGTENV
PGIVALGKAA ELAAYHLQYG TSERELRDYL EQTILTIIPD TVLNGHPVQR LPNTSNIGFK
FIEGEAILLS LNQYGICASS GSACTSGSLE PSHILRAMGL PYSVLHGSIR FSLSRFTTQE
QIQKVLEVLP GIIDRLRALS PFNSDEAGWL VEQEKAALAK S