Gene Cyan8802_1811 details


Gene Information

Locus tag: Cyan8802_1811
Symbol:
ID: 8391125
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Cyanothece sp. PCC 8802
Kingdom: Bacteria
Replicon accession: NC_013161
Strand:
Start bp: 1842930
End bp: 1844135
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 46%
IMG OID: 644979798
Product: cysteine desulfurase NifS
Protein accession: YP_003137545
Protein GI: 257059657
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes
TIGRFAM ID: [TIGR03402] cysteine desulfurase NifS
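
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1842930
end_bp = 1844135

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1206 bp
print(protein_length, "aa")  # 401 aa
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.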


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGACT GCATTTATCT TGATAATAAT GCAACCACTC AAGTTGATGA GGAAGTATTA 
GGCGCAATGT TGCCTTACCT CACTCTCTAT TACGGAAACC CTTCGAGTAT GCACACTTTT
GGCGGACAAG TTGGCAGTGC CATTAAAACC GCAAGAGAAC AGGTGGCTGC TTTATTAGGG
GCCGAACCCT CGGAAATTGT CTTTACCAGT TGTGGAACGG AAGGGGATAA TGCAGCTATT
CGCGCTGCGT TGGCTGCCCA ACCTAATAAA CGGCACATTA TCACCACAGA AGTCGAACAT
CCGGCGATTT TGAATCTCTG CAAAAATTTA GAACGCCAGG GTTACACCGT TACCTATCTG
TCGGTGAATA ACCAAGGACA GCTTGATCTC AGTGAACTAG AAGCGTCCTT AACCGGAAAT
ACTGCCGTTG TCTCCATCAT GTATGCCAAC AACGAAACGG GGGTGATCTT CCCGGTGGAA
CAGGTGGGAC AGATGGCGAA AGAATACGGG GCTCTGTTCC ATGTGGATGC AGTGCAAGCG
GTGGGTAAAG TGCCTTTGAA TATGGCTGAA AGTACCATCG ATATGTTAAC CCTCTCCGGT
CATAAAATTC ATGCTCCCAA GGGGATTGGT GCATTGTATG TCCGTCGTAA TACTCGTTTT
CGTCCTTTGT TAATTGGCGG ACATCAAGAA CGGGGTCGTC GTGCCGGAAC CGAAAATGTG
CCAGGGATCG TTGCGTTAGG CAAAGCCGCC GAATTGGCAG CCTATCACCT ACAATACGGG
ACCTCTGAAC GGGAATTACG GGATTATTTA GAACAGACAA TTCTCACCAT TATTCCCGAT
ACAGTATTAA ATGGTCATCC CGTACAGCGA TTACCGAATA CCTCAAATAT TGGTTTTAAA
TTTATTGAAG GGGAAGCTAT TCTTTTATCC CTGAATCAAT ACGGAATCTG TGCTTCTTCG
GGGTCAGCTT GTACCTCTGG ATCCCTAGAA CCTTCCCATA TTTTACGCGC AATGGGTCTT
CCTTATAGTG TTTTACACGG CTCAATTCGC TTTAGTTTAT CGCGCTTTAC GACCCAAGAG
CAAATCCAAA AAGTCCTCGA AGTTTTACCC GGAATTATTG ACCGACTCAG AGCGTTATCG
CCGTTTAACA GCGATGAAGC AGGTTGGTTA GTTGAACAAG AAAAAGCCGC CTTAGCTAAG
TCATAA
 
Protein sequence
MKDCIYLDNN ATTQVDEEVL GAMLPYLTLY YGNPSSMHTF GGQVGSAIKT AREQVAALLG 
AEPSEIVFTS CGTEGDNAAI RAALAAQPNK RHIITTEVEH PAILNLCKNL ERQGYTVTYL
SVNNQGQLDL SELEASLTGN TAVVSIMYAN NETGVIFPVE QVGQMAKEYG ALFHVDAVQA
VGKVPLNMAE STIDMLTLSG HKIHAPKGIG ALYVRRNTRF RPLLIGGHQE RGRRAGTENV
PGIVALGKAA ELAAYHLQYG TSERELRDYL EQTILTIIPD TVLNGHPVQR LPNTSNIGFK
FIEGEAILLS LNQYGICASS GSACTSGSLE PSHILRAMGL PYSVLHGSIR FSLSRFTTQE
QIQKVLEVLP GIIDRLRALS PFNSDEAGWL VEQEKAALAK S
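
As a rough illustration of how the protein sequence relates to the gene sequence, the following sketch translates a nucleotide string codon by codon and computes its GC percentage. It is a minimal example, not the database's own pipeline. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares (table 11 differs only in the start codons it permits). The input is assumed to contain only A, C, G, and T plus whitespace.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... with the matching amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display; upper-case the bases."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base and stop at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 12 bases of the gene sequence shown above.
print(translate("ATGAAAGACT GCATTTATCT TGATAATAAT"))  # MKDCIYLDNN
print(translate("GCTAAG TCATAA"))                     # AKS
```

Passing the full gene sequence block to `translate` and `gc_percent` should allow a comparison against the protein sequence and the GC content field listed in the record.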