Gene PCC8801_0418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPCC8801_0418 
Symbol 
ID7105807 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCyanothece sp. PCC 8801 
KingdomBacteria 
Replicon accessionNC_011726 
Strand
Start bp422555 
End bp423814 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content42% 
IMG OID643473528 
Productcysteine desulfurase, SufS subfamily 
Protein accessionYP_002370671 
Protein GI218245300 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01979] cysteine desulfurases, SufS subfamily 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCCA TACAAGAAAA AACTCTAGCC GCTAAAACCC GTTCTGACTT TCCTCTTTTG 
CATCAACAAA TCAATGGAAA GCCCTTAATT TATCTTGATA ATGCGGCAAC TTCTCAAAAG
CCCTTAGCTG TTATTAACAC CCTCAAAAAT TACTACGAAA ACGATAACGC CAACGTACAC
CGAGGAGCCC ATAGTTTAAG TATACGGGCA ACAGAAGCCT ATGAAGGAGC AAGAGATAAA
ATTGCTCAAT TTGTTAACGC TACTTCATCC CAAGAAATTG TTTTTACCCG TAATGCCACT
GAAGCCATTA ATTTAGTCGC CTATAGTTGG GGACTAACGA ATTTAAAACC AGGAGATGAA
ATCATCATTT CTGTGATGGA ACATCATAGT AATATTGTGC CGTGGCAAAT GATCGCACAA
AAGACAGGGG CAGTCATTAA ATATGTCCCT TTAACAGAAA CAGAAGAATT TGATTTAGAA
CAATTTAAAG CCCTATTATC CAATAAAACT AAATTAGTTG CCGTCGTCCA TGTTTCTAAC
ACCTTAGGCT GTATTAACCC CGTTGAAACC ATTATTAATC TCGCCCATCA AGCCGGGGCA
AAAGTCTTAA TTGACGCTTG TCAAAGTGTC CCCCATTTAG CCATTGATGT CCAAGCAATA
GACTGTGATT GGCTGGTTGC CAGTGGCCAT AAAATGTGCG CTCCTACGGG GATTGGGTTC
CTCTATGGCA AAAAAGCCAT TTTAGAAGAA ATGCCCCCTT TCCTGGGCGG AGGAGAGATG
ATAGGAGAGG TTTTTTTTGA TGGGTTCACC TACGGAGAAT TACCCCATAA ATTTGAAGCC
GGAACCCCCG CCATTGGAGA AGCGATCGCC CTAGGGGCAG CCGTGGATTA TTTAACCACC
CTTAGCTTTA AAGAAATTCA TGCCTATGAA GAAGAATTAA CTGCCTATTT GTTCAAGAAA
TTGCTCGAAA TTCCTAAGCT AAGAATTTAC GGACCAAAAC CTACTATCGA TGGCAAAGGA
AGGGCTGCCT TAGCCTCATT TAATATCGAA GGAATCCACG CCAGCGACTT ATCAACCTTG
CTAGATAATG AAGGCATTGC CATCCGTTCA GGACACCACT GTACTCAACC TTTACATCGT
CTGTTTGATG CGTCAGGAAG TGCCAGAATT AGCTTATATT TCTATAACAC CCGCGAAGAA
ATTGATCGCT TTATTTTAGC CTTACAAGAA ACTATCGATT TTTTTGGGAT GATGCTGTAG
 
Protein sequence
MTAIQEKTLA AKTRSDFPLL HQQINGKPLI YLDNAATSQK PLAVINTLKN YYENDNANVH 
RGAHSLSIRA TEAYEGARDK IAQFVNATSS QEIVFTRNAT EAINLVAYSW GLTNLKPGDE
IIISVMEHHS NIVPWQMIAQ KTGAVIKYVP LTETEEFDLE QFKALLSNKT KLVAVVHVSN
TLGCINPVET IINLAHQAGA KVLIDACQSV PHLAIDVQAI DCDWLVASGH KMCAPTGIGF
LYGKKAILEE MPPFLGGGEM IGEVFFDGFT YGELPHKFEA GTPAIGEAIA LGAAVDYLTT
LSFKEIHAYE EELTAYLFKK LLEIPKLRIY GPKPTIDGKG RAALASFNIE GIHASDLSTL
LDNEGIAIRS GHHCTQPLHR LFDASGSARI SLYFYNTREE IDRFILALQE TIDFFGMML