Gene Haur_0409 details


Gene Information

Locus tag: Haur_0409
Symbol:
ID: 5732305
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 479199
End bp: 480437
Gene Length: 1239 bp
Protein Length: 412 aa
Translation table: 11
GC content: 53%
IMG OID: 641277532
Product: SufS subfamily cysteine desulfurase
Protein accession: YP_001543188
Protein GI: 159896941
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0520] Selenocysteine lyase
TIGRFAM ID: [TIGR01979] cysteine desulfurases, SufS subfamily
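
The coordinate and length fields of this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene span includes the stop codon. The record does not state either point, but both are consistent with the listed lengths. The function names are hypothetical.

```python
# Values copied from the record; the helper names are illustrative.
START_BP = 479199
END_BP = 480437


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one for the stop codon.

    Assumes an unspliced CDS whose span includes the stop codon.
    """
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                           # 1239
print(expected_protein_length(length_bp))  # 412
```

Both results match the Gene Length (1239 bp) and Protein Length (412 aa) fields.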


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTGTAC AAACAACCCT TGATATTCAG GCCATTCGCG AGCAATTCCC GCTCTTGGAT 
CAATCGATTA ACGGCCATCG CCTAGCCTAT TTGGATAGCA CCGCAACTGC CCAAAAGCCG
CTAGCAGTGC TTGATGCGAT GGATCGCTAT TATCGCACGA TTAATGCGAA TGTTCATCGA
GGTGTGTATC AGATTAGCGA AGCCGCCACC GAAGCCTATG AAGGCACGCG CCGCACGATT
GGCCGCTTTA TCGGCGCAAA ATCGACCAAA GAAATTATTT TTACTCGCAA CGCGACCGAA
GCGATTAACT TGGTTGCCCA AAGCTGGGGC CGTGCTAATT TGCAAGCGGG CGATCGAATT
TTGCTCACAG TCAGCGAGCA TCATTCAAAT TTAGTGCCAT GGCAATTGCT AGCAGCCCAA
ACTGGTGTAG AGCTTGATTT TATCGAGCTT GATGATCAAG GCCGACTTGA TCTCAGCCAC
CTTGATCAAC TATTGACTGA ACGCACCAAA TTGGTCGCCA TGACCCACAT GTCGAATGTG
TTGGGCACGA TCAATCCAGT TGAACGGGTG ATTGCGGCGG CCAAACAGGT TGGAGCCTTG
GTGCTGCTGG ATGGGGCGCA AAGTGTGCCA CATATTCCCG TCAATGTTCA AGCACTTGGC
TGCGATTTCT TGGCCTTTTC GGGGCATAAA ATGTGCGGTC CAACTGGGAT TGGGGTGCTG
TGGGCGCGGC GCGAATTGCT TGAAGCCATG CCGCCGTTTA TGGGTGGCGG CGATATGATC
AAACGGGTCG GGCTACGCGA AAGCTCATGG AACGATCTCC CATGGAAATT CGAGGCAGGC
ACGCCAGCGA TTGCCGAGGC GATTGGCCTT GGCGCGGCGA TTGACTTCTT GAATGAACTT
GGGATGCAGG CGATTCACGA GCGCGAACGC CAATTGACCC ACTACGCTTG GGATAAACTC
AGCGCCATCG ATGGGTTGAC CATTTTTGGT CCACCTGCTG CCGAGCGCGG TGGCTTGTTG
AGCTTTACCC TTGCAGGTGT GCATGCCCAC GATGTGGCAG CGATTCTCGA TACCCAAGGG
ATTGCAGTGC GGGCTGGGCA TCATTGCACC CATCCATTGC ACGATATTTT TGGCGTGCCA
GCAACGGTAC GCGCATCATT CTACCTATAC ACGCTTGAGG AAGAAATTGA TCGTTTGGCC
GAAGCCTTGG TTTTGGCTCG CGATACCTTC CAACTGTGA
 
Protein sequence
MSVQTTLDIQ AIREQFPLLD QSINGHRLAY LDSTATAQKP LAVLDAMDRY YRTINANVHR 
GVYQISEAAT EAYEGTRRTI GRFIGAKSTK EIIFTRNATE AINLVAQSWG RANLQAGDRI
LLTVSEHHSN LVPWQLLAAQ TGVELDFIEL DDQGRLDLSH LDQLLTERTK LVAMTHMSNV
LGTINPVERV IAAAKQVGAL VLLDGAQSVP HIPVNVQALG CDFLAFSGHK MCGPTGIGVL
WARRELLEAM PPFMGGGDMI KRVGLRESSW NDLPWKFEAG TPAIAEAIGL GAAIDFLNEL
GMQAIHERER QLTHYAWDKL SAIDGLTIFG PPAAERGGLL SFTLAGVHAH DVAAILDTQG
IAVRAGHHCT HPLHDIFGVP ATVRASFYLY TLEEEIDRLA EALVLARDTF QL
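
The sequences in this record are shown in blocks of ten characters, so the spaces and line breaks need to be stripped before any analysis. The following is a minimal sketch, not the method used by the database, of cleaning a blocked sequence, computing its GC fraction, and translating it. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in permitted start codons). All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. The amino-acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(blocked: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon.

    Codons not in the table (for example, containing N) become 'X'.
    """
    seq = clean_sequence(blocked)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record.
first_blocks = "ATGTCTGTAC AAACAACCCT TGATATTCAG"
print(translate(first_blocks))  # MSVQTTLDIQ

# Last nine bases of the gene sequence: two codons followed by the TGA stop.
print(translate("CAACTGTGA"))   # QL
```

The two outputs agree with the first ten and last two residues of the protein sequence above. Applying `gc_fraction` to the complete gene sequence gives a value that can be compared against the GC content field.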