Gene Haur_1455 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1455 
Symbol 
ID5736866 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1694190 
End bp1695449 
Gene Length1260 bp 
Protein Length419 aa 
Translation table11 
GC content51% 
IMG OID641278593 
Productcysteine desulfurase family protein 
Protein accessionYP_001544227 
Protein GI159897980 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0520] Selenocysteine lyase 
TIGRFAM ID[TIGR01976] cysteine desulfurase family protein, VC1184 subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000306553 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGTCG ATTTAGCGCC ATTTCGTAGT CACTTTCCAG CGTTAACCCA GACTCACGCG 
GGGAAACCCT TAGTTTTTTT TGATAATCCT GGTGGAACCC AAGTGCCCCA ACAAGTGATT
GGCCAGATGA CCGATTATTT GCGGCGTTCG GTTGCCAATA CCCATGGTGC TTTTATCACC
AGCCAGCGCA CCGATGCCGT CATTGACGAA TGTCATGCAG GGTTGGCGGC CTTGCTTGGC
GGTGAGCCAG ATGAAATTGT GCTGGGAGCC AACATGACTT CGCTCACATT TGCACTCAGT
CGTTCACTAG CTCGCGAATG GCAAGCTGGC GATGAGATTA TTCTGACCAC GCTTGACCAC
GATGCCAACG TTACGCCATG GCTGCTGGCT GCTGAAGAAC GTGGGGTTAT CGTACACTTT
GTGGATATTA ATCCTGTTGA TTGCACCCTG GTGATGAGTG ACTTTGAGCG CTATCTTTCG
CCGCGCACTA AATTGGTGGC CGTTGGTTGG GCCTCAAATG CCTTTGGCAC AATTAATGAT
GTTCAAACGA TTGTCAAACA AGCCCATGCT GTTGGTGCTT TGTGTTTTGT TGATGCGGTT
CAGAGCGTGC CCCACATTCC ATGCGATGTC AAAGCGCTTG ATGCCGATTT TGTGGCATGT
TCGGCCTATA AATTTTTTGG GCCGCATGTT GGGGTGCTCT GGGCCAAACG CGAACATCTA
GAGCGCCTGT TTGCTTATAA AGTGCGACCT GCCCCCGAAA CTTTGCCTAG TCGCTGGGAA
ACTGGCACGC AAAATTTTGA AGGCCAAGCG GGCATCAACG GAGCCTTGGA ATATCTCGGT
GGCTTAGGTG TGGGTTATAT GGAGCGCTAC GATCAGCTGC TTGGCGAAAC GGTTGGTCAA
CGGGCCGTCT TGTTGTCCGC AATGCATGCG ATTGCTGAAG CCGAGCAGAG CCTTGGCCAA
TATTTGATTC AAGCGTTGCA AACGCTTAAA GGTGTGCAAT TGTATGGCAT TTTGGAGCCT
GAACGTGGCC ATTTGCGCGT ACCCACCGTG GCATTTCGCA AGGCTGGAGT TACGCCCCAA
GCAATTGCCA AAACCTTTGG TAATGAGGGA ATTTGTGTTT GGGATGGCCA TTATTATGCC
TTGCGAGCCG TCGAACGCTT AGGCTTGCTT GATCAAGGGG GGATGGTGCG GGTTGGTTTA
GCCCATTACA ACACGCGCAC TGAGATTGAT CGTATGCTGA ATGTGCTTGA ATCAATTTAG
 
Protein sequence
MTVDLAPFRS HFPALTQTHA GKPLVFFDNP GGTQVPQQVI GQMTDYLRRS VANTHGAFIT 
SQRTDAVIDE CHAGLAALLG GEPDEIVLGA NMTSLTFALS RSLAREWQAG DEIILTTLDH
DANVTPWLLA AEERGVIVHF VDINPVDCTL VMSDFERYLS PRTKLVAVGW ASNAFGTIND
VQTIVKQAHA VGALCFVDAV QSVPHIPCDV KALDADFVAC SAYKFFGPHV GVLWAKREHL
ERLFAYKVRP APETLPSRWE TGTQNFEGQA GINGALEYLG GLGVGYMERY DQLLGETVGQ
RAVLLSAMHA IAEAEQSLGQ YLIQALQTLK GVQLYGILEP ERGHLRVPTV AFRKAGVTPQ
AIAKTFGNEG ICVWDGHYYA LRAVERLGLL DQGGMVRVGL AHYNTRTEID RMLNVLESI