Gene Haur_0919 details

Gene Information

Locus tag: Haur_0919
Symbol:
ID: 5732688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 1050704
End bp: 1052536
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 54%
IMG OID: 641278051
Product: hypothetical protein
Protein accession: YP_001543695
Protein GI: 159897448
COG category:
COG ID:
TIGRFAM ID: [TIGR03605] SagB-type dehydrogenase domain
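
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1050704, 1052536)
print(length, protein_length(length))  # prints "1833 610"
```

Both results agree with the Gene Length and Protein Length fields listed for Haur_0919.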


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCCAGG CAATTGATTT ACACAGCCAA CTCCAGAGCG ATCCGCAGGT GCAACTTCCA 
ACTCGCCCAC GGATCTCCGA TGAAATTGCC CGAGTTTGGC ATAACGAACA GCTGTTGTTG
TTGCTGGGCG GGCCGCAAGA TGTGGTGTTG CGTGGCAAAG CGGTGCGCAA TCTGCTGCCG
CAACTCTTGC CCTTGCTGAA TGGCCTCAAC ACCCGCGAAA CGATTGGCGA ACGCTTGGCC
CAATTCAAGC CCACAACGAT TGACGATAGC TTGCAGTTGT TGCATGTGCA GGGGGTGCTT
GAAGAAGGCC CATTGCCGAG CAATACACTT GATCCGCAAG TTGCCCGTGC GTTCAGTCAA
CAACTAGCTT TTTACAGCCG CTTTGTTGAC CAAACGCGGG TCTGTCGCAA TCGTTATGAA
GTGCTGCAAC GCTTGCAAAC AACGCCTTTG TTGTTGATTG GTGGCGAACG CTTGGTTGCT
CCATTGCTGC ATCAATTGGC GCAAGCTGGC TTAGGCCGCG CCACTTGGCT GGCCAATTCA
GCTCCAACCA CAACCTATGC CCTGCCACAT TTGGCGCTTG ATCTTCAAGT TCCGCAGGCT
GATCAATTAC CTGCAGCAGT TGATCATTGG TTAGCTGAGC ATCACGATGG CTTGATCTGT
CTGCTGACCA GCACGGCGCA GGCCGAATTA ACCCAAGCGC TCAATCAACG GGCAATTCAA
GCGCAAACTC GCTTTACCCG ACTTTGGCTG CACCCACAGC ATATTGAACT TGGCCCAACC
ACCTTTGCTG GTGAAGCTGG CTGTTATGCC TGTGCCGAGC AAGTTGATAG CCACGAACCT
CAATTAAGCG AGCCAACCGC GCCGCTGAGC CTCAACGAGC AGTTAGCCTT GAGCCAAGCA
AGTTTAGTGG TTGGCAACCT AATTTCGGGT TTGAGTCCGG TGATTACGGG CGGCGTGCGT
TATGTGCTTG ATCCAACGAC GCTTGAATTT GTGGCGCAAT CGGTGCATCG TTTGGTGTTG
TGCCCAGTTT GCGGCCAGCC CAACCTCGAT GCGCCCCGCG ATCTGTTGGT GGGCGCGGGC
CATTTCGAGA ATTTGCCGCT GTGGTATCAC GCCAACACCG ATGAGCGCAG TTACGCGATT
TTCCCCAAGG CGCATCAGCA GCATTATGCG CCAAAAAATG CGATTGCAAT CGTCAGTGGA
GCCAAGGGTT ATACCAATAG CCAACGCTAT GCCCTGGGCG AATTGCAAGG CCAATGGCAG
CCTGATTCAA CTGTTGATCT GAGCGTTCAA CTGCCAGTTC AAACCTCGAT TGCGCCCTTG
GCTTGGCTGT TTGAGCAAGC ATTTGTGCGC AAGCCAGCCA CCCAGGTGAT CGCCGCTGGC
CAACGTTTTG TACCTTCGGG CGGCAATCTA GCCTCGCAAA CGGTCTATTT GCTCAATCAC
AAGCTGAGCA ATTTGGCGGC TGGCATCTAT CACTTGAACC AACATGACGC TTCGTTGGAA
GCAATGCGCC CACAGTTTAG CTGGGATGAC GTTGCCAAAG CCTTCCCCAG TGATCCGCTG
AATCAACAAA CCTTGGGTTT GATTGTGCTG ACGGCGGCGA TTGGGCGGGT TGAAGGCAAA
TATGGCGCAA AATCGTATCG GATCGCGCTC TACGATTGTG GCGTTGCCGC GCAGGCAATT
GAATTTTTGG CAGCTGCGGC AGGCTGGCAA ATCGAGCAAA TCAGCGCCTT CTACGATCAA
GAGCTGCGCG ATCTGCTCCA GGTCTATAGC CCAAGCGAAA CCCCGTTGCT GGTGCTGCGG
CTGGTTGCGC CCAATCCTGA GGTGCTGCAA TGA
 
Protein sequence
MTQAIDLHSQ LQSDPQVQLP TRPRISDEIA RVWHNEQLLL LLGGPQDVVL RGKAVRNLLP 
QLLPLLNGLN TRETIGERLA QFKPTTIDDS LQLLHVQGVL EEGPLPSNTL DPQVARAFSQ
QLAFYSRFVD QTRVCRNRYE VLQRLQTTPL LLIGGERLVA PLLHQLAQAG LGRATWLANS
APTTTYALPH LALDLQVPQA DQLPAAVDHW LAEHHDGLIC LLTSTAQAEL TQALNQRAIQ
AQTRFTRLWL HPQHIELGPT TFAGEAGCYA CAEQVDSHEP QLSEPTAPLS LNEQLALSQA
SLVVGNLISG LSPVITGGVR YVLDPTTLEF VAQSVHRLVL CPVCGQPNLD APRDLLVGAG
HFENLPLWYH ANTDERSYAI FPKAHQQHYA PKNAIAIVSG AKGYTNSQRY ALGELQGQWQ
PDSTVDLSVQ LPVQTSIAPL AWLFEQAFVR KPATQVIAAG QRFVPSGGNL ASQTVYLLNH
KLSNLAAGIY HLNQHDASLE AMRPQFSWDD VAKAFPSDPL NQQTLGLIVL TAAIGRVEGK
YGAKSYRIAL YDCGVAAQAI EFLAAAAGWQ IEQISAFYDQ ELRDLLQVYS PSETPLLVLR
LVAPNPEVLQ
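
The relationship between the gene sequence and the protein sequence above can be reproduced with a small translation routine. This is a minimal sketch, not the pipeline that generated this record: it builds the codon table by hand and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (table 11 differs in which codons may act as start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with their amino acids.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, in its 10-base blocks.
first_line = "ATGACCCAGG CAATTGATTT ACACAGCCAA CTCCAGAGCG ATCCGCAGGT GCAACTTCCA"
print(translate(first_line))  # prints "MTQAIDLHSQLQSDPQVQLP"
```

The output matches the first twenty residues of the protein sequence. Passing the full gene sequence to `translate` should yield the full protein under the same assumptions.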