Gene Haur_2993 details

Gene Information

Locus tag: Haur_2993
Symbol:
ID: 5734865
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3780514
End bp: 3781578
Gene Length: 1065 bp
Protein Length: 354 aa
Translation table: 11
GC content: 52%
IMG OID: 641280137
Product: saccharopine dehydrogenase
Protein accession: YP_001545759
Protein GI: 159899512
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1748] Saccharopine dehydrogenase and related proteins
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGCAA TCGTTATTTT GGGTGGTTAT GGCGTGGTTG GTTCGCAAAT TGCCCAGATT 
TTGCGCCAGA GTCACCCTGA TCTGCCGTTG ATTTTGGCGG GGCGTAATCC ACAGCAAGCC
CAGCAATTGG TTGCCGAACT TGGCGGGCCA ACCAGCGCGG CGGCGATCGA TGTGCTCAAG
CCGCAACCAT TGGCGGGGTT GCAACCACAG GCGATTATCA ATGCTGTCAA TGACCCGCAT
GATTATGTGT TGCAAGAAGC AGTGGCCCGT GGGATTCCTA TGGTTGATAT CACCCGTTGG
ACATCACGTT TACAACAAAC CTTGGATAGG ATTGATCGCA ACACCCTGAA GGCTCCATTG
CTGTTTGCCT CGCATTGGAT GGCGGGCTTG GCTAGTGTGG TGGCGTTTGC AGCAACCCAG
CAATTAGCCC AAACCGAACA GATGGATTTG CATGTGCTGT TTTCGCTCAA AGATAAAGCT
GGGCCAAATT CAATTGAATA TATGGAGCAT ATTGCTACGC CGTTTACCAT CACTGAGAAA
CATCAGCCAC GTGAGGTTTA CCCCTATACC GAGCCGCAAA CTGTGACTTT TCCCAATGGT
TATCGCGCCA AAACTTATCG CTTTGATACG CCTGATCAAT GGACATTGCC GCAAAGCACC
AAGGCTGCGA GTGTTTCGGC CCGCATCACC TTTGATGATC GTTTGACCAT GGGTTTGTTG
TTGGGCTTGG CGCGTTCTGG CGTGTGGAAA TTGTTGATGC ATCGGCGCTT TGATCGTTTG
CGCCATGCCT TGCTCTACAA TCCTGGCACG GGTGCGGCGC ATGAATTGGT GTGGCAGATC
AGCGGGCGTG ATCATGCTGG CAAAGCCCAG CAATTGACCA AAACAATTGT TGATCGCCAA
GGCCAAACTC ATCTGACGGC AGTGGGTGCG GTGATTCAGC TTGAAACAAT GTTGGGGCTA
GATGGCAGCC AAGCATTTGC GCCAGGCATT TATTTTCCTG AGACAGCGCC GAAATTGGCC
TATGCTTTGG GGCGGATGCA GCAGCTTGGG GTGGAATTAA ATTAA
 
Protein sequence
MQAIVILGGY GVVGSQIAQI LRQSHPDLPL ILAGRNPQQA QQLVAELGGP TSAAAIDVLK 
PQPLAGLQPQ AIINAVNDPH DYVLQEAVAR GIPMVDITRW TSRLQQTLDR IDRNTLKAPL
LFASHWMAGL ASVVAFAATQ QLAQTEQMDL HVLFSLKDKA GPNSIEYMEH IATPFTITEK
HQPREVYPYT EPQTVTFPNG YRAKTYRFDT PDQWTLPQST KAASVSARIT FDDRLTMGLL
LGLARSGVWK LLMHRRFDRL RHALLYNPGT GAAHELVWQI SGRDHAGKAQ QLTKTIVDRQ
GQTHLTAVGA VIQLETMLGL DGSQAFAPGI YFPETAPKLA YALGRMQQLG VELN
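
The length and GC fields in this record can be cross-checked against the coordinates and the sequence blocks. The sketch below is illustrative only and is not part of the database record. It assumes that the start and end coordinates are 1-based and inclusive (which is consistent with 3780514 to 3781578 giving 1065 bp) and that the CDS is unspliced and ends in a single stop codon. All function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of 10."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    length = gene_length(3780514, 3781578)
    print(length)                  # 1065
    print(protein_length(length))  # 354
```

To check the GC content field, paste the full gene sequence into gc_percent; the blocks of ten bases can be passed as they appear, since whitespace is stripped first.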