Gene Haur_4196 details


Gene Information

Locus tag: Haur_4196
Symbol:
ID: 5736058
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5352901
End bp: 5353956
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 53%
IMG OID: 641281351
Product: aspartate-semialdehyde dehydrogenase
Protein accession: YP_001546956
Protein GI: 159900709
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0136] Aspartate-semialdehyde dehydrogenase
TIGRFAM ID: [TIGR00978] aspartate-semialdehyde dehydrogenase (non-peptidoglycan organisms)
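
The coordinate and length fields above can be cross-checked against each other. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; both assumptions are inferred from the listed numbers rather than stated in the record, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 5352901
end_bp = 5353956

# Assumption: coordinates are 1-based and inclusive,
# so the span length is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: one codon per residue, with the final stop codon not counted.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1056 351
```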


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0158843
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAGA TTCCTGTTGG TGTGCTAGGC GCAACAGGGA TGGTTGGACA ACGCTTTTTG 
AGTTTGTTGA TCGATCACCC CTATTTTGAA GTAACGGTAG TTGCCGCTTC GGAGCGTTCG
GCTGGTAAGA CCTTGGCCGA AGCTGGCCGC TGGATGATCG GCGGCGAAAT GCCTAGCCAA
TTTGCCAAGA TGACCGTCCA GCCCGTTGAT CAGGTTGGCG ATGTTGGCTT AGTGTTCAGT
GCCTTGCCCA GTGATGCTGC TGGCCCGACC GAAATTGCTT GGGCCAAATC AGGCGCGTTG
GTTTTCTCAA ATGCTGGTGC ACATCGCCGC GATCCACTTG TGCCGCTGTT AGTGCCCGAA
GTCAACCCTG ACCATGTTAA TTTGTTGGAA TTACAACGCC AACAGTATGG CTGGAGCGGC
GGCATTTTGA CCAACCCCAA CTGCACAACC ACCCATGCTG TGTTGCCAAT GCGAGCTTTG
CACGATGCCT TTGGCCTCAC CAAGGTTTTG TTGGTGAGCA TGCAAGCAAT TTCGGGCGCA
GGCTATCCAG GTGTGCCAAG CCTCGACATT ATTGACAATG TTGTGCCGTT GATCAAAGGC
GAAGAAGAAA AAGTTGAGTG GGAGCCACGC AAGCTTTTGG GTACGCTTGG CGCTCAGGGC
GTGGAAGAAG CCCAAATCAC AATTAGCGCT CATTGTAATC GGGTGGCGGT TATTGATGGC
CATACCGAGT GTTTATCATT GGCGTTCGAG CGTCCGCCCG CCGATGAAGC TGAATTGATC
GCGGTCTTGC GTGAATTTAA GGCTGAACCA CAAGCGCTCA ATTTGCCAAG TGCGCCAGCC
CACCCAATTT TGGTAACTGA ATTGGCCGAT GGCCCGCAGC CACGCCGTGA TCGCGATGCT
GAGCGTGGTA TGGCGACGAC CGTTGGGCGC GTTCGCCGTT GCCCAATTCT TGATTACAAA
TTAGTTTTGT TGGGGCACAA TACCTTGCGC GGAGCCGCTG GGGGTTCGTT GCTCAACGCT
GAATTAATGG TTGCCAAAGG CTTGATCAAA GCCTAA
 
Protein sequence
MKKIPVGVLG ATGMVGQRFL SLLIDHPYFE VTVVAASERS AGKTLAEAGR WMIGGEMPSQ 
FAKMTVQPVD QVGDVGLVFS ALPSDAAGPT EIAWAKSGAL VFSNAGAHRR DPLVPLLVPE
VNPDHVNLLE LQRQQYGWSG GILTNPNCTT THAVLPMRAL HDAFGLTKVL LVSMQAISGA
GYPGVPSLDI IDNVVPLIKG EEEKVEWEPR KLLGTLGAQG VEEAQITISA HCNRVAVIDG
HTECLSLAFE RPPADEAELI AVLREFKAEP QALNLPSAPA HPILVTELAD GPQPRRDRDA
ERGMATTVGR VRRCPILDYK LVLLGHNTLR GAAGGSLLNA ELMVAKGLIK A
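
The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch of how the two relate, the helpers below (hypothetical names, not part of the record) strip the block spacing, compute GC content, and translate codons up to the first stop. The codon assignments used are the standard ones, which translation table 11 shares; table 11 differs mainly in the start codons it permits, and that is not modelled here.

```python
from itertools import product

# Standard codon assignments in TCAG order (the same assignments apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence above.
fragment = clean_sequence("ATGAAGAAGA TTCCTGTTGG TGTGCTAGGC")
print(translate(fragment))  # MKKIPVGVLG
```

Running `translate` and `gc_percent` on the full cleaned gene sequence should allow a comparison with the protein sequence and the GC content field listed in the record.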