Gene Haur_1036 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1036 
Symbol 
ID5732940 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1182413 
End bp1183447 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content53% 
IMG OID641278171 
Productthreonine aldolase 
Protein accessionYP_001543812 
Protein GI159897565 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2008] Threonine aldolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGATT TGCGCAGCGA TACCGTTACG AAGCCAAGTT TGGCCATGCG TGAGGCCATG 
CATCATGCCG AGGTTGGTGA CGATGTGTTT GGTGATGATC CAACGGTCAA TCAATTACAG
CGCTATGCCG CCGATCTGGT TGGCAAAGAG GCGGCGATTT TTGTGCCAAG CGGCACGATG
GGCAATTTGG CGGCAATTTT GGCCCATGCT GGGCGTGGTC AAGAACTGTT GCTCGGTGAT
GAATCGCATA TTTATCATTA TGAGGCGGGT GGCGCTTCAG CCTTGGGTGG CTTGGTGTTT
CATCCCATCC CAACCAATGC CCAAGGCGAG CTGGATTTAG CGGCTTTGAA TGCGGCAGTA
CGCCCAGCCT ACGATGCCCA TGCGGCTCAG GCAGGCCTGG TGTGTCTAGA AAATAGTCAC
AATCGCTGTG GTGGCACAGT GCTTTCGTTA GAATATTTGG CGCAAGTACA GCAATGGGCC
AGCAGCCAAA ACTTGCCAGT GCATATGGAT GGCGCTCGGG TGTTCAATGC AGCGGTGGCG
CTCGGCGTGC CAGCTAGCAC CATCACCAAG CATGTCGATA GCGTGCAATT TTGCTTATCC
AAAGGCTTGG GTGCACCAAT TGGCTCGATT GTGGCTGGCT CAGGCGAGTT TATCAAAAAG
GTGCATCGCT GGCGCAAAAT GCTTGGCGGC GGCATGCGCC AAGTTGGCGT AGTCGCAGCG
GCGGGCATGA TTGCGCTCAA CGAAGGCCGC GAACGCTTGA TCGATGACCA TGTAAATGCC
AAAATGCTGG CCGAAGCACT CAGTCAATTG CCTCAAATTG AGCTTGATTT GGCTTCGGTG
CAAACCAATA TTATCGTTTT TGGCTTGCGT GATAGCACGT TTAGCCCTGA ACAATTGGTT
GAACGTTTAC GCCAAGCAGG GGTTTTGATC GTGCCATTCA AAGGCCGTTT ACGGGCTGTA
ACCCATGTTG ATGTTAATCG TGAGCAATGC ACCGAGGCCT CCAGCATCAT CGCCGAGGTG
CTGCAAAGCG CTTAA
 
Protein sequence
MIDLRSDTVT KPSLAMREAM HHAEVGDDVF GDDPTVNQLQ RYAADLVGKE AAIFVPSGTM 
GNLAAILAHA GRGQELLLGD ESHIYHYEAG GASALGGLVF HPIPTNAQGE LDLAALNAAV
RPAYDAHAAQ AGLVCLENSH NRCGGTVLSL EYLAQVQQWA SSQNLPVHMD GARVFNAAVA
LGVPASTITK HVDSVQFCLS KGLGAPIGSI VAGSGEFIKK VHRWRKMLGG GMRQVGVVAA
AGMIALNEGR ERLIDDHVNA KMLAEALSQL PQIELDLASV QTNIIVFGLR DSTFSPEQLV
ERLRQAGVLI VPFKGRLRAV THVDVNREQC TEASSIIAEV LQSA