Gene Haur_3551 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3551 
Symbol 
ID5735410 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4468259 
End bp4469548 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content54% 
IMG OID641280698 
Productglutamate-1-semialdehyde aminotransferase 
Protein accessionYP_001546315 
Protein GI159900068 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0001] Glutamate-1-semialdehyde aminotransferase 
TIGRFAM ID[TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCAACG ATGCTTCTAG CGCGGCGTTT GAACGCGCCC AAGCACTTTT ACCAGGCGGA 
GTGAATAGCC CAGTGCGGGC TTTTCGGGGC GTTGGCGGCG TGCCACGCTT TATCGATCAT
GGCGCAGGAG CCTATCTCTA CGACATCGAT GGCAATCAAT ATATCGATTA TGTTTTGTCG
TGGGGGCCGT TAATTCTGGG CCACGCCTAT CCAGCAGTAG TCGAGGCAAT TTGTGCCCAA
GCTCAACGTG GCACAAGCTT TGGTGCACCA ACCGAGCTTG AAAGCGAATT GGCCGAGTTG
GTGATCGCCG CAGTACCAAG TGTCGAGATG GTGCGCTTTG TTTCGTCGGG CACTGAAGCC
GCGATGAGCG CAATTCGTTT GGCGCGGGCT TACACCCAAC GCGAGAAAAT TATCAAATTT
GAGGGTTGCT ACCACGGCCA TGCTGATCCA TTTTTGGTGC AAGCTGGCTC AGGTGTGGCA
ACCTTAGGCT TGCCCGATAG CCCAGGCGTT TTGAAAAGCG CTACCAGCAA CACCCTGACC
GCACCATTTA ACGATCTTGA AGCAGTCGAA GCGCTATTTA AAGCCAATGC TGGGCAAGTT
GCCGCCTTGG TAATCGAGCC TGTGGCAGGC AATATGGGCT TTGTACTGCC ACGCGAAGGC
TATCTTGCAG GCCTGCGCCA ACTTTGCGAT CAATACGGGG CATTATTGAT TTTCGACGAA
GTAATGACGG GCTTTCGCGT GGCCTACGGT GGAGCACAAG CCTACTTCAA CGTGATGCCC
GATTTGACCT GCTTGGGCAA AGTAGTAGGC GGCGGTTTGC CAGCGGCGGC CTATGGCGGA
CGACGCGAGA TTATGCAGAT GGTCGCTCCA GCTGGCACAA TGTATCAAGC TGGCACGCTT
TCGGGCAACC CACTGGCGAT GGTCGCTGGC ATTGTAACTT TACGCGAAAT TGCCAAGCCC
GAAGTTTTCG AGCGCTTAAC TGGTGTAACT TCGACGCTGT GTCAAGGCTT TTGGAAGGCC
GCCTTCAAAA ATGGCATTCC CTTCCAAGCG CATAAAGCTG GCAGTATGTG GGGCTTCTTC
TTTGCTGGCG ATGAGGTTTA TGATTTCACA TCGGCCAAGC GGGCTGATAC CGCCATGTTT
GGCAAATTCT TCCATGCCAT GCTGGAGCAA GGCGTGTATC TTGCGCCGTC GCAATTTGAG
GCCGCCTTTG TCTCAACGGC CCATACCGAC GAACTCGTCG CTCAAACGAT AAATGCAGCC
CAAGCGGCTT TTGCCAGCAT TCGCAGCTAA
 
Protein sequence
MINDASSAAF ERAQALLPGG VNSPVRAFRG VGGVPRFIDH GAGAYLYDID GNQYIDYVLS 
WGPLILGHAY PAVVEAICAQ AQRGTSFGAP TELESELAEL VIAAVPSVEM VRFVSSGTEA
AMSAIRLARA YTQREKIIKF EGCYHGHADP FLVQAGSGVA TLGLPDSPGV LKSATSNTLT
APFNDLEAVE ALFKANAGQV AALVIEPVAG NMGFVLPREG YLAGLRQLCD QYGALLIFDE
VMTGFRVAYG GAQAYFNVMP DLTCLGKVVG GGLPAAAYGG RREIMQMVAP AGTMYQAGTL
SGNPLAMVAG IVTLREIAKP EVFERLTGVT STLCQGFWKA AFKNGIPFQA HKAGSMWGFF
FAGDEVYDFT SAKRADTAMF GKFFHAMLEQ GVYLAPSQFE AAFVSTAHTD ELVAQTINAA
QAAFASIRS