Gene Haur_4731 details

Gene Information

Locus tag: Haur_4731
Symbol:
ID: 5736575
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 6041625
End bp: 6043178
Gene Length: 1554 bp
Protein Length: 517 aa
Translation table: 11
GC content: 52%
IMG OID: 641281896
Product: putative delta-1-pyrroline-5-carboxylate dehydrogenase
Protein accession: YP_001547490
Protein GI: 159901243
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR01237] delta-1-pyrroline-5-carboxylate dehydrogenase, group 2, putative
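
The gene and protein lengths listed above follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length (the function names are illustrative and not part of the record):

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
START_BP = 6041625
END_BP = 6043178


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature spanning start..end inclusive."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length)                  # 1554
print(protein_length(length))  # 517
```

Both values agree with the Gene Length and Protein Length fields of the record.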


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.362278
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTACCCG AGTACCGCAA CGAACCGTTT GTCGATTTCA GTGTGAAAGC CAATGCCGAT 
GCGATGCGCA CGGCACTGAG CAAAGTTGGC GATGAATTAG GGCGTACTTA TCCGCTTCTG
ATTGGTGGCG AACATATCGA ATTGGCTGAT ACATTTGATT CACTGAATCC GGCCAAGCCA
AGCCAAGTTG TCGGCAGTTT TGCCAAGGCA ACGGTTGAGC ATGCCAATCA AGCGGTGGAA
GTTGCCGCCA CAACCTTCGA ATCATGGCGC AACGTCGCGG CAGAAGAACG CGCCCGCTAT
TTGTTCCGCG CTGCGGCAGT GATGCGCCGC CGCAAGTTTG AATTTATGGC GTGGTTGGTT
TATGAAGTAA GCAAAAGTTG GGCTGAGGCC GATGCTGATG TGGCCGAAGC CATCGACTTT
ATGGAATATT ATGGTCGTCA GGCGATTAAG TTTGGTGGCC CGCAACCAGT CGTCGCCTAC
AACGGCGAGG AAAATGAATT ACGCTATGTG CCGCTCGGCG TGACCGTGGT GATTCCGCCA
TGGAACTTTG CCTTGGCAAT TATGGTCGGT ATGACCACCG CTGCGATTGC TGCTGGCAAT
ACGGTGGTGT TGAAACCAGC CTCAGCTTCG CCGGCGATTG CTGCCCAATT TGTGCGCTTG
TTGGTTGAAG AAGCTGGCTT GCCCGATGGG GTAGTTAATT TCGTGCCTGG TTCAGGCGGC
GCAATGGGCG ATGCCTTGGT TGATCACGCC AAAACTCGCT TGATCGCCTT TACTGGCTCG
AAGGAAATTG GCTTGCGGAT TTTTGAACGC TCAGCCAAAT TGCAACCTGG CCAAATCTGG
CTTAAACGCA CAATCTTAGA AATGGGCGGC AAAGATGGGA TTGTGGTTGA TGAAACCGCC
GATCTCGATG CTGCTGCCGA TGCAATTGTA GCCTCAGCTT TTGGCTTCCA AGGCCAAAAA
TGCTCAGCCT GCTCGCGGGC AATTATCGTC GATAGCGTGT ATAGCACAGT CTTGAAGAAG
GTCGTTGATC GCACCAAAAA GCTGACCATG GGCGATCCAA CCGATCCCAA ACATCATATG
GGCGCGGTGG TTGACCAAAA AGCCTTCGAC AAGATTCGCG AATACATTGA AATTGGCAAG
AGCGAAGGTC GCTTGATGCT TGGCGGCGAA ACTGGCGATG GCTCGGAAGG TTATTTTATT
CCCCCAACAA TCATCGCCGA TATTGCCCCC GAGGCTCGGC TCTCGTTGGA AGAAATTTTC
GGGCCAGTCT TGGCATTTAT CAAGGCCAAC GATTGGAAGC ATGCCTTGGA AATTGCCAAT
AACACCGAAT ATGGCTTAAC TGGCGCGGTG TTTAGTCGCT CACGCGAACG ACTTGAAGAA
GCTCGGCGCG ATTTCCATGT CGGCAACTTG TATTTCAACC GCAAATGCAC AGGCGCGTTG
GTTGGGGTGC AGCCGTTTGG TGGCTTCAAC ATGAGCGGCA CCGACTCGAA AGCTGGCGGC
CCCGATTACC TCTTGTTGTT CACTCAAGCC AAAACCATTA CTGATCGCTT CTAA
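
The sequence above is printed in groups of 10 bases, 60 per line, so the spacing has to be stripped before any analysis. A minimal sketch of that cleanup, together with a simple GC-fraction calculation of the kind that could underlie the GC content field (the helper names are hypothetical, and how the database itself computes and rounds GC content is an assumption):

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for 10-base grouping."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTTACCCG AGTACCGCAA CGAACCGTTT GTCGATTTCA GTGTGAAAGC CAATGCCGAT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing the full 1554 bp sequence to `gc_fraction` gives the gene-wide value. The figure for a single line will generally differ from the 52% reported for the whole gene.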
 
Protein sequence
MLPEYRNEPF VDFSVKANAD AMRTALSKVG DELGRTYPLL IGGEHIELAD TFDSLNPAKP 
SQVVGSFAKA TVEHANQAVE VAATTFESWR NVAAEERARY LFRAAAVMRR RKFEFMAWLV
YEVSKSWAEA DADVAEAIDF MEYYGRQAIK FGGPQPVVAY NGEENELRYV PLGVTVVIPP
WNFALAIMVG MTTAAIAAGN TVVLKPASAS PAIAAQFVRL LVEEAGLPDG VVNFVPGSGG
AMGDALVDHA KTRLIAFTGS KEIGLRIFER SAKLQPGQIW LKRTILEMGG KDGIVVDETA
DLDAAADAIV ASAFGFQGQK CSACSRAIIV DSVYSTVLKK VVDRTKKLTM GDPTDPKHHM
GAVVDQKAFD KIREYIEIGK SEGRLMLGGE TGDGSEGYFI PPTIIADIAP EARLSLEEIF
GPVLAFIKAN DWKHALEIAN NTEYGLTGAV FSRSRERLEE ARRDFHVGNL YFNRKCTGAL
VGVQPFGGFN MSGTDSKAGG PDYLLLFTQA KTITDRF
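
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. Below is a minimal sketch, assuming the standard codon-to-amino-acid assignments. Translation table 11 uses the same assignments and differs from the standard table only in the start codons it permits, which does not matter here because the gene begins with ATG. The function and constant names are illustrative.

```python
from itertools import product

# Codon table in the compact T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop the 10-base grouping
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence from the record.
print(translate("ATGTTACCCG AGTACCGCAA CGAACCGTTT"))  # MLPEYRNEPF
```

The output matches the first ten residues of the protein sequence. Applied to the full gene sequence, the same function stops at the terminal TAA codon.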