Gene Haur_4453 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4453 
Symbol 
ID5736304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5695880 
End bp5696860 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content51% 
IMG OID641281616 
ProductHEAT repeat-containing PBS lyase 
Protein accessionYP_001547213 
Protein GI159900966 
COG category[C] Energy production and conversion 
COG ID[COG1413] FOG: HEAT repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0155685 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTAGAC CAGCTATTTT GCCACTGAGA GATGTACAAA TGCAGCCAAT TGAACAAACC 
GAAACCTACA GCTCCGCCGA AATGCAGGCG CTGGCACAGC ACCCTGAGGT TGAGCAACGC
CAAACCATCG CTTACGATCT GCGTTTCTCG ACCAACCCCT TAACCCTGGC AATCTTGATT
GACTTATTAA ATGATCCAGA TCAAGCAGTG CAAATTAACG CAATTCGTTC GATTGGTTTG
TGGGGTGCAC GGCGCAATTC GCCCAGCCTG ATGCAGCCAG CAACTCAGGC TCTATTAGCG
CTAGTTCAAC AATCACACGA GCAGATGCTA CTTGATCACG GGTTAATTAG CCTTGGTGAA
ATTGGCGATC AGTTATCAAT TGACTGGTGT TTAAGCCAGC TGATCAATCA ACCTCGTTCA
CGCTTATGCG CGGCAACAGC CTTGGGTATG CTCAAAGCCG AAAAAGCTCG CCCGTGGCTC
TTGACCATTC TAGCCGATCC GTGCCAAGCA TCGATTGTCC GCACAACCTG TATTGAGGCA
TTGAGTCAGC TCGCATTCGA CCTTCCCACC AACCAAACCT TGATCGCCGC GCTGCAAGAT
TCCGTAGCCG AAGTGCGCGA AAAGGCTGGC TTAGCGCTCT GTAAGTTGGG CGATTTTAGT
GCCTTCAAGC CAATCTGGGC CTACATTCGC CTTGAAACCG CAATCAAGCC CAGCCAAGTT
GCCCATGCAT TAGCCTTATT TGGCGACCAA GCATTTGAGC CAACCTTGGC TTTTTTAAAT
GATCCTGATC CCAATCTGCG CTATTGGGCC GCCTTAGCGC TCGGCATGTT CCACGATTCA
CGGGCGATTC CGGCCTTGAT TGCATTATTG AATGATCAAG CACAAACGCA CACCCGTGCC
GTGGTTGCAA CCGCAGCTCG CAAATCGCTT AACCGCCTCC AAAATTTGGC GGTTGGCAAC
CCTGACAGCA CTTTAACGTA G
 
Protein sequence
MLRPAILPLR DVQMQPIEQT ETYSSAEMQA LAQHPEVEQR QTIAYDLRFS TNPLTLAILI 
DLLNDPDQAV QINAIRSIGL WGARRNSPSL MQPATQALLA LVQQSHEQML LDHGLISLGE
IGDQLSIDWC LSQLINQPRS RLCAATALGM LKAEKARPWL LTILADPCQA SIVRTTCIEA
LSQLAFDLPT NQTLIAALQD SVAEVREKAG LALCKLGDFS AFKPIWAYIR LETAIKPSQV
AHALALFGDQ AFEPTLAFLN DPDPNLRYWA ALALGMFHDS RAIPALIALL NDQAQTHTRA
VVATAARKSL NRLQNLAVGN PDSTLT