Gene Haur_4588 details

Gene Information

Locus tag: Haur_4588
Symbol:
ID: 5736433
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5869383
End bp: 5870666
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 52%
IMG OID: 641281750
Product: hypothetical protein
Protein accession: YP_001547347
Protein GI: 159901100
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
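
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the relationship can be checked with a short calculation. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the gene length includes the terminal stop codon (both assumptions agree with the values in this record).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length_aa(length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default,
    that the last codon is a stop codon that is not translated.
    """
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = length_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record for Haur_4588.
length = gene_length_bp(5869383, 5870666)
print(length)                              # 1284
print(expected_protein_length_aa(length))  # 427
```

Both results match the Gene Length (1284 bp) and Protein Length (427 aa) fields.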


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0268669
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACCCTG CATCTAGTTT AGAGCTGCTT GGGCTAGCGA TGTGTCTCAT TATTGTGAGT 
GTCGCCTCAG CAGCCGAAGC AGCGCTCTCT ACTATCTCAC GCCACCGAAT GAATACGCTG
TTGGAGAACG GCGAACGCCG CGCCAACGTG GCTATGCGCC TGTTGGAAGA CCCCTATCAT
CTGCGAATTG GGACGTTGTT GCTCAGTACC GTCGGGACGA TTGGCGCAAC GGCGCTGCTC
TTGACGTTTG TCGCCAATCG CACTGGTTGG CAGCAAGCCT CGTTTCTGTT GATCTTTCTG
CTGGTTTGGC TCACCATCGG CACAACCTTA CCCAAAACCC TCGCCACCGC CTATCCCGAT
AGTATGGTGC TTTGGACGGT GCGTCCATTG CGGATTTTGC TGTGGCTGGT TGCGCCAATG
CAATGGCTGT TGCGCCAAAT TGCCCGCCCA ATTGGCTTGG TAACTGGCCA GCATCCTCTC
ATCGTTACCG AGGAAGAACT GAAATTAATG GTCAACGTCG GCGAAGAAGA GGGCGTGATC
GAGGCCGAAG AACGCGATAT GATCGAGGGT ATTTTTGAAT TTAGCGATAC GGCTGTACGC
GAAGTGATGG TCCCACGGAT TGATATCGTC GGTTTGGCTG CGACTGCCTC GATGGAAGAA
GCACTGAATT TGTTTATTAG TGCTGGCCAC TCGCGGTTGC CGATCTACGA CGAGTCAATT
GACCATGTGC TCGGAGTGTT GTACGCTAAG GATCTGTTTC CGCTGTTGCG TGATGGGTTG
CGTGATGCGC CATTACGCTC CTTGGTACGC CAGTCCTATT TTGTGCCCGA TTCGATCAAA
GTTGATGATT TGATGCGGGC TTTGCAAAGC CGCAAAGTGC ACATGGCGAT TATCGTCGAT
GAATATGGCA GCACGGCGGG CTTAGTCACA ATTGAGGATT TGCTGGAAGA GATCGTTGGC
GAAATTCAGG ATGAGTTCGA TAGCGAAGAA GCGCCGATTC AACAGGTTGG CCCTCACGAA
TGGCTGTTCG ATGGTCGGGT CTCAATTGAT GCAGTCAATG ATGCCACAGA GTTAACATTA
ATCAATGACG ATGTAGATAG TCTCGGTGGC TTCGTTTTGT CGATGTTAGG CTCAATGCCC
AAAGTCGGCG ATGTTATCCA AGCTGGAGAT ACAACAATTG AGGTTGTTAC GATTCAAGGC
TTGCGACCAC AACGCCTACG CCTAAGCTTA GCCCATGCCG AACACGAGTT CGCCGAGGTT
GGGAGTAATA CCGATGATGG TTGA
 
Protein sequence
MDPASSLELL GLAMCLIIVS VASAAEAALS TISRHRMNTL LENGERRANV AMRLLEDPYH 
LRIGTLLLST VGTIGATALL LTFVANRTGW QQASFLLIFL LVWLTIGTTL PKTLATAYPD
SMVLWTVRPL RILLWLVAPM QWLLRQIARP IGLVTGQHPL IVTEEELKLM VNVGEEEGVI
EAEERDMIEG IFEFSDTAVR EVMVPRIDIV GLAATASMEE ALNLFISAGH SRLPIYDESI
DHVLGVLYAK DLFPLLRDGL RDAPLRSLVR QSYFVPDSIK VDDLMRALQS RKVHMAIIVD
EYGSTAGLVT IEDLLEEIVG EIQDEFDSEE APIQQVGPHE WLFDGRVSID AVNDATELTL
INDDVDSLGG FVLSMLGSMP KVGDVIQAGD TTIEVVTIQG LRPQRLRLSL AHAEHEFAEV
GSNTDDG
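
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one possible way to do that, compute a GC percentage, and translate the gene. All function names are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares. Table 11's alternative start codons are not handled here; this gene begins with ATG, so that simplification should not matter for this record.

```python
# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# The first 24 bases of the gene sequence, as printed in the record.
fragment = join_blocks("ATGGACCCTG CATCTAGTTT AGAG")
print(translate(fragment))  # MDPASSLE
```

The translated fragment agrees with the first eight residues of the protein sequence listed above. Running the same functions on the full gene sequence is one way to check it against the Protein Length and GC content fields.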