Gene Haur_4994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4994 
Symbol 
ID5736830 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp6332762 
End bp6335176 
Gene Length2415 bp 
Protein Length804 aa 
Translation table11 
GC content52% 
IMG OID641282161 
Producthypothetical protein 
Protein accessionYP_001547752 
Protein GI159901505 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.158573 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGACGAG TTTTTTGTTG GTTAGGATGT TTATTAATTG GATTAAGCTT CGTGCCGAGC 
CATACCAGCA CGGTTGCGGC CACGTATCAA GTGCAAGCCG AGTTTCAACA ATTTTGGCAG
GCGAACGGCG GCGCAGCAAT CTTTGGTCAA CCAATAAGCG AAGCGCTGTG GTTCGATAGT
CAGTTGGTAC AATATTTCGA AAACCAACGC TTGCACTTGG TCAACAATCA AGTTGTGCCC
GCCAAGCTTG GGCTAGAGCT ATTTCAGGCC AGCCGCCGAA CCTGGCAAAA CCAACGCCAA
CATCTGCCAA CTAGCCAGTG TTTGCGGGTT GAAACAACCG AACGCTCAAT CTGCGAGCCG
TTTCGTCGCT ACTGGCAAGC CAATGGCGAG GCCAGTCGTT TGGGCAACCC CATCACTGAA
ACGGTCAGCG AGACTAATCC ATTAACTGGT CAGCAACAAA TTATGCAATA TTTTGAGCAG
CAATTGCTGA TCAGCACCAA CCAACAAATT ATGCTCAGCC CACTTGGACG GTGGCAGGCC
GATTGGCTGC TGAATGGCAC ACGCCAAGCC AGCCCAATTC GTGCCAATTT AACTGGCCCA
AGCCAACCAC TCAAGCCACT CGACCAATTT GAAATTCAAA TTGATGCTGG CAACTATAAC
GGCGCAGCCA ACCTACGCAT CTTTGATAGT GCTGGCCAAC TTGAAACCAG CCAAACCCTA
AACCTCACAG GCCAAAGCCA AACCCTCGCA TTCCAAGCCC AAGGCGCGTT AGGCCAGCAT
TACGCCGTCT TATTGATCGA TGGCAAGGTG GCGGCGATCA ACAGCAGCAT CTATCAACTT
GAGGCGACCA CCAGCCTTCA AACTGGAGTT GCGGCCTACG ATACGCTGCC CAACAAAGTG
CGTAGCTTTT TGCGCAACGA CCTATCGATC TATCAATATC AAGGCTACAC GATTCGCGGC
TATCGTTCGC CCGATAGCTA CCTGATTTGG CTGCGTGATC ATGTTCATCA AGGCTTGGGC
TATCGCTATT TTGAGCAAGA TATGACCAGC ACGCTAGATT ATTTTCGGCG TGAGCAAAAG
CCCAATGGAG CCTTCGACGA CTATTTTGCA ATGCTTGGCG GCGCACCAGT CCAAGGCCGT
ACCGCTGTCG AGGCTGATTT AGAATATTTA TTTGTGCAGG GCGTGCATCA AGCCTGGCAA
GCCACTGGCG ATACTAGTTG GATGCTTGAG CAAAAACCTG CCATGCTACG CGGCATCAAC
TATAGCCTGA GCGATCCACA GCGCTGGAAT CCCACCCTGC GTCTAATTCG CCGCCCCTAT
ACGATTGACA CATGGGATTT TGAATATGGT GGGCCGACGA TTGCCCCTGA TGGCAAAACC
TCGCCACGCC ATTGGATCGA CGAAAAAACC CGCTTTGGAA TTATGCACGG CGATAATACG
GGCATGGCTC ATGGCTTGTA TTTGCTGAGC AAACTTGAAC TAACTCAAGC AAATTTTGAG
CAAGCTACTC AATGGCTGGT TCGTTCGCAA CAACTAACCA AGCAGCTGAA TCAGGTTGCG
TGGAACGGCA AATTCTACAC CCACAATGTT TTAGAGCAGC CTTTCGATAT TCCTGAACTC
GATGAGGCTC GCCAACTGTC GCTCTCCAAC TCGTATGCGC TCAATCGCTA TGGCATGGAA
GCCAGCAAGG CCTTGGCAAT CATCGACGAA TATTATCAGC GGCGGGTGGC TGATTCTAGC
AGTCTTTCCT CAGAATGGTT CAGCATCGAC CCGCCATTTC CTGCCGAGAG TTTTGGCACA
TTGCCAGGCT GGGGCAACGT GCCTGGCGAA TATGTTAATG GTGGGCGGAT GCCGCTAGTT
GGCGGCGAAT TAGCTCGTGG CGCATTTCGT TGGGGCCAGC CAGCCTATGG CTTTGATATT
CTGCGCCGCT ATGCCCAAAT GATTGAAGCC CAAGGCGGCA GCTATTTGTG GTATTACCCG
GTCGGCAACC CTGGTATCTC TGGCCCCGAC ACCCTCGCCA CCGACGGCTG GGGCAGCACG
GCAATGCTGG CAGCGCTAAT CGAAGGCGCA GCCGGGGTGA CCGATCAAAG TGCGTTGTAT
CAGCATGCAG TGCTCAGCCC ACGTTGGATT GTTGAGCCAG ATGTGCAACA AGCCCAGGTC
ACCACCCGCT ATGCTGCTTC GCAAGGCTAT ATGAGCTATC GCTGGCAACG CCAAGCTCGT
GGTTTTCAGC TTGATTTTAC CGGCAGCGCT GAACAAGTCA CATTGCAATT ATTGCTACCC
AACGATGCTC CACAACACGT TAATTTAACG ATCAATGGGT TACCATCATT CGGCCATGAA
CGCACAATCG GCCAAAGTCG CTACCTTGAA CTGCGGCTAA ACAAGGCCAC TGGTTCGATT
ATGGTCAATT GGTAG
 
Protein sequence
MRRVFCWLGC LLIGLSFVPS HTSTVAATYQ VQAEFQQFWQ ANGGAAIFGQ PISEALWFDS 
QLVQYFENQR LHLVNNQVVP AKLGLELFQA SRRTWQNQRQ HLPTSQCLRV ETTERSICEP
FRRYWQANGE ASRLGNPITE TVSETNPLTG QQQIMQYFEQ QLLISTNQQI MLSPLGRWQA
DWLLNGTRQA SPIRANLTGP SQPLKPLDQF EIQIDAGNYN GAANLRIFDS AGQLETSQTL
NLTGQSQTLA FQAQGALGQH YAVLLIDGKV AAINSSIYQL EATTSLQTGV AAYDTLPNKV
RSFLRNDLSI YQYQGYTIRG YRSPDSYLIW LRDHVHQGLG YRYFEQDMTS TLDYFRREQK
PNGAFDDYFA MLGGAPVQGR TAVEADLEYL FVQGVHQAWQ ATGDTSWMLE QKPAMLRGIN
YSLSDPQRWN PTLRLIRRPY TIDTWDFEYG GPTIAPDGKT SPRHWIDEKT RFGIMHGDNT
GMAHGLYLLS KLELTQANFE QATQWLVRSQ QLTKQLNQVA WNGKFYTHNV LEQPFDIPEL
DEARQLSLSN SYALNRYGME ASKALAIIDE YYQRRVADSS SLSSEWFSID PPFPAESFGT
LPGWGNVPGE YVNGGRMPLV GGELARGAFR WGQPAYGFDI LRRYAQMIEA QGGSYLWYYP
VGNPGISGPD TLATDGWGST AMLAALIEGA AGVTDQSALY QHAVLSPRWI VEPDVQQAQV
TTRYAASQGY MSYRWQRQAR GFQLDFTGSA EQVTLQLLLP NDAPQHVNLT INGLPSFGHE
RTIGQSRYLE LRLNKATGSI MVNW