Gene Haur_3241 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3241 
Symbol 
ID5735109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4101921 
End bp4104290 
Gene Length2370 bp 
Protein Length789 aa 
Translation table11 
GC content53% 
IMG OID641280387 
Producthypothetical protein 
Protein accessionYP_001546006 
Protein GI159899759 
COG category[S] Function unknown 
COG ID[COG1432] Uncharacterized conserved protein 
TIGRFAM ID[TIGR00288] conserved hypothetical protein TIGR00288 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000142022 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTGACC GTACTCGATC ACTCGCCAGA ATTAAATGGC ATGCCACAAC CAAAATGACT 
GCCCAGGAGC ACATTGGAAT CCCTATGAAC AAACCCAAGC AGGATGTTGC GGTTTTTATC
GACTTCGAGA ACATTTATGT TTCAGTTCGA GAGAAGTTCG ACGCTACCCC TAATTTCGAA
GCTTTGATGG AACGTTGCGA GGACTATGGC CGTGTTGTAG TTGCTCGCGC TTACGCCGAT
TGGTATCGTT ACCCACGCAT TACCAGCGCT TTGTTTGCCA ATAATATCGA GCCAATGTAT
GTGCCAACCT ACTATTATGA TAAGGACGAA GGTCGCATGG GCCGTCCGAT CAAAAATAGT
GTGGATATGC ACATGTGTAT CGATGCAATG CGCACGCTCT ACACACGCAC CAACATTGGT
TCGTATATCT TTATCACTGG CGACCGCGAT TTTATCGCCT TGGTCAACTG TGTGCGCCAA
GAAGGTAAAG ATGTGATTAT CGTTGGCATT GGCGGGGCTG CCTCCAGCCA TCTCGCTCAA
AGTGCCGATG AATTTTTGTT CTACGAGCAA ATTGTCGATA TTCGCCCGAT GGGTGGCCGT
CGCAACGAGC GCCACGAAAA AACCTTCGAG CGGGTCGAGC GCCCTGAACG GGTTGAACGT
AACGATCGTA GCGAACGCCC AGAACGCCCT GAACGCAGCG AACGCCCAGA ACGGTCAGAA
CGTAGCGAAC GCAATGGCCG CGATGATCGC CGCCGCGAAC GCGAACGCCC TGAGAAGAAT
GCCGAGCCAA CTTTCGAGCG GCTTGATTCA CGTTCGGAGA AAACAGCCGA ACCACGGGTT
GAAAAGCCCG CTGAAAAAGC TCCCGAAATT GTGGTACGGC CAGCACCAAG CACCGCGCCA
GTCGTGGCAA CCACCCCCAG CAATCCCGAT GCGGCAATTT ATGATAAGTT GGTTGAGGCC
TTGCACTTAG CGCGGAAACG TGGCTATGTG ACTTCATTTG GCTCGCTCAA GGTGCTGATG
AAAGAACTGC TGCCCAACTT CAAAGAAAGT CGCTATCGCG ATGCTCAAGG CAAGCCGTTC
ACCAAATTTA CCGATTTTGT GCGCGAAGCC GAACGACTTG GCAAGATTCA AATCTTCACC
AGCGGCACAG TTAACGAAGT CTTCTTGCCC GATGAAGATC CTTACAAGCT TTCGCAATTT
GCTGAAGATT TGCCCTCAGT TGAGGCCGAA CGCGAACTCG AACAAGAGCT TGAGTTGCAA
GCCGAATCGA TCGCCCCAAT CACGATTGAA ACAATTGGCA TCGCCGATGA GCCGATGGAA
TTGGCCGATG GCGAAGAAGT TGCCCCAAGC AGCGAACGTG GCGACCGCAA TGGCCGTCGT
CGCCGCCGCC GTGGCAGCAG CCGCAGCGAA CAACCAGCCG TCGATGTGCT GCCCGAAGAG
TTGAGCGAAA CAATCCTTGT ACCAGCACCA ACCGAAGGTT TCAGCTTTGG CGAACGTGAA
TGGCAGTTGT TCTATCAGGT GATGGGCGTG TATAGCGAGC CAGTGCCATT CGCCGATATC
TTTAACGATC TTCGCGAAAT GCGCAACACG GCTGAGCTTG ATCTGACCAA TAATGCCTTG
AAGGAATTGA TCAAACAAGC GATCAACGAA GGCAAAGTCC AGCGCTCAAA TCGTGGAGCC
AAAGCCCACT ATCGCTTGGT GCTCAACCCT GAGCAGTTTG ATGGTGCAAT AACAACCTTC
GAGCCAAGCG ATCTTGAAGC CTTTGATGAT GGCCCAAGCT TGCCTGACCT GGTTGATGAA
GAACCACGTT TGTTGCCAGC CATGATCGGT GGCCAAACCA TCGCGATTGA CGATGATTAC
ACTGCGCCAA TCGATGAAGT GGCCTTGCCC GAAAGCTTTG AAGAACAGCC GTTGCCTGAA
AGTGATGCCG ATTATTCGTT TGCCGAGCCA ATCGAGTATG AAGCACCTGT GGCAACCTCA
ACCGCAGCGA TCAAGCCGCG TAAGCCCAGC CGCCGCCGCA AATCGATTGT GCCGCTGAGC
GTGGCCGAAG CGATTGCTGC CAGCGTCGCC AATGATCCAG CGCCTGAGCC AGCGGTTGAA
GTGGAAGCTG TAGCTGAGCC TGTCGCCGAA ATCGTCAGCG AACCAATCGC AGAACCTGTG
GTTGAAGCAG TGACTGAGGA AAAGCCCAAA AAGCGGGGTG GCCGCCGGAA AGCTGCCGAG
CCAGTTGAAG TCGTCGAAGA AAAACCTAAG AAAACCACTC GCCGCAAAAA GGCCGAGCCA
GTCGCTGAAG CTCCAGCTCC AGTCGAGCCA GCCAAAAAAC AACCACGTCG CCGTAAAAAG
GCCGCCAGCG AAACTGGAGA AGAAGCATGA
 
Protein sequence
MGDRTRSLAR IKWHATTKMT AQEHIGIPMN KPKQDVAVFI DFENIYVSVR EKFDATPNFE 
ALMERCEDYG RVVVARAYAD WYRYPRITSA LFANNIEPMY VPTYYYDKDE GRMGRPIKNS
VDMHMCIDAM RTLYTRTNIG SYIFITGDRD FIALVNCVRQ EGKDVIIVGI GGAASSHLAQ
SADEFLFYEQ IVDIRPMGGR RNERHEKTFE RVERPERVER NDRSERPERP ERSERPERSE
RSERNGRDDR RRERERPEKN AEPTFERLDS RSEKTAEPRV EKPAEKAPEI VVRPAPSTAP
VVATTPSNPD AAIYDKLVEA LHLARKRGYV TSFGSLKVLM KELLPNFKES RYRDAQGKPF
TKFTDFVREA ERLGKIQIFT SGTVNEVFLP DEDPYKLSQF AEDLPSVEAE RELEQELELQ
AESIAPITIE TIGIADEPME LADGEEVAPS SERGDRNGRR RRRRGSSRSE QPAVDVLPEE
LSETILVPAP TEGFSFGERE WQLFYQVMGV YSEPVPFADI FNDLREMRNT AELDLTNNAL
KELIKQAINE GKVQRSNRGA KAHYRLVLNP EQFDGAITTF EPSDLEAFDD GPSLPDLVDE
EPRLLPAMIG GQTIAIDDDY TAPIDEVALP ESFEEQPLPE SDADYSFAEP IEYEAPVATS
TAAIKPRKPS RRRKSIVPLS VAEAIAASVA NDPAPEPAVE VEAVAEPVAE IVSEPIAEPV
VEAVTEEKPK KRGGRRKAAE PVEVVEEKPK KTTRRKKAEP VAEAPAPVEP AKKQPRRRKK
AASETGEEA