Gene Haur_2909 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2909 
Symbol 
ID5734780 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3679769 
End bp3681739 
Gene Length1971 bp 
Protein Length656 aa 
Translation table11 
GC content50% 
IMG OID641280052 
Producthypothetical protein 
Protein accessionYP_001545675 
Protein GI159899428 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0214301 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGATTG GATTGATTGT GCAACGTAGA ATTATTCAAA TAAGCAGTTT TATAGCTATT 
TTAGCTCTTT GTTTAGCCAT AACCTTAATT TGGGTACTGC AACGTAGCCC TGCGCAGGTC
ACAATTGGTG GCAAATACGA TTCACCATGG CTGGTGGAGG GCTTTCAAAC CAAAGAACGC
TCAGAATTGG GAGCCTATCG TTGGACGAAT GGCCATGGAA TTATTGGCTC ACCAGCAACT
CATCGTAGTT ATATGCTAGG CTTGAGTTTA GTTTCTCCCG TAACAACCAC GGTTGTTCAG
CTCAATAATG CTGGACATCG GGTGCTTGAG CTACCGATTA GTAATGCGCC GCGCTATTAT
CAAATTTTTT GGCGGCCCAA TATCTCGCTG AATTGGCTGC GTTGGGCTAG CGACCAGCAA
CTTACGCTCA ATAGCGAGCT GCAAGTGCTT GAAGGCAGCG ATCAACGCCA ACTAGGCGTA
GTCTTGCAAA ACCTCAGTTG GTCGCCTACT GGCAGCATTT CGTTATTACC ATTTGCCTGG
ATTACCGCGC TGGTGGTGAG TTTAGCGGCA TTAATCAGGC CACAGCAACG GCGTGATTGG
CTTTGGTTTG CACTATCGGC TATTGGGTTA AGCCTAATGC TCGGCGGCCT GAGCTGGCTT
GCCAACGATC AAAGCGTTTG GTCGCCATTA CGCTTTGCCC CTAGTTTGTT GATTTTGCCC
TTGGCGGGGT TTGCACTATT GCGCTGGCCA TGGCAAGGCT GGTGGCAAGC CTTGCCAGTA
CTCAGTTTAA TTGGCATTGC GGCAATTGTG ATGCTGCTCT CGCGCCAGTG GTGGGCGGTT
GAGGGGCCAG ATTTTGGCTG GCACGCCAAC CATGGTAGTT CTGCCGAATC GGTGTTTCGG
GCGCATCCCT TCTATCCATT GGGATTTCCG CTAATCTTAT GGCTTGGTTT GTTGTGGAAC
GGCGATCAAC TAGCGATTGG GCAAACCGCT GGATTTATCA GCATGCTCTT GAGCTTGCTG
CTCACGGGCT TATTGGCCTA TCGGATCTTG GCCTTGCGTG GGGCAATTGT CGCCTTGATT
TTGGCCTTAG CAACCCCGCT ATTACTGGCT TTTGGCGTGG TGGCCAGCAG CGATAGTGTC
CAATTGCCAG CCTATTTAGC TGCGTTATTG ATCCTAGTTT GGCAACCAGA ACTGACCCGC
CGCCGCGTCG CACTGGCTGG GTTGTGCTTG GGGTTGGCTT ATTTATTCCG TTTTCAATCG
ATAGTGATTG TGGTGCTGGT TTTGCCATGG TTATGGCTGC AACGCCTGCC TGCCCCGCCG
CGTTGGCCGC AACGTTTGGC AGGTTGGTTT GCTCCAAGTT TGCTTTTAGC CGGATTTTTG
CTTGGGTCAT CGCCGCAGTG GGTGCTCGAT ATTCGCGATA CAGGGCGACC ATTTTTCTCA
CAACAATATG AAAACATCTG GCAAGCTGCC TACAATCGGG TTGATGCGGT AGTAGCTGCC
GATAGCCCCG AAGCGATTGC CACCGCGCCC AGCGATACAG GCTTATACGA TATTGTGGCG
TTTGATCCAT ATGGCCTATT TCGTCATTGG CAAGCTAATT TAAGCCAATT TTTTAGCTTT
ACCTTGCACA CAATCTTTAT TTGGCCATTT GGCTTATTGA TGCTTTTGGG ATTGGGCTTA
GCGGTATTGA AACGGGCTGA CCCGCGTTTG AGTTTGTTGG CATGGCTGAG TTTAAGCTAT
ATTCCAATTA TTGCCCTAAC CTGGAACAAA GATCGTTTTT ATCTACCGAT TGTGCCCTTG
TTGTTGGTGC TTGGCGCGTA TTGGTTGGAG TGGTTGCGCG GGCAGGCCTG GCGTTGGCCA
CGAGGCAGTC GTTGGTTGGC TGAGGCAGTT CAGGCTGCCA GTTTGGCTTG GGCTTTGAGC
CACCTCAGCG CAATCGATCC GATTTTACGG GTGTATGGAA GCTTAAAATA G
 
Protein sequence
MRIGLIVQRR IIQISSFIAI LALCLAITLI WVLQRSPAQV TIGGKYDSPW LVEGFQTKER 
SELGAYRWTN GHGIIGSPAT HRSYMLGLSL VSPVTTTVVQ LNNAGHRVLE LPISNAPRYY
QIFWRPNISL NWLRWASDQQ LTLNSELQVL EGSDQRQLGV VLQNLSWSPT GSISLLPFAW
ITALVVSLAA LIRPQQRRDW LWFALSAIGL SLMLGGLSWL ANDQSVWSPL RFAPSLLILP
LAGFALLRWP WQGWWQALPV LSLIGIAAIV MLLSRQWWAV EGPDFGWHAN HGSSAESVFR
AHPFYPLGFP LILWLGLLWN GDQLAIGQTA GFISMLLSLL LTGLLAYRIL ALRGAIVALI
LALATPLLLA FGVVASSDSV QLPAYLAALL ILVWQPELTR RRVALAGLCL GLAYLFRFQS
IVIVVLVLPW LWLQRLPAPP RWPQRLAGWF APSLLLAGFL LGSSPQWVLD IRDTGRPFFS
QQYENIWQAA YNRVDAVVAA DSPEAIATAP SDTGLYDIVA FDPYGLFRHW QANLSQFFSF
TLHTIFIWPF GLLMLLGLGL AVLKRADPRL SLLAWLSLSY IPIIALTWNK DRFYLPIVPL
LLVLGAYWLE WLRGQAWRWP RGSRWLAEAV QAASLAWALS HLSAIDPILR VYGSLK