Gene Haur_5011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5011 
Symbol 
ID5736970 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp17604 
End bp20459 
Gene Length2856 bp 
Protein Length951 aa 
Translation table11 
GC content48% 
IMG OID641282178 
Producthypothetical protein 
Protein accessionYP_001547769 
Protein GI159901523 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.261042 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTCGAC GTTGGATACT TCTATGTTTG GTTATTGGCA TTGGGTTGTC GGTAGTGCCG 
CAGGATACCT CTGCCCAATC AGCGGTCTAC TATACGTTTG ATGCACATTC GTTATTGATG
AATCAATCTG ATGAGAATTT TGATAAAGTC GCCTTTTTGT TATCGGTACA AGGGAATGTG
AATCGAGAAA GTCCTCGACT CTTCATTAAG CAACCACAGA TGATCAGCGT GAAAGGAAGT
GATTATAGCC CTGATACACT ATGGAAAAAC GCTATCCAAA GTTCATTCAT TTGGCTTAAC
CCGCGACAGT ATCAGGAAAC AGTGTTAACC GATATTAATC AGGTTATTAC CACCTTCGCG
GCATCATCCT ATGGCATTGA TGGAAGTGTC GTATGGGATA AGGATCGTCC TTGGACATTG
AATATTGCCG CATCAATTGC TGGTGCTCGA AATCTCGCTA TTGTCCGCAA GCAAAGCCCG
ATCTATGCGA CGCTTACCGC ACACTATCCT GTCGTGGTTG ATTTAACGAC GGATCCGATT
CAGGGTTTTA TGACGACGAA ATCGAGTGCG TATCAATGGC TCCTCCAAGA GTACCTGTTG
AATTCTGCGA ACCCGCATCG ACTGGAACCA ACGCTCGCAA GCCTCAAAGA TGGCTATGCC
CTCAAAAATC GGGCAATGAA TACCCTTTAT TTTGGCCGAT GGATTAGTCT GGAAATGGAT
GCTGCCATTG CTCGAAAGGC ATTAATTTTT GATCTTTCCC CTAATGCCGA TCTGGGATCG
CCAGAAGAGC AGTGTCAAGC ACCAAGTAGT GATTATACAA CGTTGACGAC CTTGCTTCAG
AATGTTCGTA ATCGCAAAGG AACCCAGCCT ATCGAAGTGA TGGGATTTTT AGATTTTCGC
TATATCTATT GTGAAGGCAA TGGTCAGCCC CATCCAACGG ATGAGCAGCG ACGGTTACAA
AATGTTGTGA CAGCGTTAGA GCATCCATTT GCAAAAGTGA TTTCCCAGTA TGGTGGCATG
ATGACGGTTG GTGGGATTGG TCCAGCTGAT GCTGCGAATG GATCATTTTT TCGTCATAGT
CCTGGTGTGC GGTATATCCC CCAATCCCCA GCAATGACTC CTGAAACCTT GCTCAGAAAT
GATTATGCTG ATGGCTATCC CCTCAATTTT TCATTTGAAA AAGGTGGCGT AAGTCAATGG
ACGATGTGGA CGACTAATTA TGCCACCTAC GCTGGAACAA ATCTCCCCCA CGGATCAACG
TTTCTTGAAA TGAATACCAG TACGACTGAC TGGCAGAATG GGAAGAATAC GCTGTATCAA
GATGTGCCGA TTGCGCTCTT ACGCGGATCA CGGTATCAAC TCCGGCTTAG TGCTCGGCGG
AATCCGAGCG AGGTGGGTAG TATCCAAGGT GGAATTGCCT TGTGGGGGAA ACGCGCTAAT
GGAAGTTATA CCCAACTCAA TCACTGTCCC TTCACGCTTA CAAGTGGATC ATGGGTTCCC
ATTGCATGTG ATACCGATAT TCGTGAAGAT GGACTCCATG GGATACGACT TCAAATTGCC
CTTTATACAC CGGATAAGAA CTATGATTTT GATGCGATCA CCTTCCTTGG TCCGAATACC
TTGCGCGTGA ATCCGACGAA GACCTTTGGC TTATTCTACA TGGGCGATTA TGATGGACCT
GGTGCGGCTT ACAGTACCTT GATGGCCGAT GTGAACGATG AGAAAAGTTA TAATGAACTG
ATTTGGACGA GCAAAATAAC GACCACCGTT CCCGTTGCGT GGGCAATTGC ACCGAGCTTT
CGTGATGCCT ATCCCAGTGC CTATGCCTAT CTTGCAAAAA CCAAAGGGCC ATATGATTAC
TTCATGATGC CCAACTCAGG GCCAAATTAT ACCAATCCCC TGCATTTTGA TAGCCTAGCG
AGTGGTCGCC CACGGGTTGG GCCTTTCAAA CAGCAAACGG CAACCCTCAA TCGCGAGGTG
GGGTATCGCG TGGGATGGGT CTTGGATGGT GCAGAAAGTC ATCTCAGTTA CAGTGATCCC
ACGGTGCGGA GTATTTTTAA GATTGCTACC CCCGATGGCT ATATTCACAA TAGCAATGTT
CCACCAATCG GGCCAACTGA TCCGTCGGTT GCAACCTATG ATGGACATGC GGCGATTTTA
CGCAAAACAA CCGATTTAGT GGATCATGCA ACCACAGACA CAGATGGTGC TGATCGCTTA
ATTGCACACG TCCTCGCACC ACAAGCCGCA CAATTTCAAG TCTATCGAAG TATTTTTGTC
TCATCAAACT TCATTTCGCA CGTCGTGTCC ACCGCGAAGA CAAAAAACCT TCAGTTTGCG
AATCGATTTG CCGCCCTTGA TCCGATGAGT TTCTTTGGTT TGTATAAGAG CCAGCATGGA
CTGTATCCAC GCTTACGGAT GAGTATGGTC AGTGATACCC TGCCGCAGGT GATGTACACT
GGTCAATCGT ATGCCGTCCA GGTCACGATT CGGAATGATG GATGGGATAT CTGGCGACCA
AAGCCCTCAG GCGCAACCGA TTGTGATGGG AGTGGGCTGG CATATAAAGG ATGTGATCGC
TTTGTTTGGA CATTCCAACC GCCAACCAAT CCCATTATTC CGACTGGACC AGGTGCGATT
CCAACCGTTA CCTATCCATC GGGAAATCGG ATTGATTTTG GAACAACAAT TGCTCCAGGT
GCAACGACCA CGGTGAACCT GATGTTGACC ATTCCTGCGA ATGCGACACT TGGCTATCAC
ACCTTCCAAG CAGATCTGGT TCAAGAAGGG TATGGATTTG GGGAAACCTA TGGGAATCAG
CCATGGCAAG GACGTGTCCT CGTGGCTACA CCCTAG
 
Protein sequence
MLRRWILLCL VIGIGLSVVP QDTSAQSAVY YTFDAHSLLM NQSDENFDKV AFLLSVQGNV 
NRESPRLFIK QPQMISVKGS DYSPDTLWKN AIQSSFIWLN PRQYQETVLT DINQVITTFA
ASSYGIDGSV VWDKDRPWTL NIAASIAGAR NLAIVRKQSP IYATLTAHYP VVVDLTTDPI
QGFMTTKSSA YQWLLQEYLL NSANPHRLEP TLASLKDGYA LKNRAMNTLY FGRWISLEMD
AAIARKALIF DLSPNADLGS PEEQCQAPSS DYTTLTTLLQ NVRNRKGTQP IEVMGFLDFR
YIYCEGNGQP HPTDEQRRLQ NVVTALEHPF AKVISQYGGM MTVGGIGPAD AANGSFFRHS
PGVRYIPQSP AMTPETLLRN DYADGYPLNF SFEKGGVSQW TMWTTNYATY AGTNLPHGST
FLEMNTSTTD WQNGKNTLYQ DVPIALLRGS RYQLRLSARR NPSEVGSIQG GIALWGKRAN
GSYTQLNHCP FTLTSGSWVP IACDTDIRED GLHGIRLQIA LYTPDKNYDF DAITFLGPNT
LRVNPTKTFG LFYMGDYDGP GAAYSTLMAD VNDEKSYNEL IWTSKITTTV PVAWAIAPSF
RDAYPSAYAY LAKTKGPYDY FMMPNSGPNY TNPLHFDSLA SGRPRVGPFK QQTATLNREV
GYRVGWVLDG AESHLSYSDP TVRSIFKIAT PDGYIHNSNV PPIGPTDPSV ATYDGHAAIL
RKTTDLVDHA TTDTDGADRL IAHVLAPQAA QFQVYRSIFV SSNFISHVVS TAKTKNLQFA
NRFAALDPMS FFGLYKSQHG LYPRLRMSMV SDTLPQVMYT GQSYAVQVTI RNDGWDIWRP
KPSGATDCDG SGLAYKGCDR FVWTFQPPTN PIIPTGPGAI PTVTYPSGNR IDFGTTIAPG
ATTTVNLMLT IPANATLGYH TFQADLVQEG YGFGETYGNQ PWQGRVLVAT P