Gene Haur_4056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4056 
Symbol 
ID5735914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5176037 
End bp5179441 
Gene Length3405 bp 
Protein Length1134 aa 
Translation table11 
GC content53% 
IMG OID641281207 
Producthypothetical protein 
Protein accessionYP_001546816 
Protein GI159900569 
COG category 
COG ID 
TIGRFAM ID[TIGR01451] conserved repeat domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.55653 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACGGT TGGTGCTTCA ACTCGCTTTG CTCGCCGCAC TTATCGTTCG TTTTAATCTC 
CCCGCCGCGC AAGCGCAACC TCAAGCTTTT GTCGCTCCTG AAGAATTACA GGCTGTCGCC
GATGATCAAT TTTGGAATAA TACAGGCTTG ATTGCTGGGG CAAATAACAC GATTCGCGCC
ATCAGTTCGC AATCAGGCGA TTTATTTGTT GGTGGTTTGT TCGATCGCAT CGCTGGCATC
AGTGCCAATC GGGTGGCTTT TTGGGATGGC GATCATTGGA ATACGATGGG CAGTGGGGTT
AACGGCCCAG TCGATGATCT CGATGCTTCA ACTGGCGGTT CAGTTTATGT GGTTGGCTCC
TTCAGCAGTG CTGGTGGAAT TGCCGCCGAT GGCATTGCTC GTTGGAATTC TGGCACAGGC
CAATGGTCGG CCTTGGCGAC CAACGTCAAT GGAGCTGTTA CCGCCGTTTT GGTGCAACAG
GTCGCTGGCA GTGATGTGGT GTATGTTGGT GGAACGTTTA GCTCAATTGA TGGGGTGAGT
GCCAACCGTA TCGCTAAATT CAGCAATGGC TCATGGTCGG CATTAAGCAG TGGGATCGGC
GGTGGCACTG CTCCCCAAGT GCTCGATTTA GCGATCAATC CGGCCAATGT CAACCAACTT
GTGGCTGGGG GCACGTTTAG CTCGGCTGGT GGTAGCACTG CCAACAATGT AGCGATCTGG
ACAGGCTCAG CTTGGCAGAG CCTTGGTACG GGTAGCAGCA ATGGGGTCAA CGGTGCTGTG
CGTTTTGTTG ATTTCCGTGG CACAAATATG GTGGTCGTTG GTGGGAGTTT TAGCAATGCT
GGCACTGTTA CCAATGTTGG TGGTGCAGCG GTTTGGACTG GTGGCAATAC CTGGGCAGCC
ATGGCCGGAC GTGGGGTGAC TGGCGATGTG CGTGGGATTG TCGAGAACGC CAATTTTACC
TATGTGATGG GCAATTTTGG CAGCGGTATC AACCCCAATG GCAATAGTGT TTTCTCGCCC
AATATCGCCC GTTGGGATGG TAATATCTGG TCGCCTGTAC CGAATGCCAC CAATGCCTTT
GGCACAAATG GGGCAATTTT ACGGGCTGAA CGCTTGGGCA GCGGCAGCGA TACCTTCTTT
ATTGCTGGGG CGTTTGGCAC AGCCCATGGC ATGGAATTGA ATTTTGTGGG CATGGTTGTG
CCACAACAGG GCTTCTTGCC TGGCGCAACT GATCGCTTTT TCCCCTTGGC TGGTGGCCTC
GAAGGCTCCA ACGCCAAAGT TTTTGCCATT CAGCCACGTT CTGGCGAAAT TATTGCTGCT
GGGCGCTTCG ATCTGGGCAG TAATCGCTTG CTCAATAATA TCGCTCGCTT TGATCCTGTT
GATCGAGTCT GGTCGCCGTT GACTGGCTCC AGTGATAGTG GAGTCAACGA TGATGTGCGT
GATCTTGCGT TGCGCAACAC CGATTTGATT GTTGTGGGTG AGTTTAGCAA GGCTGGTGGC
ATTGATGCCG CAGGCGTAGC TAGTTGGAAT GGCACAACCT GGACTGCCCT CGCAACCAGC
ATCAATGGTC GGGTCAACGC GGTTGCGATC AGTGGCAGCG ATATTTATAT TGGCGGCGAA
TTTACGCTGA TTGATGGTGT GCCTGCCAAT CGGATTGCCC GTTTGAGTGG TGGTTCATGG
CAAGCTGTCG GCGCTGGCAC TGATGGCCCT GTCAATAGTT TGTTGTTCAA ATCGAACCAA
CTCTATGCTG GCGGCTTGTT TGCCAACGCT GGTGGTGCGC CTGCCAGCAA TCTTGCCCGC
TGGAACGGCA CAACATGGCA AGCAATCGGG GTGGGCACTG ATGCCGAAGT GTTAGCTTTG
GCCGATGTTA ATAGCACGAC TGTGGCAGTT GGTGGGCGGT TTACCAGTGC TGGTGGGGTT
GCCAACACCC GCGCAGTTGC TCTGCTTAAC CACAGTAGTT TGGCATGGAC GGCGCTTGGC
ACTGGCACTG ATGGCTATGT AACCAGCCTC GTCGTGCGCG GTGACGATTT GTATGCTGGT
GGTTTGTTCA GCCGCATGGA TGGCTTGACG GTCAATCATG TGGCGCGGTG GAATGGCACA
ACTTGGAATG CCTTGGGTAG TGGGGTTGCG GGTGGCAACC TGCAAAATAG CGAAGTTGGA
GCCTTAGCGG TCAATGGCGA TAATCTGTAT GTTGGTGGAC GTTTTGATCG GGCTGGCGAT
AAAGTTTCGC ATCGCTTTGC TGAATGGCGA CAACCCGAAG TTGATCTCAG CCTCAAACTG
AGCGAATCGC CTGATCCGGT GACGATTGGC AATCCTATGA GCTATAAAGC CACCGTCAGC
AATTTGGGCA CGATCAGCGC CAGCAGCGTC GTGTACGAAC AAACCTTTGC CAATACATTA
GTTTTTGGTC AAGTTACAAC CTCGCAAGGC TCATGTAGTT TCCCCAATTC TACGACCTTA
CGTTGTAATT TAGGTACGCT AGCGGCCAAT GCTAGTGCCA ATATCACAAT TAATGCTACG
CCCAGCCAAG TTGGCACAAT TAGTAGCACG GGCACGGCTT CATCGCCTGC GAATGAGGCT
TTTCCAAGCA ACAATAGTCG TAGCGTTGCA ACTCAAGTGA TTGTGCCTGG TAATCCAGTG
CCAAGCATCA GCAACATCAC GCCTGATCGC TTTATTCGCC AGCCGATTGG CTTTCCACCG
CCGCCAGCAG TGCGGATTAC AGTCAATGGC ACGGGCTTTG TGGCCAATTC CAAGGTTGTG
GTAGCTGGGG TTGAGCGCAC GACAACCTTT ATCAATAGCA ATCGGCTGGA ATTTAGTATG
GCCGCAACCA CCAGCCAAGG CACATACTCG GTTTTGGTGC GCAACCCCAC GCCAGGCGGC
GGCGATTCCA ACAGCGTAAC CTTGGGGGTT TCGGTTGGCA TCGTTGGCTT GAGCAGCATC
ACGCCCAACC TTGGTGGCAC TGATGTCGAT CTACAAACAA CCTTCAATGT AAGCTGGACG
CATACCACCG ATCCATGGCG AATTATCGAG CACCTCGATT TGCGCTTGGT CGATAGCGAT
GGTGTGGCTT TATGGGCACG CTTTACCGAA GGCGTTTCGG GCACATTTAG CCTGCTCGAT
GCCAATGGCG ATGTGCTTGG TTATGCCACG GCTGGCAGCA GCGATCCATT GGAGAGCGAT
AGCGCCATCC TCGATTTGGC GGATAGCAAC TTTGCTGGCA GCGGGCCGAC AGGCTTTAGC
ATGCAAGTCA ATTTCAGCAT TCGCTTCAAA CCAAGTGCCG CTGGTCGTCG CTATAACATC
GAGTTGTATG CCACCGATGA TCATGGCGGG GTACAAGGGC CAGATGTGAT GGGCACATTC
ACCGTTGGCA TTCACGATGT ATATCTGCCA ATGACCATTA AATAG
 
Protein sequence
MPRLVLQLAL LAALIVRFNL PAAQAQPQAF VAPEELQAVA DDQFWNNTGL IAGANNTIRA 
ISSQSGDLFV GGLFDRIAGI SANRVAFWDG DHWNTMGSGV NGPVDDLDAS TGGSVYVVGS
FSSAGGIAAD GIARWNSGTG QWSALATNVN GAVTAVLVQQ VAGSDVVYVG GTFSSIDGVS
ANRIAKFSNG SWSALSSGIG GGTAPQVLDL AINPANVNQL VAGGTFSSAG GSTANNVAIW
TGSAWQSLGT GSSNGVNGAV RFVDFRGTNM VVVGGSFSNA GTVTNVGGAA VWTGGNTWAA
MAGRGVTGDV RGIVENANFT YVMGNFGSGI NPNGNSVFSP NIARWDGNIW SPVPNATNAF
GTNGAILRAE RLGSGSDTFF IAGAFGTAHG MELNFVGMVV PQQGFLPGAT DRFFPLAGGL
EGSNAKVFAI QPRSGEIIAA GRFDLGSNRL LNNIARFDPV DRVWSPLTGS SDSGVNDDVR
DLALRNTDLI VVGEFSKAGG IDAAGVASWN GTTWTALATS INGRVNAVAI SGSDIYIGGE
FTLIDGVPAN RIARLSGGSW QAVGAGTDGP VNSLLFKSNQ LYAGGLFANA GGAPASNLAR
WNGTTWQAIG VGTDAEVLAL ADVNSTTVAV GGRFTSAGGV ANTRAVALLN HSSLAWTALG
TGTDGYVTSL VVRGDDLYAG GLFSRMDGLT VNHVARWNGT TWNALGSGVA GGNLQNSEVG
ALAVNGDNLY VGGRFDRAGD KVSHRFAEWR QPEVDLSLKL SESPDPVTIG NPMSYKATVS
NLGTISASSV VYEQTFANTL VFGQVTTSQG SCSFPNSTTL RCNLGTLAAN ASANITINAT
PSQVGTISST GTASSPANEA FPSNNSRSVA TQVIVPGNPV PSISNITPDR FIRQPIGFPP
PPAVRITVNG TGFVANSKVV VAGVERTTTF INSNRLEFSM AATTSQGTYS VLVRNPTPGG
GDSNSVTLGV SVGIVGLSSI TPNLGGTDVD LQTTFNVSWT HTTDPWRIIE HLDLRLVDSD
GVALWARFTE GVSGTFSLLD ANGDVLGYAT AGSSDPLESD SAILDLADSN FAGSGPTGFS
MQVNFSIRFK PSAAGRRYNI ELYATDDHGG VQGPDVMGTF TVGIHDVYLP MTIK