Gene Haur_2254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2254 
Symbol 
ID5734141 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2876027 
End bp2880355 
Gene Length4329 bp 
Protein Length1442 aa 
Translation table11 
GC content48% 
IMG OID641279395 
Producthypothetical protein 
Protein accessionYP_001545022 
Protein GI159898775 
COG category 
COG ID 
TIGRFAM ID[TIGR00756] pentatricopeptide repeat domain (PPR motif) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGACGA CCCCAAACAC TCAGGTTTTG CATGAAGCCG CCAACCAGCG TGAAAGTGCC 
CGTTTGTTGC AATTGCGGCT CACTGGCTTC GTTGGTCGTC AAGCCGAACA AACAGCGATT
CGCGGATTAA TCGACCAAAC CCGCCCAAGC GGTGGCTATG TGTTGGTGAC GGGTGAGGCT
GGGGCGGGCA AAAGTAGTCT GCTCGCCCAA CTAATTGTGA ATGCTGGGCT AGACCAAACC
CCGCAGCATT TTATTGCGCT GACTCCAGGC CGCGCCTATC AACTCGACGT GTTGCGTAGC
ATCGTTGCTC AACTGCTGCT TAAACATGAT CTGGCGAGCA ACTATTTTCC TGCCGATAGC
TACCCCGCCT TGCGCCTCGA ATTTGGTCAG TTGTTACAAA CCCTCTCGGC GCGTGGCATC
AGTGAAACGA TCTATCTCGA TGGACTGGAT CAATTGCAGC CTGAGGTGGA TGGAACCCGC
GATCTCAGCT TTTTACCCTT GCAGCTGCCG CCTGGCATCG TGATAGTGCT TGGCTCGCGT
CCCAACGAGA CGATCGATAG TTTAGCACTT GAGCATGGGG TCGTTTATCA GGTTCCACCG
TTGCACGAAC AGGATGCGAT TGGGCGTTGG CAGCAGGTGC AGCCGACGCT GGAGCCAGCG
TTGTTGCACG GTTTAGCGCA AGCCGTCAAG GGCAATGCCT TGTTGATCGA ACTGGCGGCC
AATGTGCTGC GCCACACTTC GACTAGTGAA ATGCTTGCAT TACTCGACCA CGCCAGCGCT
GACGCAACCA ATCTCTTTCG GCTGAGCCTT GGGCGGATCG AACAAGCAGC GCCGCGCCAC
TGGCAACCGC TGATTCGCCC GTTGTTGGCG GTGTTGTTGA TAACCCAAGA ACCGCTTGAG
CCAGCGGTGC TTGCGGCGAT TATTGAACGA CCAACCGCTA CAGTGGTCGA GGCGCTGACC
CTGATGAGCG ATTGGGTGAG TGTTGCCGCC GATCAGCGCG TGGCGCTACG CCATTTGTTG
TTTCACGATT TTCTGATCCA ACACGAATTT ACCCAGCCAG AATTGCAAGT GTGGCATGGA
CGCATGACGC AATGGTGTGG CGCAGCACTT GACCAGATTT GGCACGATAG TACAGAATCC
GTTGAACAGG CACGGCGCTG GTATGCACGC CAGCATTACA TCACCCATTT GGATTGCGCA
GAACAATGGG AAGCATTGTG GCAAGTGATC GATGCGGGCG ATTATGGCGA GCACAAAGTG
CGGTTTGAGC CAAGCACGCG CTTGTATGGC TTGGATTTGG ATCGGGCCCG CGAGAGCGTA
ATTGCCGCTG GCCAGAGCAT CGAACAGCAG CTTGAATTAT TGCCACGCTT GTGGCGCTAT
AGCCTGCTGC GCACCAGCCT CACGGCCCAT GCCGATCAAT GGCATGATGA TGTATTTGTG
ATTTTGGCGA TGCTTGGGCG GGTATCTGAG GCGCTTGCCC AGATTGAAAT TTGCTCGGAT
CAAGTACGTC AAGTGCTGTT ATGGTCACGT GTAGTTGCTT ATACAGAGCC TGAGCTGCGT
TTGCATATTT TTCAGCGCAT GGAACAAGTT GCGCGAAGCT TGCATGAGTC TGAAGAGCGT
GATTATGCGT TGCATCTCGT TGCAATGGCC TATGCTGACC ATGGCTTGCT AAATATGGCC
TACCCGATTG CGATTAGCCT GGGGAATACT CGTGATGAAA CGCTTGCTTA TTTGGTTGAC
GTGGTGATTA AACAGCATGA TTTGGCAAGA GTTAAGCTGA TCATTGGACA AATTCAAACT
CCAAAATATC GAATTAAGAG TTCTATGCTG CTAGCTAATG CCCTGATTGA AGAAACTGAA
TTTATCGAGG CTCGACAACT ATTAATTGAA ACGTTACCAT TTGCTCAAAA CGAGCAAGTA
GTTGAAATTA AAAGCTTACT TGCAACAATT GCATGGCGGT TGGGTGATCA TCAACAGGCC
GATATATTAC TAGCTGAAGC TCGATCCATG CACAAATATT TTGCTGATGA TGCAAAAATT
GCAGCACTTT TGGCAATAAT TAAAGGGTAC TTGGCACAAG GGAATTTAGC ACAAGCCTAT
AATTTGCATG ATGAGATCAA ATTAAATCGA TTTCGTCGGG AATTAGTTGA TATCTATATT
AATCGTGATG ATATTGCAAT CGCGGTTGAG CTTGCGGCTA CCATTACCCA CTGGCACTCC
AGTGATCTGG CTTATGCCGC ACTAGTTGCC TGGTATTGTA AGGAAGCTGA TTTCTCGAAG
GCCGAGCAAG CGCTTGGATT AATCAAAGCG CCTGATCAAC AGATTAAAAG CTATTGTTTA
TTGGCAAATA GCCATGCCGA TCGATTTCAA TGGATGCGGC TGCTGGAGTC GGCCCAGCTG
TCTTTGAGTT CGGTTATAAG CTCGATCTCT TTAGCAAAAT GCTGGCTGCA ACTTGCCGAT
GCATATGCTT TTCAGCATGC GCATGACCGT GCTCAGTCTA TGTTTGAGCA TGCGTTGACA
GCGATTTTGG CCACATCAAA CTCGTTTTTG AATGACCAAA TGTATGATTT ATTGCAGCTT
GCTCATTTTG CTAAACGCTA TAACTATGAT GATCTCTGTC AGCGAGTGAT CTACACCACG
TTTTTAGTTG GTAAACACGA GGATTTTGAG TATTCTTTGC CGTTTGAGCA TGCTATGCTC
CATCTTAATC ATGGTGAGAT AGACCAAGTT CGCCAGATTA TTGACACAAG TGTTGGACCA
TATGTGGCAG TTAACTTATT ACAAATACTT ATAGCTGAAT CAATTAAGCA ACAGGATCAT
CCTCAAGCTC AACTATACTT GTTCGAGGCG TTGAACCATG CACGAAAGCT TGAAAATCCT
AGCTATCGGG TGAGTCTCCT GGGTGAACTT GCTGATATAG CCTTAGCTAG TGGGTTTGAG
CTACTCGCTA AAACAATTCT CGGCGAAGCT ACGCAACTGC TGCCCGTAAT CACTGCTGAA
AACGAGCAGC GTTGGGCTGG GGTGGGCCTT GTTCGTCGCT ATTATAGCCA TGGGATGCTA
GCGAGTGCGG CTACGATAAC CCAGGTAATG ACCGTTTCGC AAACACACGA TCACATTATG
GTAGAAATAA GTCTTTGTTA TGCTGATAGT GGACAGCTAG CCCAAGCTTA TGCAACGCTG
AAGGTTATCA ATACTCAAAC TGAAGTGTAT GCACGGAATT TGTGCCAGAT TATTATTAAA
GCGCATGAGC ATGGTTTAGC TACGCTTGCA GCAGAGTATT ATGATGAGTT GATTGAAGCA
TGGAGTGTAA TTGCCGATCC GATTCGCTTG CTTAAGGATC TAAAGGATCT GGCGATTGCC
CAGATTAACT ATGGCTCCAA TCAGTATCTC CCCAGCCTAT TGGAGGCCAT CCGCACTATT
CGACATCCAG CCTTGGCTGA GTATCAGTAT GTGGAAGTGC TTTGTGAAAT TGCCAGAGCC
TATATCAAAC AAGCCAATTA CCCAGATTTT GCCGATTGGT TAGCGTATGC CCATTCCATT
GCTCAATCGA TTGTGATTGA GTCGCGTCCC AAAGTTGCTG CTTATCATTA TGTTGCGATC
ACATATCTTT GCCATGCTAC CGATTCAGAT ACAGAGATAT TTTTGGCTGA TATGCTTCGT
TTAGCCAATG GTATTGCACC GAGTAGCTAT GCAAATGATC TATTTAATGC ATTAGCCAAT
AGCTGTGCGT CCTATGCGGT GCGTGGGCAT CCAGATTTTT TTGCCAAAGC CTATCAGTTT
GCTATGGCTA TTTCAGTGCC GTGGCAGCGT GCCCAAGCCC TAAGAAGCGT CGCCAATGGC
TATGCTAAAG TTGATGACCG CGTGATGGTA GAAATGATTA TTGCTGAAAT AAGCCGACTT
TCGCCTAATT ACTTAAGTCT TGATGCTGTC GCTTTGATCT ATGCGCAACG AGGCGATTTG
GCTTTTGCTC AAACGCTGAT TGCCAATGAT GAAGCATCTG AAGAACGAGA TGCTGTCTTG
GATTATCTGA TTCCAGCTTT GCTGCAAACC GATGCTGTCG TTGCTGCATA TCAGATCTCG
CATGGGTTTA CTAGGCTGAC GAAACGGATT AAGTTCTTGC ACCAAATCGT TAACTACTAC
GTTGAGCGTG GGCAGATTGC CGAAAGTATT CAGATTATTC AAGCCGCATG GCGTAATTGT
GGTGCTGCTG CTGATCTATG GGAGCTGCGG ACAATCGTCT TGCCGTTTGA TTCAACCCAT
CCTTGGCTTG GCACTGCCGT GCTCGATAGC GTGCCATGGG TTGAGCAGCA ATTAGCCCGA
TTGAATTAA
 
Protein sequence
MMTTPNTQVL HEAANQRESA RLLQLRLTGF VGRQAEQTAI RGLIDQTRPS GGYVLVTGEA 
GAGKSSLLAQ LIVNAGLDQT PQHFIALTPG RAYQLDVLRS IVAQLLLKHD LASNYFPADS
YPALRLEFGQ LLQTLSARGI SETIYLDGLD QLQPEVDGTR DLSFLPLQLP PGIVIVLGSR
PNETIDSLAL EHGVVYQVPP LHEQDAIGRW QQVQPTLEPA LLHGLAQAVK GNALLIELAA
NVLRHTSTSE MLALLDHASA DATNLFRLSL GRIEQAAPRH WQPLIRPLLA VLLITQEPLE
PAVLAAIIER PTATVVEALT LMSDWVSVAA DQRVALRHLL FHDFLIQHEF TQPELQVWHG
RMTQWCGAAL DQIWHDSTES VEQARRWYAR QHYITHLDCA EQWEALWQVI DAGDYGEHKV
RFEPSTRLYG LDLDRARESV IAAGQSIEQQ LELLPRLWRY SLLRTSLTAH ADQWHDDVFV
ILAMLGRVSE ALAQIEICSD QVRQVLLWSR VVAYTEPELR LHIFQRMEQV ARSLHESEER
DYALHLVAMA YADHGLLNMA YPIAISLGNT RDETLAYLVD VVIKQHDLAR VKLIIGQIQT
PKYRIKSSML LANALIEETE FIEARQLLIE TLPFAQNEQV VEIKSLLATI AWRLGDHQQA
DILLAEARSM HKYFADDAKI AALLAIIKGY LAQGNLAQAY NLHDEIKLNR FRRELVDIYI
NRDDIAIAVE LAATITHWHS SDLAYAALVA WYCKEADFSK AEQALGLIKA PDQQIKSYCL
LANSHADRFQ WMRLLESAQL SLSSVISSIS LAKCWLQLAD AYAFQHAHDR AQSMFEHALT
AILATSNSFL NDQMYDLLQL AHFAKRYNYD DLCQRVIYTT FLVGKHEDFE YSLPFEHAML
HLNHGEIDQV RQIIDTSVGP YVAVNLLQIL IAESIKQQDH PQAQLYLFEA LNHARKLENP
SYRVSLLGEL ADIALASGFE LLAKTILGEA TQLLPVITAE NEQRWAGVGL VRRYYSHGML
ASAATITQVM TVSQTHDHIM VEISLCYADS GQLAQAYATL KVINTQTEVY ARNLCQIIIK
AHEHGLATLA AEYYDELIEA WSVIADPIRL LKDLKDLAIA QINYGSNQYL PSLLEAIRTI
RHPALAEYQY VEVLCEIARA YIKQANYPDF ADWLAYAHSI AQSIVIESRP KVAAYHYVAI
TYLCHATDSD TEIFLADMLR LANGIAPSSY ANDLFNALAN SCASYAVRGH PDFFAKAYQF
AMAISVPWQR AQALRSVANG YAKVDDRVMV EMIIAEISRL SPNYLSLDAV ALIYAQRGDL
AFAQTLIAND EASEERDAVL DYLIPALLQT DAVVAAYQIS HGFTRLTKRI KFLHQIVNYY
VERGQIAESI QIIQAAWRNC GAAADLWELR TIVLPFDSTH PWLGTAVLDS VPWVEQQLAR
LN