Gene Haur_1534 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1534 
Symbol 
ID5733421 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1787524 
End bp1790130 
Gene Length2607 bp 
Protein Length868 aa 
Translation table11 
GC content50% 
IMG OID641278674 
ProductTPR repeat-containing protein 
Protein accessionYP_001544306 
Protein GI159898059 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTTTAC TTGATGATGC CTTCGAATCA ACGGTTACAC CGGCACTTAC GTCTTACATC 
TGGCTCCTGC GTTGGTGTGA TTCGGCTCAG TTGGCTCACC TAACGCCTTA TTCACCCGAA
CAGATCGAAC GCTTTTGGCA AAGCGCCTTG GTCATTGAAC ACCCCCATCA TGGTTGGTAT
CAACTACGCG AAGCACCTAG CTTAAATGAA CGACCTTATC GTGAGCACGA AGTCTTTGCT
GCCGCCTTTG AGTATAGCCA GCAACAACTC AATCGTCTAG AGGCTGAAGC TTGGCAATTT
GAACTCCAAC GCTGGCTCTA CTACCTTGAG GAATATTTGG AAGTGCTTTC GGCTCGCCGC
GATTGGCCAA CCATCGCCGC AGTGCTCGAA AAAGCCACCA GCATTCCGCA AGTCAATCTG
CGCCAACAGC AACTGCTGAT GCTCTACAAG GCGATTATCA CCATGCGGCT TGAACGTCAG
TATGATACCG CCCAAAGCTT ATTACAACAA TTACGCGATG ATATCCAGCT TGAAGCCGAT
TTAGTGCCGA TGGTGATCAA CAGTATGGGC ACATTAGCAT GGTTTCGCGG AGCCTACGAT
CAGGCGATTC AACATTATAT TGAGCAACAT CAACACGCTC AGCAGGTGCA AAATTGGCTC
TATCAAGGCC ACAGTTTGCT CAATCAGAGC ATTTTATCCA ACCAACTACA ACGCCCAGAA
TACGCCTTGG AGCTAAGTTT ACAAGCGCTC GAAGCCTTAC AACGGGCGGG CAATCGCTAT
CGCGAGGCGC ATGCGCTCTA CGAAGTTGGC TCGAATTTGC TCTATTTGTG CCGTTGGGAT
GAGGCCGATA GCTATTTTTC ACGCTCGGCG GCGCTCTACG AAACCCTCGA TACGATCGGC
GATTTGGCCA ATTTGTATTG GCATATTGGC TTTTTGAAGC ATTTAGCAGG CGATTTGCCG
GCCAGCGAAC AAGCCTATCA AATTAGTATT GAGGCGGCGC GGGCCAGCGT TGTGCCCAAT
GATGATAGTC TGCGCGATTC GTTGGCATTT TTGGGCTTGC TCTATTGCAG TATGCACGAG
TATCAAACGG CCTTGGAAAT TTATGCTCAA GCCGAAGCCT TAGCCCGCCA ACACAACAAT
CGCCATGAAT TAGCCCTGAT TCTCAATCAG CGCGGCGATG CTGAACGGCG GAGCGGGGCA
ATTGATGCAG CCTACGCCGC CTTTGCCGAA TCGATCGCAA TTATTGAAGA TTTACGCACC
TCGTTCGGCG ATGAAGATAC CAAATTGGGC TTGATTAGTA CAGCCCAACA GGTTTATGAG
CATATGGTTG TGCTATGTAT TGAGCGCGGC GATGCGGCTC AAGCAGTCAA TTATATCGAG
CGAGCACGTT CGCGAGCCTT CCTTGATGCG CTGCAAGCTG GCGATGAAAG TACCGCAATT
GAGCTTTCGC AGCAATGTGC CGATTTGGCC GAAATTCAAG CCCAACTTGA CCAACGCACA
GCGGTGATCG AATACTTTAC GGTTGGCGTA TTAGGTCGAT CATTGCGTTT CTTGGCAGCG
TTGGCCGAAC GCAAATCACC CATTTTACAT CATTTCAGCC TTGATCCAGC CTTGTATAGT
GTGGCAATTA CGGCCAACAA TGCCACGATT CATCAGCATA GCTTCAATCC ATTGAATCTA
ACGCGCGGCC ATGGCGGGCA TCATCGCTTG CTACAACCTC GGATTTTGAA GGCCATTTCG
CAAGCATTAA TCGAGCCATA CGAAGCCATG TTGGCGACAG TTGATTTGGT GTATGTTGTG
CCGCATGGTC CGCTACACGA TGTACCATTT ATGGCCTTGC AAACTAGCGA CGGCAATTGG
TTGGTGCGCG AAGAAAACCC AGCGATTGCC TTGGCTCCTA GTGCCACTAT TTTGGTGCGC
TATGCCTTAG GTCGCGCCGC CAGCAGCCAA ACTCAGCACT ATTGTTTTGG CTACAACAGC
GTCGGAGCCG AAGCCCTGAC CTATGCTGAA CACGAAGCCC AAGAAATTGC CAAATTAGTG
GGTGGGCAAG CATGGACAGG CGCATTAGCC ACCGATCAGT TTTTGCGCTA TGCCCACGAT
GCGCGAATTA TTCACATCGC CTCGCACTGT GTGTACGATG CTCAACAGCC ATTAAATTCG
CACTTGATTC TTGGCCACGA AACGCTAAGC GCCCAAACGA TTATGGATCA GGTTGAAATT
GACACAGATT TGGTAGTGTT AAGTGCTTGT GTCAGTGGAC GCAGTTTTGT GGCGGTCAGC
GACGATCAAT ATGGCCTACA ACGGGCATTT TTATATGCCG GAACCCGTAG TTTACTCTGC
TCGTTGTGGA ACGCCTCGGA TGTAGCGGCG TTGTTCGTGA TGGATCGTTT TTACCGTGAA
TTGCAGGCCG GAGTACGCAT TGCTATAGCA CTCAAACATG CCGTCATCGC TGTACGTGAC
TTGACGCGGG CTGATATTAT TAAACAGTTC CAGCTTTGGC AGCTACCAGC TAGCGCAATT
CCGCTCGAAC CAGACGGCCA GCACAGCGAA AGCCCTTTGG CAGACCCGCG CTTTTGGGCT
GGCTTTATGG TGATTGGCAA AGCCTAA
 
Protein sequence
MVLLDDAFES TVTPALTSYI WLLRWCDSAQ LAHLTPYSPE QIERFWQSAL VIEHPHHGWY 
QLREAPSLNE RPYREHEVFA AAFEYSQQQL NRLEAEAWQF ELQRWLYYLE EYLEVLSARR
DWPTIAAVLE KATSIPQVNL RQQQLLMLYK AIITMRLERQ YDTAQSLLQQ LRDDIQLEAD
LVPMVINSMG TLAWFRGAYD QAIQHYIEQH QHAQQVQNWL YQGHSLLNQS ILSNQLQRPE
YALELSLQAL EALQRAGNRY REAHALYEVG SNLLYLCRWD EADSYFSRSA ALYETLDTIG
DLANLYWHIG FLKHLAGDLP ASEQAYQISI EAARASVVPN DDSLRDSLAF LGLLYCSMHE
YQTALEIYAQ AEALARQHNN RHELALILNQ RGDAERRSGA IDAAYAAFAE SIAIIEDLRT
SFGDEDTKLG LISTAQQVYE HMVVLCIERG DAAQAVNYIE RARSRAFLDA LQAGDESTAI
ELSQQCADLA EIQAQLDQRT AVIEYFTVGV LGRSLRFLAA LAERKSPILH HFSLDPALYS
VAITANNATI HQHSFNPLNL TRGHGGHHRL LQPRILKAIS QALIEPYEAM LATVDLVYVV
PHGPLHDVPF MALQTSDGNW LVREENPAIA LAPSATILVR YALGRAASSQ TQHYCFGYNS
VGAEALTYAE HEAQEIAKLV GGQAWTGALA TDQFLRYAHD ARIIHIASHC VYDAQQPLNS
HLILGHETLS AQTIMDQVEI DTDLVVLSAC VSGRSFVAVS DDQYGLQRAF LYAGTRSLLC
SLWNASDVAA LFVMDRFYRE LQAGVRIAIA LKHAVIAVRD LTRADIIKQF QLWQLPASAI
PLEPDGQHSE SPLADPRFWA GFMVIGKA