Gene Haur_3321 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3321 
Symbol 
ID5735191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4186112 
End bp4189537 
Gene Length3426 bp 
Protein Length1141 aa 
Translation table11 
GC content52% 
IMG OID641280468 
ProductTPR repeat-containing protein 
Protein accessionYP_001546085 
Protein GI159899838 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTGA AGGAGTCTGC CTTGGATTCA CGCGAGACAA CGATCAGCCT ATGGTCGCGG 
CGGGTGATGG AAGCCTGCTG GCTATTGGCT TTAGCAATGA TCCCGGTCTA TTTTAGTTTG
CTCAGTGACC GCCACTTTGA GCCAGATAAA GCGGTAGCCT TGCGCTCAAT CGTGATGATC
CTTGGCGGTG CGTGGATTAT CAATTGGCTC GAACGTGGCC AAGTTTTTCG TTCATGGCCG
CGTTGGCGCG ATTGGTGGCG CTCGCCGTTG GTTGCTCCGG CTATCGTTTA TGTTGGGGTC
TTTTTCTTTA CCACGCTCAC CTCGGTGTTG GTGTTTACCA GCTTTTTTGG CGGCTATAAT
CGGCTTCAAG GCTTTTACAC CAATTTCTCG TATGTCGTGG TGTTTGGGGC GATGCTGGCG
CATGTGCGTC GCCGCGAGCA ACTTGAGCGG ATTATTACGG TGATTATTGC CACAACCTTG
CCAACCTTGG GCTATGGTTG GGTGCAATAT CAGCGCAGCG ACCCCTTGCC GTGGGCTGGC
GATACTGCGG CACGGGTTGC CTCAAGCATG GGCAACTCGA TTTTTGTGGC AGCCTATTTG
ATTTTAACCC TGCCCTTTAT GTTCTATCGC TTGATTACCA GCGTGATGGC TACCAAACGT
GCTGAGAGCG AAGCTACAAA CTCGGTTGGC TTAGATGCAG CGTGGTTTGT TACCTTGGGC
TTGATTCCGC TTGGTCAATT AAGCCTCTTG TATGCCACCC TCAAGTTGGG AGCATTGCTG
CAAGCGCCCT TGTTGGGCAT CGGCCATTGG TGGATTTTCC CAATGTCGGT GATTGTAGTA
GGCAGTACCT TGCCCTTAAT TTCGTGGGTA ACCAGCACCC GCAGTCGCAG CGATTGGCGC
TTATATCTGC CCGGCGGCTT GATTTTGCTG TATATGCTGA GTTTAGTGTT GGGTGGGTAT
TTAACTGCCG ACCAATGCCA AGGCACAATT AGCGAAACCT GCTACAACCT TGATATGGCA
ACTGCTAAGC GTGCTAGCGA CTTTCGTACC TGGTTCTTGT TGGCGATGGC GGCCTATTTT
GGTTTTTATG GCTTAGTGCT AGCCTTGCCG CGCCGCAGCG AAGCAGCAGC CCATGCAATT
GTGCAATGGC TCAGTGCTGC AATCTATGCT GGGTTAAGCC TGTTCACGAT TGTCATTATC
TTCTTTACCC AAAGCCGTGG CCCACAAATC GGCATGTTCG TAAGCATTTT CGTCTTCTTC
ACCCTCTTTT TGTTGCAAGG CCTACGCAGC ACCAGCTTCA AACGCATTTT TGGCGCGGCG
CTGAGTGCAT GGGTTGTGCT GGCCTTAGCA GGAGCCGCCT TCTTGGTGGT GCTTAATACA
GACTCAAGCA GTTTTAGTGG GCTACGCCAA AGCAACCGCT ATATCAGCCG TTTGGGCAAC
TTGCTTGAAA CTGATGGCGG CACTGGTTTG GTACGGGTGT TGATTTGGCG TGGCGATGAG
CATACCCAAG GCGCAGTTGG TTTGGCCTTG AGCGATCCGT TACGCACAGT AATTGGTTGG
GGGCCAGAAT CGATGTTTGT GGCCTACAAC CCATTTTATC CACCACGGTT GGCCAATTAT
GAATCGCGCG GGGCTTCGCC CGACCGTTCG CACCAAGCCT TGTTGGATGA ATTGGTGACC
AAAGGCGCGA TTGGCTTGTT TAGCCATTTA TTCTTATTTG GCTCATTCTT AATTATTATG
CTGCGGCTAC TGTGCATTCC ACGACTGATC AATTTAGGCC TAACCAGCCT TTTGATGCTT
GGAATTGGCA TCTTCTTTGC CGTCTTTTTG AAGAGCCTCG CACTTGGTTT GATCGCTGGT
AGTGTGGGTT TGTTGGTGGT TGGCCTCGCC ACGTGGTTGG GCTATGCCAA GCCGCTCGAA
GCATCGCTGA GCTTTACCTG GCAATTGTTG ATTATCACGG TGCTTTCGGC GGTTGCCGCC
AATTTTGTCG AAAACCTGTT TGGGATTCCA ATTGTTTCGT CGTTACTCTA TACCTGGGTG
ATTATGGCGG TTGGCATACT TGCCGGGGCA CACGCTGGAG CCTATCAACT TGGCACAAAG
CCAGTGGTTG TGGCAGCACC AGTTGTCGAA GAAGCCGCCG AATCAACCCC AGCCAAAGCT
GGCACCAAAC GCCAAGCAGC CCAAAATGCT CGTCGCACCC CTGCTGGCCG TGGTCGTACC
AGCAGCGGCG TAACTGGCGC TGCACCTGCT CGCATTTTGT ATGCAGTCGT TGTGCCAATT
GTCTTGCTGT TGGTCTGGTT CCTCAACCTC GATAATATTT TTGCTGATAT GCGCTATTTG
CAGGGCAAAC AATTTATCGA CCAAGGCCAA GGCCTTGATC AACATCTTTT GGGCTTTGCC
GCGATTCAGG ATGCAGTTGA GCATGCACCC AACGAAGATT TATATTTCTT GATGTATGGT
CGGGCCTTGA TGACCTTGGC AACTGATTTG AGCATTGAAC AAAACAAATT GGTTAGCGAA
AATCAAAATG CGGCGATTGC CCAAGTACTG AATAGTCGCC CACGGCCTGA TGCTGAACTC
GCCGATTTGC CCGATGCTGA ATATAGCGTG GCTGGTTTGC AAACCGTTGC CCGCGATTTC
TTGACCAAGT TTGGGCCATT GCAAGTGCTC GATTATGCCC GCTTGGCCTT GGAAGAAGCC
CAGCGGCTCA ATCCCCAGAA CAAAGATCAT CCGGCTAACT TAGGCCGTTT GCATTCATCA
TGGTTCCGTA ACACCGAGCA AAGCGACCCT GAAGGCGCAC GGATGCACCT TGATGCCGCG
ATTGAGGCCT ACAAACAAGC GCACACGGTC GCGCCCCAAG ATGTTGAGTT GACTGGTCAA
TGGGCTATGT TGTATTTGTA TCGCCAAGAA TACGATACAG CGATTGCCGA ATTGACCAAA
GCGACTACGC TTGATCCATT ATGGTCGCTC AATTTCATTC GCTTGGGCGA GGCCTACCGC
CGCAAGGGCG ATTTGCCCAA CGCAGCTTTA GCCTTTGCCA ATGCCTTGGC GCTTGATCCA
CGGGCGCTCA GCAGCAGTGG CTTGGTTGAC GTAGCCGAAT TGCCAGCCGA ACGCACAGCC
CGTGTCCAAG CAACCTTTGC CTCGATGCAA AGCGATCCTG CGGTGTTTGA TAGCTTCTTG
ACAGGCTTTG AGCGAGCGAT TGCCAGCAAG CCAGGCGATA TGTCGTATCG CCAGATTTAC
ACCCAAGTGT TGAGCGATAG CCAACGCTAT GATGCAGGCT TGACCCAAGT CCAACTAGCT
TTGGCCGAAA TGGACAAAAT GGGTGCAGCT GATCCGACAT TCAACACCAC CTATGCTGAT
ACCCGAACGG CCTTTGAAAA GCTAGTTAGC TTTTTTCAAA GCCAACTCGG CCAAAGTAAA
CCATAA
 
Protein sequence
MPLKESALDS RETTISLWSR RVMEACWLLA LAMIPVYFSL LSDRHFEPDK AVALRSIVMI 
LGGAWIINWL ERGQVFRSWP RWRDWWRSPL VAPAIVYVGV FFFTTLTSVL VFTSFFGGYN
RLQGFYTNFS YVVVFGAMLA HVRRREQLER IITVIIATTL PTLGYGWVQY QRSDPLPWAG
DTAARVASSM GNSIFVAAYL ILTLPFMFYR LITSVMATKR AESEATNSVG LDAAWFVTLG
LIPLGQLSLL YATLKLGALL QAPLLGIGHW WIFPMSVIVV GSTLPLISWV TSTRSRSDWR
LYLPGGLILL YMLSLVLGGY LTADQCQGTI SETCYNLDMA TAKRASDFRT WFLLAMAAYF
GFYGLVLALP RRSEAAAHAI VQWLSAAIYA GLSLFTIVII FFTQSRGPQI GMFVSIFVFF
TLFLLQGLRS TSFKRIFGAA LSAWVVLALA GAAFLVVLNT DSSSFSGLRQ SNRYISRLGN
LLETDGGTGL VRVLIWRGDE HTQGAVGLAL SDPLRTVIGW GPESMFVAYN PFYPPRLANY
ESRGASPDRS HQALLDELVT KGAIGLFSHL FLFGSFLIIM LRLLCIPRLI NLGLTSLLML
GIGIFFAVFL KSLALGLIAG SVGLLVVGLA TWLGYAKPLE ASLSFTWQLL IITVLSAVAA
NFVENLFGIP IVSSLLYTWV IMAVGILAGA HAGAYQLGTK PVVVAAPVVE EAAESTPAKA
GTKRQAAQNA RRTPAGRGRT SSGVTGAAPA RILYAVVVPI VLLLVWFLNL DNIFADMRYL
QGKQFIDQGQ GLDQHLLGFA AIQDAVEHAP NEDLYFLMYG RALMTLATDL SIEQNKLVSE
NQNAAIAQVL NSRPRPDAEL ADLPDAEYSV AGLQTVARDF LTKFGPLQVL DYARLALEEA
QRLNPQNKDH PANLGRLHSS WFRNTEQSDP EGARMHLDAA IEAYKQAHTV APQDVELTGQ
WAMLYLYRQE YDTAIAELTK ATTLDPLWSL NFIRLGEAYR RKGDLPNAAL AFANALALDP
RALSSSGLVD VAELPAERTA RVQATFASMQ SDPAVFDSFL TGFERAIASK PGDMSYRQIY
TQVLSDSQRY DAGLTQVQLA LAEMDKMGAA DPTFNTTYAD TRTAFEKLVS FFQSQLGQSK
P