Gene Information |
Locus tag | Haur_2040 |
Symbol | |
ID | 5733929 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 2543870 |
End bp | 2546644 |
Gene Length | 2775 bp |
Protein Length | 924 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 641279184 |
Product | TPR repeat-containing protein |
Protein accession | YP_001544811 |
Protein GI | 159898564 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
| |
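The coordinate and length fields above are consistent with one another, which a short check can illustrate. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 2543870
END_BP = 2546644


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Codon count minus one, assuming the last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 2775, matching "Gene Length"
print(protein_length_aa(length))  # 924, matching "Protein Length"
```

The strand does not affect this arithmetic, since the length depends only on the span between the two coordinates.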
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.962873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAACG CCCTTCCGGA CGCTGTTTTG GCCAAACTCC ATGCCGTGTT TCCCGACCTT GCCGCTGCCG ACCTGCATGC CATCCTTGAA CGGGTGCTCA GCGGTGAGCG GGTCGCCATT GCCGATAAAG TCCTAAGTGT GCAGACGGAT CGCCATGGAC AGGCGATTGC GTTGGGCTGT ACCGTGCTTG GGGATTTAAC CCAAGTGGTG TTCCAGATCC ACCTCCCCGA ACCCATTGAC CTGCTGCCTG CGGCCCTTGA GCGCTTGGCG ACCCTCCCGC TCGAGACGAT TCCTGAGCCA CGGATGGATC TGCCTGCGCG GTCGCACCTT GATTTCCGCC CAAATCCTAA TTTTGTTGGG CGGGCCGCCG CTTTTCGTGA TCTGGCAGCA GCCCTCAAAC GGCATCAAAC CGCTATTTTA ACGCCAGCGG TGGCCGTGGG TGTGGGCGGG ATTGGGAAAA CGAGCCTTGC GGTGGAGTTT GTGTATCGGT ATGGCTGGTA TTTTGCGGGC GGGATTTTCT GGATTAATAG TGCTGATCCG ACGCAGATCG CCAGTCATGT GGCGGCCTGC GCTCCGGCCT TGGGGATTGA TCCCCGGGGG ATGCCGCTTG ATGAGCAAGT GCACCAGGTC CTGCTTGCAT GGCAAGCAGC GATGCCACGC TTGATCGTCG TCGATAATTG TGATGATGCG AAGGTCATTG ATGGATGGCT ACCCGCGATT GGGGGCTGTC GGGTGCTCAT AACCTCGCGA TCCGATCAAT GGGCCAGTGT CCCACTGGTT CGGGTGGGAT TGCTCTCCCC GCGTGAGAGT CGCGCATTGG TACAGCGCCT CTGTGCACGG TTGACCGACA TCGAGGCCGA TGCGATTGCG GAAGATGTCG GCTATTTACC ACTGGCGTTG CATCTCGCTG GGAGTTATCT CACCGCCTAT TCCCATCATA CGGTCGAACA ATACCGTAAG GATTTGACGA TTGCTCATCG CTCGCTCAAG GGACGAGGAG CGTTGCGTTC ACCGACACGG CATGAACAGG ATATTGAAGC CACCTTTATG CTGAGTTTTA ACCAACTTGA TCCGACCAAT GCCCTTGATG CGTTGGCCTT AGGTATGCTT GATGGGGCGG CGTGGTGTGC GCCAGGCGTG CCAATCCCGC GCGATCTGGT ACTGGCATTC GTGCCGGATG AGACGGATGC TGATGATGCG GTTGATGCGT TACGGCGTTT GCAGCAACTC GGCTTGCTGG ATGGAGCAGA TGCTGTGGTG TTGCATCGCT TGTTGGCCCA AGTCGTCGAG GCACGATTAG GATCGACGGA AACGCTAGCC ATGGTGGAAG ATCGGATTGA TGCTGTGGCA TCCCGTGTTA ATGAGATGGG TGTGCCACGT TCGATGCTGT CGCTTGAGCC ACATCTACGC CACACCACCA CACGGGCGCT GAAGCGTGGT GATGCCCTAG CCGCACTTCT TGCCAATAAC CTAGGGTATT TTGAAGCCCT GCGGGGAGCC TATGCCGACG CACAGCCCTT GTATGAACGG GCCTTGGCCA TCAGGGAAGC GGTGTTGAGG GCCGATCATC CCGATACGGC ACAGAGTGTG AACAATCTGG CCTCGGTCTT GTTGCATCAA GGGCGGTATG CCGACGCACA GTCCTTGTTT GAACGGGCCT TGGCGGTGCG GGAAACGGTG TTGGGGGCCG ATCATCCCGA TACGGCGACG AGCGTGAACA ATCTGGCGTT TGTCTTGGAG CGTCAAGGGC GGTATGCCGA CGCACAGCCC TTGTTTGAAC GGGCCTTGGC CATCAGGGAA 
GCGGTGTTGG GGGCCGATCA TCCAGCGACG GCGGTGAGTG TGAACAATCT GGCGGGGGTA TTGTTGCGTC AAGGGCGGTA TGCCGACGCA CAGCCCTTGT TTGAACGGGC CTTGGCCATC AGGGAAGCGG TGTTGGGGGC CGATCATCCA GCGACGGCGG TGAGTGTGAA CAATCTGGCG GGGGTATTGT TGCGTCAAGG GCGGTATGCC GACGCACAGC CCTTGTTTGA ACGGGCCTTG GCGATTTATG AAGCGGTGTT GGGGGCCGAT CATCCCGATA CGGCGGTGAG TGTGAACAAT CTGGCGGGGG TATTGGATCG GCAAGGGCGG TATGCCGACG CACAGCCCTT GTATGAACGG GCCTTGGCGA TTTATGAAGC GGTGTTGGGG GCCGATCATC CCGATACGGC GGTGAGTGTG AACAATCTGG CGGGGGTATT GGATCGGCAA GGGCGGTATG CCGACGCACA GCCCTTGTAT GAACGGGCCT TGGCCATCAG GGAAGCGGTG TTGGGGGCCG ATCATCCCGA TACGGCACAG AGTGTGAACA ATCTGGCGAT GGTCTTGGCG AGTCAAGGGC GGTATGCCGA CGCACAGCCC TTGCATGAAC GAGCCTTGGG CATCTATGAA GCGGTGTTGG GGGCCGATCA TCCAGCGACG GCGGTGAGTG TGAACAATCT GGCGGGGGGC TTGGCGAGGC AAGGGCGGTA TGCCGACGCA CAGCCCTTGT TTGAACGGGC CTTGGCGATT TATGAAGCGG TGTTTGGGGC CGATCATCCC GATACGGCGG TGAGTGTGAA CAATCTGGCG GAGGTCTTGG AGCGACAAGG ACGGTATGCC GACGCACAGC CCTTGTTTGA ACGGGCCTTG GCGATTTATG AAGCGGTGTT GGGGGCCGAT CATCCCGATA CCCAGTTTAT TCGTGCGAAT CTCGTCTCTT TACAAGCCAC CATGACGGCA GACGACCATC TGTAG
|
Protein sequence | MTNALPDAVL AKLHAVFPDL AAADLHAILE RVLSGERVAI ADKVLSVQTD RHGQAIALGC TVLGDLTQVV FQIHLPEPID LLPAALERLA TLPLETIPEP RMDLPARSHL DFRPNPNFVG RAAAFRDLAA ALKRHQTAIL TPAVAVGVGG IGKTSLAVEF VYRYGWYFAG GIFWINSADP TQIASHVAAC APALGIDPRG MPLDEQVHQV LLAWQAAMPR LIVVDNCDDA KVIDGWLPAI GGCRVLITSR SDQWASVPLV RVGLLSPRES RALVQRLCAR LTDIEADAIA EDVGYLPLAL HLAGSYLTAY SHHTVEQYRK DLTIAHRSLK GRGALRSPTR HEQDIEATFM LSFNQLDPTN ALDALALGML DGAAWCAPGV PIPRDLVLAF VPDETDADDA VDALRRLQQL GLLDGADAVV LHRLLAQVVE ARLGSTETLA MVEDRIDAVA SRVNEMGVPR SMLSLEPHLR HTTTRALKRG DALAALLANN LGYFEALRGA YADAQPLYER ALAIREAVLR ADHPDTAQSV NNLASVLLHQ GRYADAQSLF ERALAVRETV LGADHPDTAT SVNNLAFVLE RQGRYADAQP LFERALAIRE AVLGADHPAT AVSVNNLAGV LLRQGRYADA QPLFERALAI REAVLGADHP ATAVSVNNLA GVLLRQGRYA DAQPLFERAL AIYEAVLGAD HPDTAVSVNN LAGVLDRQGR YADAQPLYER ALAIYEAVLG ADHPDTAVSV NNLAGVLDRQ GRYADAQPLY ERALAIREAV LGADHPDTAQ SVNNLAMVLA SQGRYADAQP LHERALGIYE AVLGADHPAT AVSVNNLAGG LARQGRYADA QPLFERALAI YEAVFGADHP DTAVSVNNLA EVLERQGRYA DAQPLFERAL AIYEAVLGAD HPDTQFIRAN LVSLQATMTA DDHL
|
| |
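The sequence fields are displayed in space-separated blocks of ten characters. As a minimal sketch, assuming GC content means the fraction of G and C bases over the full gene sequence, the helpers below (hypothetical names) join the blocks into one string and compute that fraction. The record reports 59% for this gene; the example here uses only the first two blocks, not the whole sequence.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence shown above, for illustration only.
first_blocks = clean_sequence("ATGACGAACG CCCTTCCGGA")
print(len(first_blocks))                   # 20
print(f"{gc_fraction(first_blocks):.0%}")  # 60% for these two blocks only
```

Passing the complete Gene sequence field through `clean_sequence` before `gc_fraction` would give the whole-gene value under the same assumption.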