Gene Information |
Locus tag | Haur_0195 |
Symbol | |
ID | 5732041 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 228345 |
End bp | 231122 |
Gene Length | 2778 bp |
Protein Length | 925 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641277319 |
Product | TPR repeat-containing protein |
Protein accession | YP_001542975 |
Protein GI | 159896728 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
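The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the Protein Length excludes the terminal stop codon; neither convention is stated in the record. The function names are hypothetical and chosen only for illustration.

```python
# Values copied from the Gene Information table.
START_BP = 228345
END_BP = 231122
GENE_LENGTH_BP = 2778
PROTEIN_LENGTH_AA = 925


def span_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one for the stop codon (assumed convention)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)                  # True
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)     # True
```

Under these assumptions the reported values agree: 231122 - 228345 + 1 = 2778 bp, and 2778 / 3 - 1 = 925 aa.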
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.39307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGCGGCGTT TTTTCCTGTT TGGCCTGGCT TTGCTGGGGT TGTGGCTGGG GCTGATAGCT TGGTTTAGCT ATCAGCTCGC TGATCAACGC CAAAGCCTAC ACACCTTAAT GCAAACCCGA CCATTGCAAA GCAGCCAAGC CCAAGCGTTT TGCAGTGAAC TTGATTCAAC TCACAGCAAT ATGGTGTGGC TGCGCCGATT GAGTTTGCCG TTACACCCCA TTTTGCGCCA GAGCGATACC TGGCTGGGTG CGCTGCCCGC CGCCAGTGAT GCTGCGCAAC AAGGGGCAGA GTTGGCCCAA GGTTATTGTC AACAATTCAA TCCGCTGATT GCTATACTCG ATTTACCAGC CAACCAACGA ACCAAGGCGC TGCTTGATTG GCTGAGTGCT AACCCGCCAA ATTGGCTTGA GTTACAAACT GAGCTGAGTC AATTGCAGCA AACCTGGCAA ACTGTGCCCA GCAACATCCA AACAGCACCA TTTTTTCGCA ACTATCAAAC CCAATTAGCC CAGTTCGATC AGCAATTAAC CTCAGCCCAA ACTACGCTTG GCTTGCTTGA ACAAGCTTGG CCCTTGGTCG AGGCTGGTTG GGGCTGGCAA AAGCCGCTAC GCTTGCTGAT TGCTGGTCAA AATCCGTTGG AGTTGCGGCC AAGTGGCGGC TTTATTGGCA GTATTGCGAC GCTTACAATC GAGCAAGGCC AAATTCGCAC TATGGCCTAC TTCAATAGTG CCGATTTTGC AGCAGTTGCG CCGACTGGCT CGGCCATGCC CAAGCCCTAC AACGATTATT TACGAGCCTC AATTTGGACG CTCCGCGATG CCAACTGGTG GCCCGATTGG CCGACCTCGG CTCAAAGTTT GCAAACATTT TGGCAATTAA ACCAACAGCC TGAGGTTGAT GCAGTTATCG CCCTCGATCT ATATGCCTTA CAGGGTTTGA TTCAGGTTTT AGCGCCGTTG GAGATCGCAG GTTACGGCCA AATCAGCCAA GCTGAGAGCC TTGAGCAAAT TTTCGGGCTG TACGATGGGC GCAGCGTCAC TGGCGATAAA CAATTTTTGG CCGCCTTGTT CAACAGCACC TTGGAAACTG CCCGCCATGC CAGTTTTAGC CAATGGCTGG GAATTGGCGC AAGTTTGCAG CAAGCCTTGC AGCAACGCCA TCTCAGTATC TATTTTAACG ATCAACCAAG CCAAGCTTTG ATGCTCGCGA ATGGTTGGGC TGGCACGATG CCAGCGCTTG AACACGATGT CTTGGCCTTA GTTGATGCTG ATCTATCCTA CAGCGACGGC CAGAATTTTA TTGAACAACG GATGCAGCTT GAGGTGCAAC TTGATGCCCA AGCCCGACCA CTGACGAACA CCCTCACTAT CACGTATACC AACCGCTACG ACGATTGGCG AGCCGATTTG AGCAAACACG CGGTTTATGG CTATTGCTAC AACGTTAAAT TGGCAGTGCA ACAACGTATT CCAGGCTGTT ATGGCGATTA TGCTCGTGCT TATTTGCCAA TTAATGCGAT TCCCTTGAGC TTGGATGGTG CAGACACGCC GCCGGATCTC ACGCAGGAAG GCCAATTTAC CAGCGTTGGT TGGTATATGC TGCTCTACCC AGGTCAAACT CGCACAATTC GCCTGCGCTA TTTGCCAAAT ATTCAAAGCC AGCCTTATCA ATTAACGTGG TTCAAACAGG CTGGAACCTT GGGCCATCCG ATCACCTTAA TCATCAATCA AGCCAATCTG CAAGCGCAAT GGCATGGCTC GTTGCGCCAC GATCGCCAAC TGCGGTTTGA GGCTGGCACA 
ATTCAAGCGT CAGCCAGCGA TTCACCAGCA CCCGAGCCAC AACAATCAGC TGAGCAGGCT TGGCAACTGT GGCAACAAGG CCAAACCAGC GTCGCGCTCG ACCTCTGGCA AAGCAGCAAC ACGCTTGATC GCGCTTTGGA TTGGGTGGTG GCGCTGCGTT GGACAGACGA TCCAGGAACT GCTAACCAAT TCTTGAGCCA ACTCCAGCCA TTGCTGCCTA ATTCTGGTCG GGCGGCGTTT TTGGCGGGCT GGCTCGCCGA ATTGAATGAC GATCAGCCCA CAGCCCTGCA AGCCTATCAA ACTGCCTTGG AGCATGAGCC AACTAGTCAA GCGGCGCGTT TGGCCTTGGC CTTGCTCCAA CTCCAGCTTG GCGATGCTCA GGCCGCTCAA ACCACTTTGC AACAGCTGGA AAATCCTCGT TTGGCCTTGC AGCGCTTGGC CTTTGATCAA CGTATGGCTG GCGATTTGGC GCAGGCTGAG CGCTATTATC AGCTGCTTTT GACCCTCGAC CCGCGTGATC GCGAAGTATG GGAAGAGCGG TATTGGCTGC GACGTTATGC CAACGATCAG CCCGATTGGC AGGCCGTCGA ACAATTGGCG AATCAAGCAA TTGGCATATT TGCCAATGAT GCCCAATGGC TCAGCCGCCG CGCCGAGAGC TACGAGCGCC AAAATCTGCC GCAACAGGCG ATTGCCGATT GGCAGCAAGT TACCACGATT TCGCCGACCA ATAGTTTGGC TTGGTATTAT TTGGGCTTGC AACAACGGGC GCTTGGCGAT TGGTCGGCTG CTCAATCCTC ACTTGAAACG CTGATTGCGC TTGATCCACA GGCTGATTAT TATTTGGTGC TGGGCGATAC CTTGCGCGAA TTGCAATTAT TCGATGCTGC GCGTGAAGCC TATGCCAATG CTGCCAAGCT TGAGCCTGAA CACCCTGGTT TGGCCGAGCG CTTACGTTTG CTGGAAGCTA GCCCGTGA
|
Protein sequence | MRRFFLFGLA LLGLWLGLIA WFSYQLADQR QSLHTLMQTR PLQSSQAQAF CSELDSTHSN MVWLRRLSLP LHPILRQSDT WLGALPAASD AAQQGAELAQ GYCQQFNPLI AILDLPANQR TKALLDWLSA NPPNWLELQT ELSQLQQTWQ TVPSNIQTAP FFRNYQTQLA QFDQQLTSAQ TTLGLLEQAW PLVEAGWGWQ KPLRLLIAGQ NPLELRPSGG FIGSIATLTI EQGQIRTMAY FNSADFAAVA PTGSAMPKPY NDYLRASIWT LRDANWWPDW PTSAQSLQTF WQLNQQPEVD AVIALDLYAL QGLIQVLAPL EIAGYGQISQ AESLEQIFGL YDGRSVTGDK QFLAALFNST LETARHASFS QWLGIGASLQ QALQQRHLSI YFNDQPSQAL MLANGWAGTM PALEHDVLAL VDADLSYSDG QNFIEQRMQL EVQLDAQARP LTNTLTITYT NRYDDWRADL SKHAVYGYCY NVKLAVQQRI PGCYGDYARA YLPINAIPLS LDGADTPPDL TQEGQFTSVG WYMLLYPGQT RTIRLRYLPN IQSQPYQLTW FKQAGTLGHP ITLIINQANL QAQWHGSLRH DRQLRFEAGT IQASASDSPA PEPQQSAEQA WQLWQQGQTS VALDLWQSSN TLDRALDWVV ALRWTDDPGT ANQFLSQLQP LLPNSGRAAF LAGWLAELND DQPTALQAYQ TALEHEPTSQ AARLALALLQ LQLGDAQAAQ TTLQQLENPR LALQRLAFDQ RMAGDLAQAE RYYQLLLTLD PRDREVWEER YWLRRYANDQ PDWQAVEQLA NQAIGIFAND AQWLSRRAES YERQNLPQQA IADWQQVTTI SPTNSLAWYY LGLQQRALGD WSAAQSSLET LIALDPQADY YLVLGDTLRE LQLFDAAREA YANAAKLEPE HPGLAERLRL LEASP
|
| |
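The relationship between the gene sequence and the protein sequence can be reproduced with a short translation routine. This is a minimal sketch, not the database's own pipeline: it uses the standard codon assignments (which translation table 11 shares for elongation codons), strips the whitespace that groups the sequence into blocks of ten, and, as a simplifying assumption, reads the first codon as methionine whatever it is (here the gene starts with TTG). The names `translate_cds` and `gc_percent` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases cycled as T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is reported as 'M'.
    """
    seq = clean(dna)
    protein = []
    for index in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[index:index + 3]]
        if aa == "*":
            break
        protein.append("M" if index == 0 else aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(dna)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence above.
    print(translate_cds("TTGCGGCGTT TTTTCCTGTT TGGCCTGGCT"))  # MRRFFLFGLA
```

Applied to the first 30 bases, the routine yields MRRFFLFGLA, which matches the start of the listed protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.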