Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3344 |
Symbol | |
ID | 5735214 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 4215732 |
End bp | 4217276 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641280491 |
Product | PT repeat-containing protein |
Protein accession | YP_001546108 |
Protein GI | 159899861 |
COG category | [R] General function prediction only |
COG ID | [COG5401] Spore germination protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGGTG AGGCTCACGA CATCCCAGTG CCCCAAGCCT CGTTTTGTCT AAGTAGCAAC CATGTGCGTT GCCCATTATA TGCAGGTGAA GATCTGCCGG TTGCGCAGGT TATCAGCACG CCTACGCCAG TTGCGGTGGG TGGTTGGCGC GGCTGGCTGG CTGGTTTATC GACCCGCGAT CGCCGCATTT ATGCCACCTT GGTGGGCCTA CTTGGCTTAA TTATTGTGGC CTATGCGATT AGCGGCGTTG TTTTATTTAG CAACCCCGAT AACCCTGCCA CGCCTAGCGC TACCTCGCAA GTGCTTCAGC CAACATCCGA TAGCCCAACA TTAACGGTTT CACCATCGCC AAATGCCTTT GCCACAGCAG CGGTTCGTCA AACTCAAACA GCCGAAGTTA TTGCTCAAAC CACTACCGTC ACTCCCTCGG TCTCTGCTAC GTCATCGGCT TCTGCAACCA CTCAGGTGAT TCTTGCATCG CCAACCTTTG TTATTGTACC GCCAACTGAA GATGTGATTG TCGCCTCTGC TACTCCTAGC ATTCCGTTTG CCACCGATCT CCCAACCTTT GAGCCAACGT TATCGCCAAT TGCGACAACT GCCGTGCCAA CGCTTGAACC AACCGCAGAA CCGACTGTTG AACCAACTCT TGAGCCAACG CTCGAACCAA CAGTTGAGCC GACTGTCGAG CCAACGCCTG AACCGATCCC TGAGCCAACT GCTCAGCCGA CCGAGGAAAC TGGCGGTCGC GAGGTTAATC AATTAACCTT GTTTTTTGCC GATAGCACTG GCCAAGTGTT AGTGCCAGTC TCGCGCCAGA TTGCGGCAAC TCGTCAGTCA CGGACTGCCG CAATCCAACA GTTAATTCAA GGTGCACGCA GCGATTTGCG TAGTTTGTTG CCCAGCGATA CCCAATTACT TGGGCTACGC TTGAATAATG GCATTGCTAC CGCTAATTTT AACCGTATCC CGACGTTTGG CAATTCAAGC CTCGAAGATT TGGGTTTGCG TTCGATTGTG TTGGCCTTGA CTGAGCAACC AGAGGTTAAG CAGGTGCAAA TTCAAGTCCA AGGCCAAAAT TTAGGTGGCC TGCGCTATCG TCCCAATGTC AACCCCGATA ATCCGCAGGG TTTAAATGGT CAGTTTAACA CAACTTCGTT CTTGCCGTTA TATTTTCAGC AAAGTAGTGG CCGTTGGGTG CGGGTGATGC GGCTTGTGCC AAGCACCAAA ACCGAGGCCC GCGCTACCGT CAATGAGCTG ATTCGCGGAG CTGGCCGTTA TAGTCATGTT GTTAGTAGTG CCATCCCGAG CGCCAGCCAA GTACGGCGTT TGGTGATTGT TGATGGGGTT GCTCAACTTG ATCTTAGCGC TGAATTCAGC CAAACCAGCA ATCCGCAGGC GGCGGTTGAT GCCTTGGTCT TGGCGTTAAC TTCGTTCAGT AGTGTGCAAC AGGTACAGAT TACCGTCGAA GGCCAATCGC TCAGCAGCAT TTGGGGCGCA ACATTCAGCA ATCCTTTCGT TCGCCCACAA CTTAACCCTG AATAG
|
Protein sequence | MTGEAHDIPV PQASFCLSSN HVRCPLYAGE DLPVAQVIST PTPVAVGGWR GWLAGLSTRD RRIYATLVGL LGLIIVAYAI SGVVLFSNPD NPATPSATSQ VLQPTSDSPT LTVSPSPNAF ATAAVRQTQT AEVIAQTTTV TPSVSATSSA SATTQVILAS PTFVIVPPTE DVIVASATPS IPFATDLPTF EPTLSPIATT AVPTLEPTAE PTVEPTLEPT LEPTVEPTVE PTPEPIPEPT AQPTEETGGR EVNQLTLFFA DSTGQVLVPV SRQIAATRQS RTAAIQQLIQ GARSDLRSLL PSDTQLLGLR LNNGIATANF NRIPTFGNSS LEDLGLRSIV LALTEQPEVK QVQIQVQGQN LGGLRYRPNV NPDNPQGLNG QFNTTSFLPL YFQQSSGRWV RVMRLVPSTK TEARATVNEL IRGAGRYSHV VSSAIPSASQ VRRLVIVDGV AQLDLSAEFS QTSNPQAAVD ALVLALTSFS SVQQVQITVE GQSLSSIWGA TFSNPFVRPQ LNPE
|
| |