Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4505 |
Symbol | |
ID | 5736356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5768749 |
End bp | 5770110 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641281668 |
Product | PT repeat-containing protein |
Protein accession | YP_001547265 |
Protein GI | 159901018 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCCCGA CACTGCGACG ATGGACAACG CGGGTTGTGC TACTTGCAAC AATGTTGATG GCAGGTGGTC CAAGTGCCAG CTTTGCCAAT GTGAGCAAAG GCGTAGAATC AGCCAGTGTT GAGACAGTCG TTGAACCAAC TGCTGAACCA ACTGCTGAGC CAACTGCTGA ACTAACCGCT GAGCCAACCG CTGAGCCAAC CGGAACAGCA GCCCCAACCG CAGCCCCAAC CGGAACAGCA GCCCCAACCG CAGCTCCAGT CTGTGACCCA AAGGCTGATC TTTCGGGTTG GTTCTCAAGC AATACCGTCG GCAAGATTTG GAATAAATCA AGCACCTGCT CCTACGATGT AGGTATGGCT TCGTATCAAA AATTTGATGA AATCATCGAT CACCAATTGA TTTATTCGTG GGCAATTGGC CGAGTTGGCC CAAAAACAAC TGTTTCTTTA AGCGTTTCAG TGCCCGATTG TGCTACCCAA ATTGATGTAT TCTATGGGCC TGTATTGCAC TCGCTCGATG GTCAACGCTA TGGCGAGCGC TTGCTCTCAT CTCGCCACCT TGGCGGAACC AGCTATTGTG GTGTAACAAC TCCAACCAAC ACGCCAGAAC CAACGGCTGA GCCAACGGTT GCCCCAACCA ATACGCCAGA ACCAACCGCC TACCCAACGC CAACAGCAGA ACCAACAGTT GCGCCAACCA ATACACCAGA ACCAACGGCA GAACCAACAG TTGCACCAAC TAACACGCCA GAACCAACGG TTGCGCCAAC GGTTGCACCA ACTAACACGC CAGAACCAAC GGTTGCGCCA ACCAACACGC CAGAACCAAC TGCCTACCCA ACACCAACCG CTGTGCCAAC CGCGACCAAG ACACCAGTCC CAACCGCGAC GAAGACACCC GCGCCAACTG CAACCAGCAC GCCAGCGCCA ACCGCGACGA AGACACCAGT CCCAACCGCG ACGAAGACAC CCGCGCCAAC TGCAACCAGC ACGCCTACTG GTAAAGATTG TACCTATACC CAAGGCTACT GGAAGAATCA CCCAAGTGCT TGGCCAGTTA CGAGCTTGAG CATTGGTGGT GTGGTTTACA GCCAAAGCCA ATTGATGGCA ATTTTCAACA CCTCGCCACG TGGTGATGCC ACCTACATCT TGGCTCACCA ATTGATCGCG GCCAAGTTGA ATGTTGCTCA AGGTGCTAAT GGCAGCACAG TTAATGCAAC GATCGCTGCT GCTGACGCTT GGTTGCAACA ATATCCATTG GGCAGCAAAC CAAGTGGCTC AGCCTCCAAC ACTGGCACCA GCTACGCCAC TCAATTGGAT AACTTCAACA ATGGCGTAAT TGGCCCAGGC CACTGCGACT AA
|
Protein sequence | MTPTLRRWTT RVVLLATMLM AGGPSASFAN VSKGVESASV ETVVEPTAEP TAEPTAELTA EPTAEPTGTA APTAAPTGTA APTAAPVCDP KADLSGWFSS NTVGKIWNKS STCSYDVGMA SYQKFDEIID HQLIYSWAIG RVGPKTTVSL SVSVPDCATQ IDVFYGPVLH SLDGQRYGER LLSSRHLGGT SYCGVTTPTN TPEPTAEPTV APTNTPEPTA YPTPTAEPTV APTNTPEPTA EPTVAPTNTP EPTVAPTVAP TNTPEPTVAP TNTPEPTAYP TPTAVPTATK TPVPTATKTP APTATSTPAP TATKTPVPTA TKTPAPTATS TPTGKDCTYT QGYWKNHPSA WPVTSLSIGG VVYSQSQLMA IFNTSPRGDA TYILAHQLIA AKLNVAQGAN GSTVNATIAA ADAWLQQYPL GSKPSGSASN TGTSYATQLD NFNNGVIGPG HCD
|
| |