Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_4504 |
Symbol | |
ID | 5736355 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 5766968 |
End bp | 5768338 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641281667 |
Product | PT repeat-containing protein |
Protein accession | YP_001547264 |
Protein GI | 159901017 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATCCAA AATTGCAACG ATGGATAACG CAGGTTATAC TCCTTGTAAC GATCGTGGTG GCTGGTGGAG TTAGTACCAC CTTTGCCAAT GTGGGCCAAG CAATCGATTC AGCAGCGGTA GACACAGTAG TTGAGCCAAC CGCTGAACCA AGTGTTGAGC CAACAGCCGA GCCAACCACT GAGCCAACTG CCGAACCAAC TGCAACCGAT CTGCCTAGCA CGCCGATTCC AACCGTAACA CCTGTTTGTA ATCCTCAGAT TGATCTGTCT GGTTGGTTCT CCACGAACTC TCTTGGCAAA ATTCAGAATA AATCGACTAC TTGTGTCTAC ACGGTGGGCA TGGCTTCGTA TCAAAAAGTC GATGAAATTA TTGATCATCA AGTGATTTAT TCGTGGGAAA CCGGACGAAT TGAGCCAAAC CAAATTCTAG CCTTGAACGT TGCAGTTCCT GAGTGTGCCG CCCAAATTGA TTTATTTTAT GGGCCAGTTT TACACTCGCT TGATGGGCAA CGCTATGGTG AGCGATTAAT TACTGCTCGT CATACTGGCG GAATTAACTA TTGTGGTCTT GCTGCGCCAA CCAGCACCGT TGAGCCAACC GCAGAACCGA CGGCAACCAG CACGCTTGCG CCAACCGCAG AACCGACGGC AACCAGCACC GTTGAGCCAA CCGCAGAGCC AACGGCAACC AGCACGCTTG CGCCAACGGC AACCAACACG CCAACCGCAG AACCGACGGC AACCAACACG CCTGCGCCAA CGGCAACCAA CACGCCAACC GCAGAACCGA CGGCAACTCA TACGCCTGCG CCAACTGCCA CCAATCTACC AACATCTACC GCCACCAGAA CACCAACGGC AACGCCGACA ACACCTCCAA CCAGAACACC AACGGCTATT CCAAGCCCAA CATCAACCAA AACACCAACT CCAACGGCAA CGATTCGACC AACCTCTACG CCAACCAGAA CACCAGCACC GACTGCCACC AATACGCCAA CTGGTGCAAA TTGTACCTAT ACTGATGGCT ATTGGAAGAC CCATCCGCGG GAGTGGCCTC TAGGATCGAT GATGCTCGGT GGAGTTCAAT ATAGCCAAAA TCAATTGATG GCGATTTTTA TTATGAACGT TAGAGATGAT ATGAGCTATA CCCTAGCGCA TCAATTGATT GCTGCCAAAT TGAATGTGGC CCAAGGTGCC GATGGTAGTC AGATTAATGG CACTATCGCT GCTGCTGATA TGTGGCTCGA GCAAAATCCT TTGGGCAGCA AGCCGACTGG TTTTATTGCA ACCACTGGCA CTGGCTATAG CTCGACCTTA AATAGCTTCA ATAGCGGTTT ACTTGGCCCT GTTCACTGTA ACAACTATTA A
|
Protein sequence | MNPKLQRWIT QVILLVTIVV AGGVSTTFAN VGQAIDSAAV DTVVEPTAEP SVEPTAEPTT EPTAEPTATD LPSTPIPTVT PVCNPQIDLS GWFSTNSLGK IQNKSTTCVY TVGMASYQKV DEIIDHQVIY SWETGRIEPN QILALNVAVP ECAAQIDLFY GPVLHSLDGQ RYGERLITAR HTGGINYCGL AAPTSTVEPT AEPTATSTLA PTAEPTATST VEPTAEPTAT STLAPTATNT PTAEPTATNT PAPTATNTPT AEPTATHTPA PTATNLPTST ATRTPTATPT TPPTRTPTAI PSPTSTKTPT PTATIRPTST PTRTPAPTAT NTPTGANCTY TDGYWKTHPR EWPLGSMMLG GVQYSQNQLM AIFIMNVRDD MSYTLAHQLI AAKLNVAQGA DGSQINGTIA AADMWLEQNP LGSKPTGFIA TTGTGYSSTL NSFNSGLLGP VHCNNY
|
| |