Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5152 |
Symbol | |
ID | 5737110 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 220882 |
End bp | 222030 |
Gene Length | 1149 bp |
Protein Length | 382 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641282317 |
Product | hypothetical protein |
Protein accession | YP_001547908 |
Protein GI | 159901662 |
COG category | [R] General function prediction only |
COG ID | [COG3889] Predicted solute binding protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.698253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAACACG ATGACGGGTC GTCAGTGCGG AAACGAGGGC GGGTGCGAAC TGGAGTCTGG GTGGGGGTGG GCATGCTTGG GCTGGTGGCG TGGTTGCTCG CCCTGCCAGC GGTCTATCTA GCGAGGGCCG CAACCGATCC GCCGCTGCGT CCGCTGCATG GGGGCGCGCA TCCTGGGGCG GTATGGCCCC TCGGGTCGGT CATGCCCTCG CCAACGGGGA CGGACGGAGC CGGATCAGTC CAGTCCGGAA TCCAATCAAC CCTATGGGAG ACCGCCTACC ATAGTGCCCT TGGGGTCGAT CCGCGCGTGG AACTCGTCCC TGGGCAAACC TATCCCTTTG TGCATGCGGC GCATGGGGTG ATCGACCAAA CGACCGGGGT GACGATTTCC GTGATCAACA CTGAGTCATC CTTCCTGGGA ATAGCGCTGT ACTATGGCAC GATGTCGGGG AGTCAGGCTG GTGAAGCCAC GCCTTGCCCG ACCAATTACC CGGTGTTTTT CAGTGGAAGC TTTCGCGGTC ACCCCCCGCA ATCCGAAGCG TATATCATCT CGAAGTGCAC GATTCCGCCC CATCCAAGCG GGACGCGGGT CTGGTATCGC CTGGTCCTTG TGAAGCCTGG TATCTACCGC TACTTGCAGG CAGATGCTAG TGATGGGTGT GAAGAACTGA CCCCGCTCGG CCAATGTATC CAAGAATCAT TCTTTTTTCC GCGGATTGGC GTAGGAAATT ACTTCTACGA TGTCGGCTAT GCCAGCCCAA CGCCAACGGA GACCCCGACA CCAACCAACA CGCCAACGGA GACCCCGACA CCGACCAACA CACCAACCAA CACGCCAACG GAGACCCCGA CGAATACACC GACCAACACG CCAACCAACA CGCCAACGGA GACATCGACA CCGACCAACA CGCCAACCAA CACGCCAACG GAGACCCCGA CGAATACACT GACCAACACG CCAACCAACA CGCCAACGGA GACCCCGACG AATACACCGA CCAACACGCC AACCAACACG CCAACGGAGA CATCGACACC GACCAACACG CCAACCAACA CGCCAACCAA CACACCGACC AACACACCAA CAGCCACCAT TGTTCTTCCG CCGCGCTACA CCATGTTTTT GCCGTGGGCG CAGAAATAA
|
Protein sequence | MEHDDGSSVR KRGRVRTGVW VGVGMLGLVA WLLALPAVYL ARAATDPPLR PLHGGAHPGA VWPLGSVMPS PTGTDGAGSV QSGIQSTLWE TAYHSALGVD PRVELVPGQT YPFVHAAHGV IDQTTGVTIS VINTESSFLG IALYYGTMSG SQAGEATPCP TNYPVFFSGS FRGHPPQSEA YIISKCTIPP HPSGTRVWYR LVLVKPGIYR YLQADASDGC EELTPLGQCI QESFFFPRIG VGNYFYDVGY ASPTPTETPT PTNTPTETPT PTNTPTNTPT ETPTNTPTNT PTNTPTETST PTNTPTNTPT ETPTNTLTNT PTNTPTETPT NTPTNTPTNT PTETSTPTNT PTNTPTNTPT NTPTATIVLP PRYTMFLPWA QK
|
| |