Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3191 |
Symbol | |
ID | 5735066 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4037333 |
End bp | 4039114 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641280337 |
Product | hypothetical protein |
Protein accession | YP_001545956 |
Protein GI | 159899709 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000124226 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGTCTT TTCGATGGAG TTTACTAGCA TTTCTAGTAG GCAGTGGTTG GTATACTGCT CAACCCCAAG CGAGCAATCT TGCCCCAACC AATCAGATTC AAGTTACCAC GACGGTTGAT GATACCAACC CCAGCAACCA AACCTGTTCG TTGCGCGAGG CGGTTTTGGC TGCTACCAGC GATCAAGCGG TTGATGCTTG CTCGGCTGGC GCTGCGGTTG ATCAGATTTT GCTGCCTGCT GGAGTGTATC GGCTAACCAA CATTGGCCGC GATGACGATC AAGGCCTAAC TGGCGATTTG GATCTGCGCG GATCGCTGCA AATTACGGGA GTGAGTTCAG CGACCACGAT CATCGATGGC AATGTAACTG ATCGCGTGTT GGATTTGCAC GAAGGCAGAT TGACCTTGGA GCATGTAGAA ATCCGCAATG GCTATATTTT GAATAATGAT GATACGGCAA CCTATTATGG GCATGGTATT TTTCAACGCA GTGGTAATTT AAGCCTCGAT CATGTTCGGG TGATTCACAA TGGCATTATT ACCTCGCCAA TTTTGTACGG TTTAGCCAAT TATGGTGGTG GGGTGGCTAG TTTGGCGGGC ATGCTCAGCA TTCAACATAG CTTTTTCAAT GATAACGAGG TGCGCACTAT TTTTATGGGT GGGGTTGGTG TTGGAGGTGG TCTCTACATC AAACAAAGCA CAGTAAGTAT TGCTAACACA ATTTTTGAGG CCGATACAGC GGGCACTGGT ATGGCGATTG CCAACGATGG CGGCAATTTG ACGCTCAGCC AAAGCCAGAT TCGGCTAGCC ATTGGTCAAG GTGAGCAAGG TGCAGCAGTC GATACGATCA AAGGCATGAC CCTGATCGAA ACCAGCGAAT TTCAGCAAAA TCAGCCACGC GCAGTTAAAA TTAACCAAGG GGCTGAGGCC GAAATTATCA ACAGCCTATT TGGGCAAAAT GGTGGGGTTG GCAATTTCTA TTGTGCCAGT GGTGGCGCAA TTGCCAATGC TGGGCGCATA TTGATCAGCG ATTCACGCTT TATCGACAAT TATGCCGATC AAGGCGGCGC ATTAGTCCAA AGCCAAGGTA GCAGCGAAAT TTATCGTAGC GAATTTAGCG CCAATCGGAG TAATGGAGTC AATCGGCTGC GTGAAAACTG CCATGCCTCA GGTGCGGCAA TTAGCCAACA AGCTGGAACC ATGCTGCTCG ACACCAGCAC GCTCGCCTTC AATGATAGCC GGGGTTTGGG TGGAGCCTTG GATCAACGGG GAGTTACTTC AGTCTTGACC CTGACCAATT CGACCGTGGT TTCGAATACC AATCGTTTTG TAACGGGCAT TGGTGGTGCT GGAGTTTCAA TCAGCGGCAC CTTGGCACTG AACAATAGCA TGATCGCCAA TAATTGGCAT ACGCCCAGCC AAACCGCCAA CGATTGTCTC GGCAACCTGA CCAGCCAAGG CCATAATTTG CTGGAGCGAC CGACCGCCCA ATGTCAATTG CTCAATCAGC AAGCCAGCGA TCTGCTGAAT CTTGATCCCT TGCTTGGTGA GTTTGCGCTG CATGGTGGTA GCTCACGTAG TTTTAGTTTG ACCGCTGCTA GTCCCGCGCT TGATGCTGGG CCTGCCGATT GTGGCTTGGT TGATCAACGG CTCTATCCCC GGCCTGTCGA TGGCAATAAC GATCAAACCG TGCGCTGCGA TATTGGGGCA TTTGAAGCTG GTATGATTGC CCAAACCCAG TATTTTAGTT TCTTGCCCTT TGCGCTGGCC GGAGTTCGTT AA
|
Protein sequence | MRSFRWSLLA FLVGSGWYTA QPQASNLAPT NQIQVTTTVD DTNPSNQTCS LREAVLAATS DQAVDACSAG AAVDQILLPA GVYRLTNIGR DDDQGLTGDL DLRGSLQITG VSSATTIIDG NVTDRVLDLH EGRLTLEHVE IRNGYILNND DTATYYGHGI FQRSGNLSLD HVRVIHNGII TSPILYGLAN YGGGVASLAG MLSIQHSFFN DNEVRTIFMG GVGVGGGLYI KQSTVSIANT IFEADTAGTG MAIANDGGNL TLSQSQIRLA IGQGEQGAAV DTIKGMTLIE TSEFQQNQPR AVKINQGAEA EIINSLFGQN GGVGNFYCAS GGAIANAGRI LISDSRFIDN YADQGGALVQ SQGSSEIYRS EFSANRSNGV NRLRENCHAS GAAISQQAGT MLLDTSTLAF NDSRGLGGAL DQRGVTSVLT LTNSTVVSNT NRFVTGIGGA GVSISGTLAL NNSMIANNWH TPSQTANDCL GNLTSQGHNL LERPTAQCQL LNQQASDLLN LDPLLGEFAL HGGSSRSFSL TAASPALDAG PADCGLVDQR LYPRPVDGNN DQTVRCDIGA FEAGMIAQTQ YFSFLPFALA GVR
|
| |