Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1146 |
Symbol | |
ID | 5733038 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1312730 |
End bp | 1314370 |
Gene Length | 1641 bp |
Protein Length | 546 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641278285 |
Product | TROVE domain-containing protein |
Protein accession | YP_001543922 |
Protein GI | 159897675 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTGCTT TGTTGAGAAG AGTCGTTTTC CTACCTTATA ACTTGGAGGT GGCCATGAAC ATTGTTCGCC GCCTGTTTGC TCGCACGCCT GATGCAGCAC ATGGGCTTAA CCACGAGGGC TTCCCTACCT ATCAACGCAG TTTGGCTGAA CGGTATATGC AAACTTTGTT GACCAATACC ATTGGCTCAA CCTTTTACGC CTCGCAAGGT AGCAATTATG CCTTGGCCTT GGAGTTGCAT CAAGCCATGT TAGCCCATAA CCCACTCTTT GCTGCAAAAG CCTTAGTGTA TGCTCGCGAA CAAGGAACCA TGCGCCTTCA GCCAATTATC GGCTTGGTTG TGCTATCAAC TGTCGATTTG GGCTTGTTTC ACCTAATATT CAAGCGCATT ATTCTAACGC CAGGCGATTT GCAAGATTTT GTGCAAATTG TGCGTTCGCG TCAGATTCGG CCTGGTATGG GTCGGGCAAT CAAGCAAACA ATCAATGATT GGTTGCTTAA CCTGAGCGAA TATCACGTAA TTAAATATGG TGGCACGAAT GCAGGCAGCA TGACCCTACG CGATGTGCTA CGCCTAACGC GTCCGCAACC TATCGATGAT CGTACTAATG CTTTGTTCAG TTATTTGATC GATCGTGAGC GCTGGCGCAC AACTTGGGCT GAGCAAGCAT CCACGCTGTT GCCGCAAATC GCAGCGGTCG AGCAACTCAA GCGCACGAGC GATCCGACTG AGCAGCGAGC GTTGGTTGAG GCTGGTCGTC TGCCCTACGA AATTGTCACA GGTACGGGCA AGCCAGATTT GGCCATGTGG CGAACGTTGA TCGAGCAAAT GCCCTATTTG GCCTTGCTAC GCAATTTGGC TAGCCTACAA CGAGCAGGTG TGTTCCACGA TGCGGCGATG ATTGAGTATG TGGTTGGGCG TTTGGGCGAC CTTGAGGCCT TGCGCCGCGC CAAGATTTTG CCCTTCCGTT TGCACGCAGC TTGGTTGGCC TTCACGCCAC TAAGCGAGCA GGAAAAGCTG ATTCAGCAAA CGCTTGAGCA GATGATCGAA ATGGCCTTCG TCAACATGCC CGAAATTCCT GGGCGGGTCG TGGTTGCCCC AGATGTTTCT GGCTCGATGC GCGGCTCTAT CAATCCGAAG TCGCAAGTAC GTTATGTCGA TGTTGCAGGC ATTTTCGCTG GCTCGCTCTA TCGCAGCAAC CCAACCGCCC AACTGCTACC TTTTAATACC AGCATTGTTC AGATGGAGAC TTGGCGCGAA ACCAAATTGA TGTGGTTGAC AAAGCAAATT ACGGCCAAAC TTGGTGGTGG AACCGCGGTT TCCGCCCCAA TTTCCTACTT GTACGAGCGC CGTGAGGTGG TCGATGTAGT AATTGCGATT ACTGACAACG AAGAATGGGC ACGTGATAGC GATAGTGGAA CAAGTTTTGT CAGTGTCTGG CGTAAATATT TGGCCAAGGT TAATCCCAAA GCTCAAGCAT TTTTAATCAC GATTGCGCCC TATCCACACG CGGTTGCCCC GCCCGATGAG CCAAATGTCA GCTTTATTTT TGGCTGGGCC GAGCATGTGC CAGCCTATAT CGCCCAAAGC TTGCTTGGAT ATGCCGATCA GCTGAGCACG ATCGAGCAGA TTACACTCTA A
|
Protein sequence | MVALLRRVVF LPYNLEVAMN IVRRLFARTP DAAHGLNHEG FPTYQRSLAE RYMQTLLTNT IGSTFYASQG SNYALALELH QAMLAHNPLF AAKALVYARE QGTMRLQPII GLVVLSTVDL GLFHLIFKRI ILTPGDLQDF VQIVRSRQIR PGMGRAIKQT INDWLLNLSE YHVIKYGGTN AGSMTLRDVL RLTRPQPIDD RTNALFSYLI DRERWRTTWA EQASTLLPQI AAVEQLKRTS DPTEQRALVE AGRLPYEIVT GTGKPDLAMW RTLIEQMPYL ALLRNLASLQ RAGVFHDAAM IEYVVGRLGD LEALRRAKIL PFRLHAAWLA FTPLSEQEKL IQQTLEQMIE MAFVNMPEIP GRVVVAPDVS GSMRGSINPK SQVRYVDVAG IFAGSLYRSN PTAQLLPFNT SIVQMETWRE TKLMWLTKQI TAKLGGGTAV SAPISYLYER REVVDVVIAI TDNEEWARDS DSGTSFVSVW RKYLAKVNPK AQAFLITIAP YPHAVAPPDE PNVSFIFGWA EHVPAYIAQS LLGYADQLST IEQITL
|
| |