Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2436 |
Symbol | |
ID | 5734317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3120861 |
End bp | 3122681 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641279577 |
Product | hypothetical protein |
Protein accession | YP_001545204 |
Protein GI | 159898957 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTATT CTAGCAGCGC TGTCAAGCTT CATGCCACGA ATGCCCTGCA TGCAGTGGCC GTTCGGACGT TGCGCACTCT CGCTCATTAT ACCGATTGTT CGCGCCAACG TAGCCAAACT GGCCATGCTT TAGCCGCTGC GCTCCAGCGC CACTGGCAAA CACCGCACTA TCGCCAGCTG GTGCGCCGTT CCTTAACTGC CGCCGACCGA GCATTGTTGC AGGCATGGTG GCAGGGTCAG CAGCCGTTGC CAACGCCACA AGCGCTTGAT CTCTGGCGCT GGCAGGCTCC TTGGCCTACT CTGGAGCAGC TCTCGTCGGA GCAACGCTTG GCTGCCTTAG GCTTGGTGGT GCCAATCCGC ACGACCACAG GCCGCACGGT GGTCTTAATT AATGATACCA GCCGTTGGTT ACGCCGCACT CCGCCACTGC CACCAACTCC TGTTGCTGCC AGCTTGCAAG CCTTGTTTCA AGCGGTGGTC GCGTTGCTTG CCGCCTGTGC CAATACCCCT CAACCCCGCC AAGCAGCTGG CTTGGCGCTG CATATCGCGC AATCAGCCGG CTGGCTGGCC GATCGGCTTA ATCAATGGCG CATTACGCCG CGTGGTCGGG TTTGGCTGCA TAGCCCAATC GCTGAGCAAC AACGCTTGTT ACACCAACAG CTCATCACCT GTAACCCGCC TGCACGTGGC TTGGTCGCAT GGCGTAGCCC CGATTGGGCG GCATTATTTG CCGATTTGGA ACGGTTGATG GAGGCCCAAG CCCAGCGGCG CAGCATGGAT GTGGCTGCCT TGCTCCACGA TCATCCAGCG TGGAATGGAT TGCCAGCAGC CCAGCAGATT CGGCTCGTGC ATGGTTGGTT GTGCACCGTC TTGCAACCAG CGGGCGTGGT GAGCTTAGCC AAGGGCTGGC TCTTTTGGCA TGGCTGGCAG CAGCTCGCAG CCCAAGCGCC AGCCTTCGAT GGCCTGCGCT TGCCCAAACG TGCGGCGCTC CCCGCAGCCT TACAGGTGTG GGGATTAACT TGGGGGATGG CAACGAGCCA TGGGTGGCGC ATAACGCACG CATCGGTTAC CGCTCGCTTG CAACAGGGGC TTGATCTCAA TGGTTTTTGG CAGCCGATTG ATCAGTGGTA TGCTGAACGG CCCGCCCTTA TTCAGGCCTT GATCGCAAAA CTTCAGGCCA CGCCGCCATT GCGCCTGCGT CGCATCACAC TGCTTGAGGG TAGCCCCGAA GCCGTGGCAA GCGCCCACGC CAATTGGCAG ATTCAAGCCT ACCTACAACC TGGGTTTGAT CAAGCCCAAC GGGTGGTGTG CCAAGGGGCG GAGCAGGTGG TAGCCAAGGT GTTGGGACTA CATGCCACGC CTACGCCAAG CCTCGATACG CAGACGAGCA TACAGATAAT GGCCTTGCGG ATTGCAGCTC AGCACCTGCC CAGCCATCGG CTTGCCTTTA ATCAGCAAGC CCAGCATCTG CTGGCCGAGC TGTCGTTTGA GCAACGGTGC ATCATCGACG ACGATTGGGA ACGTCTCCAA TTAAGTGATG CGCCAGACCT ACTAGCGAGC AGTCAAGCGC TTGCCGTTGG GCAACAACCA CGAGCGCAGA TCACGGTTGA ACAGGCTCGC CAAACATGTC GCCAAGCGAT CAACAACCAG CAAAGCGTGA CCGTGCGCTA TTACACGCCA GCCGAGCATC GCATCACGAC GCGCACGATT CGCCCGCTCG AGCTGACCAG CACCGGGATG CGCGGTTGGT GTGAATTACG GCAACAGGAG CGGGCTTTTC GCTTTGACCG AATCTTGGCG ATTGAAGCCA ATACCAGTTA A
|
Protein sequence | MAYSSSAVKL HATNALHAVA VRTLRTLAHY TDCSRQRSQT GHALAAALQR HWQTPHYRQL VRRSLTAADR ALLQAWWQGQ QPLPTPQALD LWRWQAPWPT LEQLSSEQRL AALGLVVPIR TTTGRTVVLI NDTSRWLRRT PPLPPTPVAA SLQALFQAVV ALLAACANTP QPRQAAGLAL HIAQSAGWLA DRLNQWRITP RGRVWLHSPI AEQQRLLHQQ LITCNPPARG LVAWRSPDWA ALFADLERLM EAQAQRRSMD VAALLHDHPA WNGLPAAQQI RLVHGWLCTV LQPAGVVSLA KGWLFWHGWQ QLAAQAPAFD GLRLPKRAAL PAALQVWGLT WGMATSHGWR ITHASVTARL QQGLDLNGFW QPIDQWYAER PALIQALIAK LQATPPLRLR RITLLEGSPE AVASAHANWQ IQAYLQPGFD QAQRVVCQGA EQVVAKVLGL HATPTPSLDT QTSIQIMALR IAAQHLPSHR LAFNQQAQHL LAELSFEQRC IIDDDWERLQ LSDAPDLLAS SQALAVGQQP RAQITVEQAR QTCRQAINNQ QSVTVRYYTP AEHRITTRTI RPLELTSTGM RGWCELRQQE RAFRFDRILA IEANTS
|
| |