Gene Information |
Locus tag | Haur_2388 |
Symbol | |
ID | 5734269 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3040804 |
End bp | 3042624 |
Gene Length | 1821 bp |
Protein Length | 606 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 641279529 |
Product | hypothetical protein |
Protein accession | YP_001545156 |
Protein GI | 159898909 |
COG category | [K] Transcription |
COG ID | [COG2378] Predicted transcriptional regulator |
TIGRFAM ID | |
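The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record itself. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 3040804
end_bp = 3042624
gene_length_bp = 1821
protein_length_aa = 606

# Assumption: coordinates are 1-based and inclusive, so both ends count.
span_bp = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop codon
# and does not contribute a residue (assumption about how the protein
# length is counted).
codons, remainder = divmod(span_bp, 3)
residues = codons - 1

print(span_bp, remainder, residues)
```

Under these assumptions the span equals the listed gene length, divides evenly into codons, and gives the listed protein length after subtracting the stop codon.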
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCTATT CTAGCAGCGC TGTCAAGCTT CATGCCACGA ATGCCCTGCA TGCAGTGGCC GTTCGGACGT TGCGCACTCT CGCTCATTAT ACCGATTGTT CGCGCCAACG TAGCCAAACT GGCCATGCTT TAGCCGCTGC GCTCCAGCGC CACTGGCAAA CACCGCACTA TCGCCAGCTG GTGCGCCGTT CCTTAACTGC CGCCGACCGA GCATTGTTGC AGGCATGGTG GCAGGGTCAG CAGCCGTTGC CAACGCCACA AGCGCTTGAT CTCTGGCGCT GGCAGGCTCC TTGGCCTACT CTGGAGCAGC TCTCGTCGGA GCAACGCTTG GCTGCCTTAG GCTTGGTGGT GCCAATCCGC ACGACCACAG GCCGCACGGT GGTCTTAATT AATGATACCA GCCGTTGGTT ACGCCGCACT CCGCCACTGC CACCAACTCC TGTTGCTGCC AGCTTGCAAG CCTTGTTTCA AGCGGTGGTC GCGTTGCTTG CCGCCTGTGC CAATACCCCT CAACCCCGCC AAGCAGCTGG CTTGGCGCTG CATATCGCGC AATCAGCCGG CTGGCTGGCC GATCGGCTTA ATCAATGGCG CATTACGCCG CGTGGTCGGG TTTGGCTGCA TAGCCCAATC GCTGAGCAAC AACGCTTGTT ACACCAACAG CTCATCACCT GTAACCCGCC TGCACGTGGC TTGGTCGCAT GGCGTAGCCC CGATTGGGCG GCATTATTTG CCGATTTGGA ACGGTTGATG GAGGCCCAAG CCCAGCGGCG CAGCATGGAT GTGGCTGCCT TGCTCCACGA TCATCCAGCG TGGAATGGAT TGCCAGCAGC CCAGCAGATT CGGCTCGTGC ATGGTTGGTT GCGCACCGTC TTGCAACCAG CGGGCGTGGT GAGCTTAGCC AAGGGCTGGC TCTTTTGGCA TGGCTGGCAG CAGCTCGCAG CTCAAGCGCC AGCCTTCGAT GGCCTGCGCT TGCCTAAACG TGCGGCGCTC CCCGCAGCCT TACAGGTGTG GGGATTAACT TGGGGGATGG CAACGAGCCA TGGGTGGCGC ATTACCCAAG CATCCGTCGC CGCTGCGCTG GCTAACGGAC TTGATCTCAG TAGTTTTTGG CAGCCGATTG ATCAGTGGTA TGCTGAACGG CCCGCCCTTA TTCAGGCCTT GATCGCAAAA CTTCAGGCCA CGCCGCCATT GCGCCTGCGT CGCATCACGC TGCTTGAGGG TAGCCCCGAA GCCGTGGCAA GCGCCCACGC CAATTGGCAG ATTCAAGCCT ACCTACAACC TGGGTTTGAT CAAGCCCAAC GGGTGGTGTG CCAAGGAGCG GAGCAGGTGG TAGCCAAGGT GTTGGGACTG CATGCCACGC CTACGCCACG GCTCGATACG CAGACGAGCA TACAGATAAT GGCCTTGCGG ATTGCAACTC AGCACCTGCC CAGCCATCGG CTTGCCTTCA ATCAGCAAGC CCAGCGGCTG TTGGCCGAGC TGTCGTTTGA GCAACGGTGC ATCATCGACG ACGATTGGGA ACGTCTCCAA TTAAGTGATG CGCCGCAACC ACTGGCTACG AGCCAGTCGC TCGCGGTTGG GCAGCAACCA CGAGCGCAGA TCACGGTCGA ACAGGCTCGC CAAACATGTC GCCAAGCGAT CAACAACCAG CAAAGCGTGA CCGTGCGCTA TTACACGCCA GCCGAGCATC GCATCACGAC GCGCACGATT CGCCCGCTCG AGCTGACCAG CACCGGGATG CGCGGCTGGT GTGAATTACG CCAACAGGAG CGGGCTTTTC GCTTTGATCG GATTCTGGCA ATCGACCTGA ATTCGAGCTA A |
Protein sequence | MAYSSSAVKL HATNALHAVA VRTLRTLAHY TDCSRQRSQT GHALAAALQR HWQTPHYRQL VRRSLTAADR ALLQAWWQGQ QPLPTPQALD LWRWQAPWPT LEQLSSEQRL AALGLVVPIR TTTGRTVVLI NDTSRWLRRT PPLPPTPVAA SLQALFQAVV ALLAACANTP QPRQAAGLAL HIAQSAGWLA DRLNQWRITP RGRVWLHSPI AEQQRLLHQQ LITCNPPARG LVAWRSPDWA ALFADLERLM EAQAQRRSMD VAALLHDHPA WNGLPAAQQI RLVHGWLRTV LQPAGVVSLA KGWLFWHGWQ QLAAQAPAFD GLRLPKRAAL PAALQVWGLT WGMATSHGWR ITQASVAAAL ANGLDLSSFW QPIDQWYAER PALIQALIAK LQATPPLRLR RITLLEGSPE AVASAHANWQ IQAYLQPGFD QAQRVVCQGA EQVVAKVLGL HATPTPRLDT QTSIQIMALR IATQHLPSHR LAFNQQAQRL LAELSFEQRC IIDDDWERLQ LSDAPQPLAT SQSLAVGQQP RAQITVEQAR QTCRQAINNQ QSVTVRYYTP AEHRITTRTI RPLELTSTGM RGWCELRQQE RAFRFDRILA IDLNSS |
| |
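The relationship between the gene sequence and the protein sequence listed above can be illustrated with a short translation sketch. It assumes the gene sequence is shown 5' to 3' on the coding strand (it begins with ATG and ends with TAA even though the gene lies on the minus strand of the replicon). Translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in the start codons it permits, which this sketch does not model. The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in the conventional TCAG ordering
# (first base varies slowest, third base fastest); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Codons containing unexpected characters are rendered as 'X'.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 21 bases of the gene sequence in the record.
first_bases = "ATGGCCTATT CTAGCAGCGC TGTCAAGCTT"
last_bases = "ATCGACCTGA ATTCGAGCTA A"

print(translate(first_bases))  # MAYSSSAVKL
print(translate(last_bases))   # IDLNSS
```

The two fragments reproduce the first ten and last six residues of the listed protein sequence. The last 21 bases are in frame because the 1800 bases before them divide evenly into codons.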