Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0618 |
Symbol | |
ID | 5732516 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 713594 |
End bp | 715330 |
Gene Length | 1737 bp |
Protein Length | 578 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641277745 |
Product | fibronectin-binding A domain-containing protein |
Protein accession | YP_001543394 |
Protein GI | 159897147 |
COG category | [K] Transcription |
COG ID | [COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATTG ATGCGCTAGT TGTTGCAGCG ATTGTCGCTG AATTACAGAT CTTGGTTGGC GGCAAAATTC AACAGGTGGT TTTGCCAAAT CCTGATAGTG TAGGTTTTGA AGTCTATGCT GATGGCCAGC GCCATCAGTT ATTGCTTTCG GCCAATCCTA AATTTGCCCG CATGCATACT ACCCCAACCA AGCTAACTCG TGATCCGAAT GCCGATTCGC CTTTGTTGCT ATTATTGCGT AAATATGTTC GTGGTGGACG GATTACCAAA ATCGAATCTG CGCCATTGGA ACGGGTTATT TCGCTCAGTA TAGCCAAGAT GCCGATTCCT CGCAAGGAAC TTGAGCCTGA CGATGATGAC GACGATGAGG TGATGCTTAC GCCGCGTTAT AGTGAGTTGG TACTAGAAAT TATCGGTCAT TCATCAAATA TTATTTTGGT CGATGATAAT GGCTTGGTCT TGGAAAGTAT TCGTCACTAT AACCCGCAAC GTTCGCAACG CCCAATCATG CCACGTGGCA TGTACGAAGC GCCGCCCAGC CAAGGCAAAT CTGACCCGCT CCAAGCAACC GCTGAACAAA TTGCGGCCTT AGGTGGCGAT TTGGCCAAAG CCTTGGTGAC CGAATATAGT GGCATCTCGC CGCAAACTGG CCGTGAAATT GCTTGGCGGG CAGTCGGCGC AACCAGCGTC GAAATTACGC CAGAACTAGA TTTTGCACAG ATTGCCCAGC TTTTGCGCCA ACTTACCAGC CTTAGCAGGA GCGAGCCAAC CCTTGCCCGC AATGCTGATG GCACGCCAAT TGGGATTGCT GCCTTTAATT TGCAGCACCA AGCGCATACT GAAACCTTCC CCAGCATGAA CGAGGCCTTG GCAACCGCCT TTGCTGAGCT TGATCAGGTG ACAGCGCACG CTCAACGGCG TGAAGCCTTG CTCGAACGGG TGGCTGAAGC TCAGCGCCGC ATCAAAACCA AAGCTGATCA ATTACGCACT CAGTTGGCGC GGGTTGAGCA ACTTGAGCGT TTGCGCTGGG AAGGTGAGAT GATTTTTGGC TATATTTATG CGATCAAACC TGGACAAAGC GAATTGCTGC TTGACCAAGG CGTGATCACG CTTGATCCAA CATTATCGGC GGTCGAGAAT GCTCAAGCAA AATTTCGCGA GTACGACAAA GCCAAAGGTG CATTAGAGGA TGTGCCACAA CTCTTGGAGC AAACCGAGGC TCAAGCCGAA TATTTGCAAC AAACCAACGA TCTGTTGAGT TTGGCCGAGA GTTTTGCTGA AATTGAGCAA TTTGAGCGTG AGTTGATTGC TGGTGGCTGG CTGCGCCAAA CGATTGGCAA AGCCAAAAGC AAGCCCAATT CTAGCGTTGG GCGTGGCCCG TTGCGGGTAA TTTCGCCCGA TGGCTGGACA ATTTTCGTTG GTCGTACCGC TGACCAAAAC GATGAAGTAA CCTTCAAACT TGGTCAGCCC GAGGATTATT GGTTGCATGC CCGTGAACGA ACTGGCGGCC ATGTGATTAT TCGTATGCAA TCGGCGAATG TGCCGCCGCG TACCCTTGAG CAAGCGGCGC AACTGGCGGC CTACTATTCA TCGGCTCGCA ACGATGGCGC AGTTGAAGTC GATATTGCCT TACGCAAACA TGTGCGCAAA ATCAAAGGCG GCCCACCTGG TTTAGTGCGC TATACCGCTG AGCAAACCCT ACGCGTCGCA CCCCAAAAAG AACCGAAGAG AACATAG
|
Protein sequence | MHIDALVVAA IVAELQILVG GKIQQVVLPN PDSVGFEVYA DGQRHQLLLS ANPKFARMHT TPTKLTRDPN ADSPLLLLLR KYVRGGRITK IESAPLERVI SLSIAKMPIP RKELEPDDDD DDEVMLTPRY SELVLEIIGH SSNIILVDDN GLVLESIRHY NPQRSQRPIM PRGMYEAPPS QGKSDPLQAT AEQIAALGGD LAKALVTEYS GISPQTGREI AWRAVGATSV EITPELDFAQ IAQLLRQLTS LSRSEPTLAR NADGTPIGIA AFNLQHQAHT ETFPSMNEAL ATAFAELDQV TAHAQRREAL LERVAEAQRR IKTKADQLRT QLARVEQLER LRWEGEMIFG YIYAIKPGQS ELLLDQGVIT LDPTLSAVEN AQAKFREYDK AKGALEDVPQ LLEQTEAQAE YLQQTNDLLS LAESFAEIEQ FERELIAGGW LRQTIGKAKS KPNSSVGRGP LRVISPDGWT IFVGRTADQN DEVTFKLGQP EDYWLHARER TGGHVIIRMQ SANVPPRTLE QAAQLAAYYS SARNDGAVEV DIALRKHVRK IKGGPPGLVR YTAEQTLRVA PQKEPKRT
|
| |