Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2549 |
Symbol | |
ID | 5734427 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3276091 |
End bp | 3277719 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641279689 |
Product | hypothetical protein |
Protein accession | YP_001545315 |
Protein GI | 159899068 |
COG category | [R] General function prediction only |
COG ID | [COG0433] Predicted ATPase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.170922 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACATG CAACACGGAT TGGGCTAGTA ACCAGTGGCT CGTTATTGGA AGGCTTAACT GCCCGTTTAG ATGAACGCTA CGAAATTGAG CGTTTGCGCG TTGGTCAATT TATGGTGGTG CAAGGCCGCC AAAACCGCTT TTTTTCAATG TTGACCGATG TGCAACTGGC CGCTACCAGC CTTTCGATTT TGGCCGATCC GCCAGATGAT GAGCACCCGT TGTTGCGTGA GATTTTGGCT GGGCGTAACA CCTATGGCAC ATTTAAATTA ACCCCTCAAT TGATGTTGCC CGAAGATTCA CTCGAAACAC CTCGCCCAGT TAAGACTATT CCTGCCCATT TTGCGCCAAT TTACGAGGCC AGCGAAGATG ATTTTGGCTT GGTGTTTGGG GCTGAGGGCG ATGGTAAGTT TCAAATGGGC ACGCCGCTGG ATATGGATGT ACCAGTCTGT ATCGATCTTG AGCGCTTTGT TGAGCGCTCG AATGGTGTGT TTGGTAAATC GGGCACAGGT AAATCATTCT TAACGCGTTT ATTATTATGC GGCGTGATTA AACATAATGC TGCGAGTAAT TTGATTTTTG ATATGCACTC CGAATATGGT TGGAGCGGCA CAACCGAGGA TAAGATTCAA GAAGTTAAGG GCTTAGCGCA GCTTTTTCCT GGCCAAGTCT ATATCTACAC GCTTGATCCT GAGTCGTCGC GGCGGCGCGG AGTAAAATAC GATGGTGATA TTACGATTGG CCTGAATGAA ATTCGGGTTG ATGATATTTT ATTATTGCAA GATGCGCTTA ATCTCAATCC GACTGCGGCA GAATCGGCCT TTATTTGTGC TCAGCGTTTT GGCGACGATT GGATTCAAAA ACTGCGTGAA TTAGATACCG AACAACTCAA AGAGTTTGTC GAATCGACAG GCGCAAATAT GTCGTCGATG TCAGCACTTT CGCGCAAACT AGCTCAGCTT GAGCAACTCA AATTTGTCAC TCGCAAATCG AGCCAATCAT CAATTCGTCA AATTATTGAT GCCTTGTTGG CTGGTAAAAA TGTAGTGGTA GAGTTTGGTC AATATCGCAG TGAATTGGCC TATATGTTGG TTTCAAATAT TCTGACGCGC CTGATTTATG ATGAATGGGT ACGGCGTACC GAAACCTTTC TAGCCACAAA AAAATCGAGC GATAAACCGC CGCAACTGAT GATTACGATT GAAGAAGCGC ATAATTTTCT TACGCCCAGC CTGGCCAAAC AAACCATTTT TGGCAAAATT GCCCGCGAAT TACGCAAATA TTCGGTCACG TTGTTGGTGG TTGATCAACG GCCATCGTCG ATTGATAACG AAGTGATGAG CCAACTTGGC TCGCGGATTA CTGCTTTGCT TAACGATGAT CGTGATATTG ATGCGGTATT TATGGGTGTT GGTGGCTCGA AAGGCTTGAA AACCGTGTTG GCCTCGCTTG ATTCGCGCCA GCAAGCCATG ATGTTGGGCC ATGCCGTGCC GATGCCAGTG GTGATGCGTA CCCGAGCCTA TGATAAAGCT TTTTATGAAG CGATGATGCA GGGTAATCGA CGACGGCCTA AGCCTATTCC AATTACCGAT GACGATGCTA ATGATGATTT ATTTGGATCA AGGCAATAA
|
Protein sequence | MAHATRIGLV TSGSLLEGLT ARLDERYEIE RLRVGQFMVV QGRQNRFFSM LTDVQLAATS LSILADPPDD EHPLLREILA GRNTYGTFKL TPQLMLPEDS LETPRPVKTI PAHFAPIYEA SEDDFGLVFG AEGDGKFQMG TPLDMDVPVC IDLERFVERS NGVFGKSGTG KSFLTRLLLC GVIKHNAASN LIFDMHSEYG WSGTTEDKIQ EVKGLAQLFP GQVYIYTLDP ESSRRRGVKY DGDITIGLNE IRVDDILLLQ DALNLNPTAA ESAFICAQRF GDDWIQKLRE LDTEQLKEFV ESTGANMSSM SALSRKLAQL EQLKFVTRKS SQSSIRQIID ALLAGKNVVV EFGQYRSELA YMLVSNILTR LIYDEWVRRT ETFLATKKSS DKPPQLMITI EEAHNFLTPS LAKQTIFGKI ARELRKYSVT LLVVDQRPSS IDNEVMSQLG SRITALLNDD RDIDAVFMGV GGSKGLKTVL ASLDSRQQAM MLGHAVPMPV VMRTRAYDKA FYEAMMQGNR RRPKPIPITD DDANDDLFGS RQ
|
| |