Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_5286 |
Symbol | |
ID | 5737244 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009974 |
Strand | + |
Start bp | 75342 |
End bp | 76469 |
Gene Length | 1128 bp |
Protein Length | 375 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641282450 |
Product | hypothetical protein |
Protein accession | YP_001548041 |
Protein GI | 159901796 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR02391] conserved hypothetical protein TIGR02391 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000747572 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCTAACAC CACCCGATCC GCAACACCAG TATGTCGTGG CTTTATCGCG TCTTATCCAT CAAGGCAATG CCTTGTTTAA ACAGTATGAT CGACTTTCTT TTATCATTTG GAATCGTCAT CACCACGATA AAGCCGTGCA GAGCCACCGT GACTTTCAGG CGTGGGTCGA TCATGCAAAG AGCGACTTGC ACCCAGCCGA TCATGACGAA TTCTGCACGA TTCTGGCCGA AGGGGATAGT ATCAGCTGGG AGCGTGCATG GGACATTGTC ACAGGCACGA TAGAGCATGT GTGGGAGAAT CAGCAGGAAG CATATAGTAT TGGCCAACGG TGCATTATGA AGCGCTTTCT GCGGCTGCGT GATTATCTCA AAGCGCTGCG GATTCGCCTC GCCCCACCCT CGCCCTTGTC TGCTGATTTT TATCAGCAAT TTGCCGACCA ATTCCACCGG ATGAGTCCCG ATATGATTGA TGAATTGTTT ATCACGAGTG GCTGTATGAT CCAGTATTGG ATACCACCCT ATAAGCCGCA AACGAAACAA AGTGCCGATC GGGCGTATGG CTGGCTGGAC GGATTAAATC TGTATATGCC TGATTCGCTG GTCGCGATGG TCGAGACGGT CTACCAAGCC TACCGAGACT ATAAACGGTT CCCGCGTGAT CAGACCCTCG ATCCAATTCA AGGTGCGCTG CGGGATCTGA AACTGGCAGA CACGAAGCCA TCACTACTTA CCCGCTACGA GCTGCACCCG CGCGTGGTCG AGCGAGCAAC CCTGCTTTGG CAGATTGGCG AATATGACAC CGCGTTATCG CAGGTGTGTA TCGAACTTGA TAATGCGGTT AAAGCGAAAT CGGGTCTCAA GGAGGACGGC ACCACCTTAA TGCGAACAGC CTTTTCACCC AAAAAAACAC GGTTGGCCAT CGATCCCCGC TTTGGCAATC AACAAGGCTT TATGGATCTG TTTGCGGGGG TGATGGATGC CATTCGCAAC CCACGCGCCC ATCACCACAA AAGCAATTTA AGCGCGGATG AGGCTATCGA ATGGCTGGCA TTCCTGTCAG CCCTGTTTCG GGTGCTTGAT GCGACAATCA TCAATACCCC TGATGAAACC GAGGCAAGGG GTACCTAA
|
Protein sequence | MLTPPDPQHQ YVVALSRLIH QGNALFKQYD RLSFIIWNRH HHDKAVQSHR DFQAWVDHAK SDLHPADHDE FCTILAEGDS ISWERAWDIV TGTIEHVWEN QQEAYSIGQR CIMKRFLRLR DYLKALRIRL APPSPLSADF YQQFADQFHR MSPDMIDELF ITSGCMIQYW IPPYKPQTKQ SADRAYGWLD GLNLYMPDSL VAMVETVYQA YRDYKRFPRD QTLDPIQGAL RDLKLADTKP SLLTRYELHP RVVERATLLW QIGEYDTALS QVCIELDNAV KAKSGLKEDG TTLMRTAFSP KKTRLAIDPR FGNQQGFMDL FAGVMDAIRN PRAHHHKSNL SADEAIEWLA FLSALFRVLD ATIINTPDET EARGT
|
| |