Gene Information |
Locus tag | Haur_0889 |
Symbol | |
ID | 5732790 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 1015172 |
End bp | 1016947 |
Gene Length | 1776 bp |
Protein Length | 591 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 641278021 |
Product | hypothetical protein |
Protein accession | YP_001543665 |
Protein GI | 159897418 |
COG category | |
COG ID | |
TIGRFAM ID | |
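
The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `expected_protein_length` are hypothetical helpers introduced here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop.

    Assumption: exactly one terminal stop codon, which is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values copied from the Start bp and End bp fields of this record.
length = gene_length(1015172, 1016947)
protein = expected_protein_length(length)
print(length, protein)  # 1776 591
```

Under these assumptions the computed values agree with the Gene Length (1776 bp) and Protein Length (591 aa) fields.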
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTCGAT GGCTCTATCG CTTACATCTA TGGTTGGTTC TGCTGCTGTT GATTGCGGCC TGTACCCAAG TTGGCGAATC GGGCGGCAAC CAAACGCTTA GCTTACGTAC CCTAACCGGC AATGCTGCGG TTTTTGGCAC GATTGAGTTG GCGATTGATA CCACTATCAC CGTCGCCAAT CCTTACGATC CAAATCAAAT CGATCTGATG GTGAGTTTTA TCTCAGCAAC CGGCCAAATC TATCGTGTGC CAGCCTTTTG GTATCAAGAT TTTGATCAAC TTTCGCTGCA ACCCAAAGGC AACCCTGAGT GGCGGGTGCG TTTCACGCCG AGCGAACCAG GTGCATGGCA AGTCAAGGCC GAGCTAGCCA AGCCAGCGCT GAGCAGCGAC GTGATCACGA TTGAAGTTTC AGCGAATAAG CAATCGCCAG GCTTTGTACG GATCAACACC AGCAATCCGC GCTATTTCGC TCGCCAAGAT GGCACCTTCT TTATGCCAAT CGGCCTCAAT TTGGGCTGGT CAACCCAACA AGGCACGGGC ATTTTGCGCG AATATGAACA CTGGTTTGAT CAATTAAGCA AAAACGGTGG CAATATTGCG CGAATTTGGA TGGCCTCGTG GTCGTTTGGC ATCGAATGGC AAGATACCGG TTTAGGCGAT TATTCCAAAC GCATGCAACA AGCATGGATG CTTGACCAAA TTTTCAAATT GGCCGAACAG CGCAACATCA CAATTATGTT AACCCTTATC AACCATGGCG CATTTAGTAC CAGCACTGAT TCAGAGTGGG CTAGTAATCC GTATAACGCT GCGAATGGCG GGCCAATTGC CGAGCCACGC TTGTTTGCCA CCGATATTCA ATCGCGTGAA GTGTTCAAGC ATCGAGTGCG TTACATTGCG GCTCGTTGGG CACATTCGCC TAGCCTATTC GCATGGGAAT GGTGGAACGA AGCCAACTGG ACACCAATTA ATGATGCTTT GATGCAACCA TGGATCAGCG AAATGACCCG TCATTTGGCG CAGTTTGATC CCTATCAACA TTTGGTTTCA ACCAGCTATG CCAGCAATAC CAGTACCTCG ATGTGGGTAC AACCAGAGAT CAACTTCACC CAACACCACG ATTATACAGG CCGCGATTTA GGACAAGCCT TCCCCTTGGT GATCCGTGAG TTGAATGCGG CAGCACCACA AAAACCAGCC TTGGTCAGCG AACTTGGCTA TGCTGGCACT GGGCGCGACG AGGTAATCAA TCGGGATGTT TGGCAGTTTC ATCAAGGCTT GTGGGCTGCA CCATTCAGTG GCTTTGCTGG CAGCGGCATG TATTGGTGGT GGGACACCTT GGTCGATCCC GACAACTTGT GGAGCGAATA CAGCAAGTTG GCCGAATTTT TCAAAGACCA AGATCTCACG ATCTACAACC CAGTTGTGGC TCAAATTTCG CCGTTGAAGG CGCGGGCCTT AGCCTTACAA ACGAAATCGC AGGCGTTAGT CTGGGTGCGC AGCAACGAAT ATGAGCCTGA AGCATTAACC AAAGCCTATG AAGAAGCGCT CAAAAAACGT GAATTTAACG ATACATGGGA ATATGTACCG CCGACTTACG CCGATTTGAC GCTTAAGTTG AATGGGCTAG AAGCCGGAAA CTACCAAGCA ACCTGGTACG ACCCGCAAAC TGGCACATGG TCGCAACCAA CGACGGTAAC CCTTGAAGCT AACCAATCCA GTATTGCAGT TCCAAGCTTC AACTACGATT TAGCCTTGAA ATTAGTCAAG CAATAA
|
Protein sequence | MRRWLYRLHL WLVLLLLIAA CTQVGESGGN QTLSLRTLTG NAAVFGTIEL AIDTTITVAN PYDPNQIDLM VSFISATGQI YRVPAFWYQD FDQLSLQPKG NPEWRVRFTP SEPGAWQVKA ELAKPALSSD VITIEVSANK QSPGFVRINT SNPRYFARQD GTFFMPIGLN LGWSTQQGTG ILREYEHWFD QLSKNGGNIA RIWMASWSFG IEWQDTGLGD YSKRMQQAWM LDQIFKLAEQ RNITIMLTLI NHGAFSTSTD SEWASNPYNA ANGGPIAEPR LFATDIQSRE VFKHRVRYIA ARWAHSPSLF AWEWWNEANW TPINDALMQP WISEMTRHLA QFDPYQHLVS TSYASNTSTS MWVQPEINFT QHHDYTGRDL GQAFPLVIRE LNAAAPQKPA LVSELGYAGT GRDEVINRDV WQFHQGLWAA PFSGFAGSGM YWWWDTLVDP DNLWSEYSKL AEFFKDQDLT IYNPVVAQIS PLKARALALQ TKSQALVWVR SNEYEPEALT KAYEEALKKR EFNDTWEYVP PTYADLTLKL NGLEAGNYQA TWYDPQTGTW SQPTTVTLEA NQSSIAVPSF NYDLALKLVK Q
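
The sequences above are printed in space-separated blocks of ten characters, so they need normalizing before any computation. The sketch below is one possible way to do that and to compute a GC fraction and check for a terminal stop codon. It is an illustration under stated assumptions: the helper names `normalize`, `gc_fraction`, and `ends_with_stop` are hypothetical, and the stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
# Stop codons for translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = normalize(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the last three bases form a table 11 stop codon."""
    return normalize(seq)[-3:] in STOP_CODONS


# First three and last two blocks of the Gene sequence field, for illustration only.
head = "ATGCGTCGAT GGCTCTATCG CTTACATCTA"
tail = "ATTAGTCAAG CAATAA"
print(len(normalize(head)))   # 30
print(ends_with_stop(tail))   # True
```

Applied to the full Gene sequence field, `gc_fraction` gives a value that can be compared with the GC content field of the record.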