Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3634 |
Symbol | |
ID | 5735495 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4568530 |
End bp | 4570497 |
Gene Length | 1968 bp |
Protein Length | 655 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280783 |
Product | hypothetical protein |
Protein accession | YP_001546398 |
Protein GI | 159900151 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACAACTG GCAAAACTGA ACTGATCACG ATTGCCCTTG CTGATCCAAT TGATAGCATT ATTCAGCAAG TGCGCAATGC CAAAGCAGAG CATGTTGATA TTTTCATGCC CGAAGGCTCG ATTTCGTTAC AAAGCCGTAA AACCTGTGAT CGCTTGCGCG AAACTGCCAA TCGCGAGGGC ATTGAGTTAA CGCTCTATAC CAGCGATCCC AAGGTGGTTA AGGCGGCTTT GACTTGCCAG ATGATGGTGG TCGAAATTGA GCCAGCCGTT GTTTCAACCA AAGCGCCTGC TGCTCCGCCA GCTCCAGCTA AACCAGTTGC ACCAGCTAGC CCTGAAGAAG ATTTTTTGGC TTCGCTGGAA GGGTTGCCCT CTGCTTCCAC CACCGATCGG TTGCAAACCA GCCCCAGCAT TAGCAAGCCG ATTCCAACGG CAGCCCCCAG TCAACCAGCC CGCAGCGAAA TCGACGATTG GGCTGATGCC CTTGATAGCC TTTCAGTGGC TTCGACTGCT GGCGAAACTG GCTTGCGCCA ACATGAACAA GCGCGTGGCA ACGATAGCTG GGATTTTGAT TCATTCAACG ATCTTTCTGA TGCGCTGAGT GGCGATACGG CCAGCGCTGC GCCAGCTCGC CCACGGATTC GCCCCGAGGA TATTGAATTA ACTAGCGATG ATATGCATCG CCAAGATGCC AAAGCTCGCC GTGCTCGGGC CGAGCAAAAA CGCGCTGACG AAAAGGCTGA ACTCAAACAA CAAACCGCGC CCAAACGCAA TTATCCCTTG TGGGGTATGG TTATTCTAGG GGCGATTGCA ATTATCGCGT TGTTATGGCT GCTGTTTGGC AATAAATTGG CCAGCGGCGT AACCGTGGTG GTGCGACCAG CTCCAACCTC TGGCGGCCAA ACCTACGAAG ATGTGCGGCT CAACTATCAA GCCGACCCAA TTAGCGAACC AAGCTCGGCG GCAATTCAAG GGCGCTTGAT CAACGTGCCA ATTTCGGTGA GCGTGCGTGG TACGGTCATT ACGGCAACCG AACAACCTGA TCAAGCGGCG ACTGGCATCC TCGAAGTTTA TAACCGCAAC ACCCAGGCCT ACCCAATTCC TGCTAATACG CGGGTTCGAG TGCTCAACTC GGCTGGCGAA GAAATTATCT TTTTGACCCA AGCCGAAACG ACGATTCCTG CCGCAACAGG CAACTTTGCC GGAATTGTCA GCGGGGTTGG TTCATTGCAA ATCGTGGCCA GTGCTGGTGG TACGCGCTAC AACATGCCTG CTAGCCAAGA TCCGGTTTGG ACGATTGAAG GCTATGAAGG TGCACTCTTT TCGATCAATC CTGAGCCAAT TCAAGGCGGC ACAACTGCGC TGCTAAAATT GCCAGTCGAA AGCGATTGGA TGCCGTTACT GCCCCAAGCA GTAGCCCAAT TCCGCAGTGC GATTCCGGCT CAAATGCAAA CCGTGCTACA AGAAGGCGAA GTCTTGGCTG ATGTTGAATT TTTGCCGAAT GTCGATGCCT TGACTCAAGA CCCTTCGCTT TACGATATTC AAACCCGACC AGTGCCCGAA ACCGATGGCG GTTTTGAGTT GGTCGTAACA GCGAATTTCC AAGGCTTGGC GGTGCGTGGC AGTTTTGCCG AGCAGTTGAA TCGAGCTTTG CCGAATGCCT TGCGGATCAA AGACCCATCA TTCAGTACCG ATACCACCAG CATTGTGCGT AGCCAAGTGC GCTTGGATGA TTCAGGTGCT AGTTTGTTGC TGGCAACGGT CGAAGTTGCT CCCAAAACCG CAGTTGGTTT ACCCGATAGC ACTAAACTGA AAATCGCTGA TAGCATCAAA GGCCTAACCC CAGCCGAAGC CTTGGTGCAG TTGGAAGCTT TGCGCGAAAG TGGCTTGATT GGCGATATTG TGTCAGTGCC CGAAGTTGAG CGCTTGCCCG TTGATCCAGC CGAAATTAAT ATTCAGGTGC AGGAATGA
|
Protein sequence | MTTGKTELIT IALADPIDSI IQQVRNAKAE HVDIFMPEGS ISLQSRKTCD RLRETANREG IELTLYTSDP KVVKAALTCQ MMVVEIEPAV VSTKAPAAPP APAKPVAPAS PEEDFLASLE GLPSASTTDR LQTSPSISKP IPTAAPSQPA RSEIDDWADA LDSLSVASTA GETGLRQHEQ ARGNDSWDFD SFNDLSDALS GDTASAAPAR PRIRPEDIEL TSDDMHRQDA KARRARAEQK RADEKAELKQ QTAPKRNYPL WGMVILGAIA IIALLWLLFG NKLASGVTVV VRPAPTSGGQ TYEDVRLNYQ ADPISEPSSA AIQGRLINVP ISVSVRGTVI TATEQPDQAA TGILEVYNRN TQAYPIPANT RVRVLNSAGE EIIFLTQAET TIPAATGNFA GIVSGVGSLQ IVASAGGTRY NMPASQDPVW TIEGYEGALF SINPEPIQGG TTALLKLPVE SDWMPLLPQA VAQFRSAIPA QMQTVLQEGE VLADVEFLPN VDALTQDPSL YDIQTRPVPE TDGGFELVVT ANFQGLAVRG SFAEQLNRAL PNALRIKDPS FSTDTTSIVR SQVRLDDSGA SLLLATVEVA PKTAVGLPDS TKLKIADSIK GLTPAEALVQ LEALRESGLI GDIVSVPEVE RLPVDPAEIN IQVQE
|
| |