Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_3868 |
Symbol | |
ID | 5735717 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 4860648 |
End bp | 4862906 |
Gene Length | 2259 bp |
Protein Length | 752 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641281019 |
Product | alpha amylase catalytic region |
Protein accession | YP_001546630 |
Protein GI | 159900383 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0366] Glycosidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTCGCA ATTGCCCTAT CAGTGCGAGA TGTTGTATGT ATTACGAATA TCAGCATGAG CAAAATTTGG TTGTGAGCAC TGCCGAAATG GAGCTTGGGT TTTCGACCTC AACAGGCATG CTCACCTTGC TGCGTCGCCC TAACCAAGCC AACATTTTGA ATATCGGTAG CATTGGCATG TCGGTCGATG TGCAATTAGC CGATGGTTGG GTCAGCGAAA CTCATTTTCC CCGCTATTTA AGCCATCATG GCCGTGAAGT TGACGGCGTA ACTGAATTGA CGATCAACAT TGGCTTGGGC TTTTTGCGCA TCAGCGATAG CTTGCAAATT ACTGGAACCT TGATTCGGCG TTCGGTCGCT GTGACCAACA GTGGCGGCAA CGAGGCGCGG GTACATTGGG TGCGGTTGAG TTGGCCCTAT GCGCGGGTTG GCAGTTATAG CGAAGCTCGG TTTGAAGCAC CTAGCAATAG TTTTCGGCCA CGGCTGGATA TTGCCGCAGT TTCAAAATTG CGGCGTGGTA CATTGCCGAC GCAAACGATT GCCCCAGCCA TTCGCCGTGG ACGCTTATTT GAAAATGCAC CAGATCGTGG ACCCGGGTTA TTAGCCTTGC ACAGCACCAG CGAGAGCGAT AATTTGCTGT GTTGGTATTG GAGTAAAAGC CAATCGGCTT GGCCCGATAT TGATGGCAAC GATTTGGCAT TAACGGTCGG CCACGAGCTT GAAATTGCTG GCTGGCTCGC GCCCGATGCC ACGCTGAGCG GCGGCACGCA ATATTGTATG TTGGTGCATG GCAATTGGTA CGATGCGATG CATGCCTTTC ATAACACCTG GCCTGTGCTG GGAGTGCAGA CCTTGCCCGA TGTGCCCGAT TGGGTTTGCG CCGCCAATAT TTATGAAACG CACGTTGGTT TATGGGGTGG CTTTGCCAAA TTTAGCCAAG AGCTAACCCG TTTGCGCGAT TTGGGCTTCG ATACGATTAA TCTCATGCCA ATTTGGCGCT ATCACAATCT TTCGGACCAG CCGTGGGATA TGAACTGGCA GGCTTCTGGT TCGCCCTACG CGATCGAAGA TTTTGAGCAG CTGGAGCCTA GCTTGGGCAC TGCCGAAGAA TTTAAGGTCT TGGTTGAGCA AGCGCACGCC TTGGGCATGC GAATTTTATG CGATCTGGTG GTGCAAGGTT GCTCGCGCAC TGCCCGCTAT GTGCAAGAGC GGCCAGGCTG GTTTTGTCGC GATGAGCGGG GGCGCTTGGT TTCATCGCAC GGTTGGAACG ATACCTACAG CTTTGATTGG GCGAATCCTG AAGTGCAAGA TTTCTATGTT GATTGGACGA CTCGTTTTGC CCAAACCTAT CAGATTGATG GCTGGCGAGT TGATGCCCCA CATCGCAAGG AGCCAAATTG GGATCGGCGC TTGGAGCGGA TGGCTGCTAG CACCTCATTT GGCGTATTGA CGATTGTTGA GCGCATGCGC CAAGCCTTGC GCCAAATCAA CCCGCAAGCA GCATTATTGT GTGAATTGTA TGGCCCGTTG TTTCCAATTA ATCACGATTT TGCCTACGAT TATCTGGCGC ATTTGATGTT TTTCCACGCT GGCTTGGGCG TGCTCTCGCC CTACGAATTG GGCGAATGGC TCGAAGATCA CTTTTTGGCT TTGCCCAAGG GAGCAATTCG AGTTTGCTTT ACTGAAACCC ACGATACCCG CGATGTCAAC CCGATTGCCG ATGCCGTGCG AGGTTCGCGT TTGGCGCGGT TGCTGCTGAC TGGCATGGTT GGCTGTGGCT TTGTGCCAAT GCTTTGGACG GGACAGGAAG TGGGACAGGA AGCCTGGCTC AAACAATTAT TCAGCATTCG TGCCAACTAC CCAATTTTGC GTTATGGCAA ACAACTGTTT AACGTCATGC CCTGCGATAT GCCCTCAGTT TGGAGCGTGC TACGGGTTTG GCACGAAGAA CGCTTGGCGG TGGTGCTGAA TATGGGGCCA CATCGGCGCA CTGCCACCCT GAGCATGCCC GTTGATCGTA TGCACATGGT CGAAGGTGAC TATCATTTGT TTGATTTAGT GCGCGGCCAA GCAGTCGAAT ACGCTGGGCG CAACACTTGG CGACGTGATG ATTTGTTGAA TTTGACCTTG ATTTTAGAGC CATTCGATAG TCTGCTGCTG CATATTCGAG CTGGTACGCC GCCCCAATCA GAGCCTGCCA AGGCTGAGCC AGTTGCCGCC GCTGCACCAG CAACCACGAG CCGACGACGG AATCGATAA
|
Protein sequence | MGRNCPISAR CCMYYEYQHE QNLVVSTAEM ELGFSTSTGM LTLLRRPNQA NILNIGSIGM SVDVQLADGW VSETHFPRYL SHHGREVDGV TELTINIGLG FLRISDSLQI TGTLIRRSVA VTNSGGNEAR VHWVRLSWPY ARVGSYSEAR FEAPSNSFRP RLDIAAVSKL RRGTLPTQTI APAIRRGRLF ENAPDRGPGL LALHSTSESD NLLCWYWSKS QSAWPDIDGN DLALTVGHEL EIAGWLAPDA TLSGGTQYCM LVHGNWYDAM HAFHNTWPVL GVQTLPDVPD WVCAANIYET HVGLWGGFAK FSQELTRLRD LGFDTINLMP IWRYHNLSDQ PWDMNWQASG SPYAIEDFEQ LEPSLGTAEE FKVLVEQAHA LGMRILCDLV VQGCSRTARY VQERPGWFCR DERGRLVSSH GWNDTYSFDW ANPEVQDFYV DWTTRFAQTY QIDGWRVDAP HRKEPNWDRR LERMAASTSF GVLTIVERMR QALRQINPQA ALLCELYGPL FPINHDFAYD YLAHLMFFHA GLGVLSPYEL GEWLEDHFLA LPKGAIRVCF TETHDTRDVN PIADAVRGSR LARLLLTGMV GCGFVPMLWT GQEVGQEAWL KQLFSIRANY PILRYGKQLF NVMPCDMPSV WSVLRVWHEE RLAVVLNMGP HRRTATLSMP VDRMHMVEGD YHLFDLVRGQ AVEYAGRNTW RRDDLLNLTL ILEPFDSLLL HIRAGTPPQS EPAKAEPVAA AAPATTSRRR NR
|
| |