Gene Haur_3868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3868 
Symbol 
ID5735717 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4860648 
End bp4862906 
Gene Length2259 bp 
Protein Length752 aa 
Translation table11 
GC content53% 
IMG OID641281019 
Productalpha amylase catalytic region 
Protein accessionYP_001546630 
Protein GI159900383 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGTCGCA ATTGCCCTAT CAGTGCGAGA TGTTGTATGT ATTACGAATA TCAGCATGAG 
CAAAATTTGG TTGTGAGCAC TGCCGAAATG GAGCTTGGGT TTTCGACCTC AACAGGCATG
CTCACCTTGC TGCGTCGCCC TAACCAAGCC AACATTTTGA ATATCGGTAG CATTGGCATG
TCGGTCGATG TGCAATTAGC CGATGGTTGG GTCAGCGAAA CTCATTTTCC CCGCTATTTA
AGCCATCATG GCCGTGAAGT TGACGGCGTA ACTGAATTGA CGATCAACAT TGGCTTGGGC
TTTTTGCGCA TCAGCGATAG CTTGCAAATT ACTGGAACCT TGATTCGGCG TTCGGTCGCT
GTGACCAACA GTGGCGGCAA CGAGGCGCGG GTACATTGGG TGCGGTTGAG TTGGCCCTAT
GCGCGGGTTG GCAGTTATAG CGAAGCTCGG TTTGAAGCAC CTAGCAATAG TTTTCGGCCA
CGGCTGGATA TTGCCGCAGT TTCAAAATTG CGGCGTGGTA CATTGCCGAC GCAAACGATT
GCCCCAGCCA TTCGCCGTGG ACGCTTATTT GAAAATGCAC CAGATCGTGG ACCCGGGTTA
TTAGCCTTGC ACAGCACCAG CGAGAGCGAT AATTTGCTGT GTTGGTATTG GAGTAAAAGC
CAATCGGCTT GGCCCGATAT TGATGGCAAC GATTTGGCAT TAACGGTCGG CCACGAGCTT
GAAATTGCTG GCTGGCTCGC GCCCGATGCC ACGCTGAGCG GCGGCACGCA ATATTGTATG
TTGGTGCATG GCAATTGGTA CGATGCGATG CATGCCTTTC ATAACACCTG GCCTGTGCTG
GGAGTGCAGA CCTTGCCCGA TGTGCCCGAT TGGGTTTGCG CCGCCAATAT TTATGAAACG
CACGTTGGTT TATGGGGTGG CTTTGCCAAA TTTAGCCAAG AGCTAACCCG TTTGCGCGAT
TTGGGCTTCG ATACGATTAA TCTCATGCCA ATTTGGCGCT ATCACAATCT TTCGGACCAG
CCGTGGGATA TGAACTGGCA GGCTTCTGGT TCGCCCTACG CGATCGAAGA TTTTGAGCAG
CTGGAGCCTA GCTTGGGCAC TGCCGAAGAA TTTAAGGTCT TGGTTGAGCA AGCGCACGCC
TTGGGCATGC GAATTTTATG CGATCTGGTG GTGCAAGGTT GCTCGCGCAC TGCCCGCTAT
GTGCAAGAGC GGCCAGGCTG GTTTTGTCGC GATGAGCGGG GGCGCTTGGT TTCATCGCAC
GGTTGGAACG ATACCTACAG CTTTGATTGG GCGAATCCTG AAGTGCAAGA TTTCTATGTT
GATTGGACGA CTCGTTTTGC CCAAACCTAT CAGATTGATG GCTGGCGAGT TGATGCCCCA
CATCGCAAGG AGCCAAATTG GGATCGGCGC TTGGAGCGGA TGGCTGCTAG CACCTCATTT
GGCGTATTGA CGATTGTTGA GCGCATGCGC CAAGCCTTGC GCCAAATCAA CCCGCAAGCA
GCATTATTGT GTGAATTGTA TGGCCCGTTG TTTCCAATTA ATCACGATTT TGCCTACGAT
TATCTGGCGC ATTTGATGTT TTTCCACGCT GGCTTGGGCG TGCTCTCGCC CTACGAATTG
GGCGAATGGC TCGAAGATCA CTTTTTGGCT TTGCCCAAGG GAGCAATTCG AGTTTGCTTT
ACTGAAACCC ACGATACCCG CGATGTCAAC CCGATTGCCG ATGCCGTGCG AGGTTCGCGT
TTGGCGCGGT TGCTGCTGAC TGGCATGGTT GGCTGTGGCT TTGTGCCAAT GCTTTGGACG
GGACAGGAAG TGGGACAGGA AGCCTGGCTC AAACAATTAT TCAGCATTCG TGCCAACTAC
CCAATTTTGC GTTATGGCAA ACAACTGTTT AACGTCATGC CCTGCGATAT GCCCTCAGTT
TGGAGCGTGC TACGGGTTTG GCACGAAGAA CGCTTGGCGG TGGTGCTGAA TATGGGGCCA
CATCGGCGCA CTGCCACCCT GAGCATGCCC GTTGATCGTA TGCACATGGT CGAAGGTGAC
TATCATTTGT TTGATTTAGT GCGCGGCCAA GCAGTCGAAT ACGCTGGGCG CAACACTTGG
CGACGTGATG ATTTGTTGAA TTTGACCTTG ATTTTAGAGC CATTCGATAG TCTGCTGCTG
CATATTCGAG CTGGTACGCC GCCCCAATCA GAGCCTGCCA AGGCTGAGCC AGTTGCCGCC
GCTGCACCAG CAACCACGAG CCGACGACGG AATCGATAA
 
Protein sequence
MGRNCPISAR CCMYYEYQHE QNLVVSTAEM ELGFSTSTGM LTLLRRPNQA NILNIGSIGM 
SVDVQLADGW VSETHFPRYL SHHGREVDGV TELTINIGLG FLRISDSLQI TGTLIRRSVA
VTNSGGNEAR VHWVRLSWPY ARVGSYSEAR FEAPSNSFRP RLDIAAVSKL RRGTLPTQTI
APAIRRGRLF ENAPDRGPGL LALHSTSESD NLLCWYWSKS QSAWPDIDGN DLALTVGHEL
EIAGWLAPDA TLSGGTQYCM LVHGNWYDAM HAFHNTWPVL GVQTLPDVPD WVCAANIYET
HVGLWGGFAK FSQELTRLRD LGFDTINLMP IWRYHNLSDQ PWDMNWQASG SPYAIEDFEQ
LEPSLGTAEE FKVLVEQAHA LGMRILCDLV VQGCSRTARY VQERPGWFCR DERGRLVSSH
GWNDTYSFDW ANPEVQDFYV DWTTRFAQTY QIDGWRVDAP HRKEPNWDRR LERMAASTSF
GVLTIVERMR QALRQINPQA ALLCELYGPL FPINHDFAYD YLAHLMFFHA GLGVLSPYEL
GEWLEDHFLA LPKGAIRVCF TETHDTRDVN PIADAVRGSR LARLLLTGMV GCGFVPMLWT
GQEVGQEAWL KQLFSIRANY PILRYGKQLF NVMPCDMPSV WSVLRVWHEE RLAVVLNMGP
HRRTATLSMP VDRMHMVEGD YHLFDLVRGQ AVEYAGRNTW RRDDLLNLTL ILEPFDSLLL
HIRAGTPPQS EPAKAEPVAA AAPATTSRRR NR