Gene Haur_3705 details

Gene Information

Locus tag: Haur_3705
Symbol:
ID: 5735569
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4658032
End bp: 4659885
Gene Length: 1854 bp
Protein Length: 617 aa
Translation table: 11
GC content: 52%
IMG OID: 641280857
Product: alpha amylase catalytic region
Protein accession: YP_001546469
Protein GI: 159900222
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
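
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the listed 1854 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the record above.
START_BP = 4658032
END_BP = 4659885


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end positions (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 1854 617
```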


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00647332
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCACTACC CAACTTGGAC AAGCGCTGTG CACCACGATG GTTCGGCGCT GTACCTTCAA 
CCAAGCCAGC CCTATCACCT TGGTCAACAA GTAACTGTTC GCTTGCGAAC TCCATTAGCT
GCGCCAATTA CCCAAGCCTT TATTCGCATC TGCCCCGATG GCGAGCAAAC GTTTGTGGCG
ATGCAACCAG CCGAACGGAC TGAAACGATT CAATGGTGGC AAGGCCAAAT CACGCTCTCG
ATGCCGCGCA CTGGCTATCG CTTTTGGCTG ATGACCGAGC AAGGCGGCTG GTGGCTTTCG
GCAGCAGGCA TGCAACGTTC AACCCCTACC GATGCGACCG ATTTTAAGTT GTTGGCTGAT
TATCATGCCC CAACCTGGGT GCATTCAGCA GTGTTCTATC AAATTTTTCC TGATCGGTTT
TGTGATGGCG AGCCAAGCAA TAATGTGGTT GATGGCGAAT ATACAGTTTA TGGCAAGCCA
ACGATTGCCC GCCAATGGGG CGAGGCTCCT CAGAAAGCCA CGGGTGGTAT CGAGTTTTTC
GGTGGCGATT TACAGGGTAT TAGCCAAAAG CTTGATTATC TTGCGCAGCT AGGGATTAAT
GCGCTGTATC TCACGCCGAT TTTTACTGCT CCCTCAAACC ACAAATACGA TACCGCCGAT
TATCTGCAAA TCGATCAGCA TTTTGGCGGT GAGGCGGCTT TGGCTGAATT GCGCCAAGCA
ACCCAACGCT ACCAGATGAA ATTGATGTTG GATATTGTGC TCAATCACTG TGGCTACACT
CATCATTGGT TTACGGCGGC TCAAGCCGAT ACCAACGCGC CTACCGCTGA TTATTTTTCG
TGGAAGCAAC ATCCCAACGA GTATGAATCA TGGTTAGGTC ATCGCTCATT GCCCAAACTC
AATTACACCA GCCATGGCTT GCGCCAAGCA ATTTATGGCA GCGAACAGGC GATTGTGCGC
CATTGGTTGC GTCAACCCTA TGCGATCGAT GGCTGGCGAA TCGATGTAGC CAATATGTTG
GCCCGCCAAG GTTCGAGCCA GTTAGGGCAT AAAATTGGCC GCGCACTGCG CCGCGCCGTA
AAAGCCGAAT CACCTGAAGC CTATTTACTT GGCGAACATT TCTATGATGG CACGAATCAC
CTTCAAGGCG ATGAACTTGA TGCCAGCATG AACTATCGTG GCTTTACCTT CCCGACCTTG
CAATGGCTAG TTGGCTTCGA TATGGCCTCG GTGTGGAACC TGGTTTGGGA AGATCGGGCC
TTATTGCCGA CTGAAGCCTT GGGCGAGCAA TGGCTGGCCT TTTTGGCCGT GATTCCATGG
CAAGTCGCTT TGCAACAATT CAATTTGCTC GATTCGCACG ATACGCCACG TTTGTTGACG
ATTGTTGGTG GTGATCTGTC ATTACATCAC GTCGCAGTTA CCCTGCAAAT GACCTTCCCG
GGCGTGCCCT GCATCTATTA TGGCGATGAA GTCGGCATGC AAGGCGGCGG CGATCCCGAG
TGTCGCGGCT GTATGCCATG GGATGCACAA GTTTGGGATC ACGATCTGCT AGCTTTTTAT
CGTTCGCTGA TTGGATTACG CCGTAGTTCG AGCGCTTTGA GTGTTGGCGG ATTTCAATTA
TTGCTGGCCG AAGGCGATAC GGTGGCCTTT ATGCGGCGCA GCGCTGATGA ATGTTTGTTG
ATCGTTGCCC AGCGGGCTGC TACCAGCATT CCCCCAATTC CAATGTTCGC AACCGGATTG
ACTGATGGTA CAAGCTTTAT CAAAGTCGCT GGCACAACGA AAATTACGAT TCAGGCTGGG
GTACTAGTTT TGCCACAAAC TGGCATTAGC GCCAGCATCT GGCAAATGCA GTAG
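
The sequence above is laid out in blocks of 10 bases, so it needs to be cleaned before any calculation such as GC content. The following is a minimal sketch of one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the database may compute its GC figure differently.

```python
def clean_sequence(blocked: str) -> str:
    """Drop the spaces and line breaks used to lay the sequence out in blocks."""
    return "".join(blocked.split()).upper()


def gc_fraction(blocked: str) -> float:
    """Fraction of bases that are G or C; 0.0 for an empty sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as example input.
first_line = "ATGCACTACC CAACTTGGAC AAGCGCTGTG CACCACGATG GTTCGGCGCT GTACCTTCAA"
print(len(clean_sequence(first_line)), gc_fraction(first_line))
```

Passing the full gene sequence as one multi-line string works the same way, since `str.split()` with no arguments splits on any whitespace.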
 
Protein sequence
MHYPTWTSAV HHDGSALYLQ PSQPYHLGQQ VTVRLRTPLA APITQAFIRI CPDGEQTFVA 
MQPAERTETI QWWQGQITLS MPRTGYRFWL MTEQGGWWLS AAGMQRSTPT DATDFKLLAD
YHAPTWVHSA VFYQIFPDRF CDGEPSNNVV DGEYTVYGKP TIARQWGEAP QKATGGIEFF
GGDLQGISQK LDYLAQLGIN ALYLTPIFTA PSNHKYDTAD YLQIDQHFGG EAALAELRQA
TQRYQMKLML DIVLNHCGYT HHWFTAAQAD TNAPTADYFS WKQHPNEYES WLGHRSLPKL
NYTSHGLRQA IYGSEQAIVR HWLRQPYAID GWRIDVANML ARQGSSQLGH KIGRALRRAV
KAESPEAYLL GEHFYDGTNH LQGDELDASM NYRGFTFPTL QWLVGFDMAS VWNLVWEDRA
LLPTEALGEQ WLAFLAVIPW QVALQQFNLL DSHDTPRLLT IVGGDLSLHH VAVTLQMTFP
GVPCIYYGDE VGMQGGGDPE CRGCMPWDAQ VWDHDLLAFY RSLIGLRRSS SALSVGGFQL
LLAEGDTVAF MRRSADECLL IVAQRAATSI PPIPMFATGL TDGTSFIKVA GTTKITIQAG
VLVLPQTGIS ASIWQMQ
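
The protein sequence above corresponds to a translation of the gene sequence under translation table 11. The sketch below shows one way to reproduce that translation. It relies on table 11 using the same codon assignments as the standard code and differing only in which start codons are permitted. Alternative start codons are not handled here, which is acceptable for this gene because it begins with ATG. `translate_cds` is a hypothetical helper, not a library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(blocked_dna: str) -> str:
    """Translate a complete, in-frame CDS; a trailing stop codon is dropped."""
    seq = "".join(blocked_dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    # Codons containing ambiguous bases (for example N) become X.
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence above.
print(translate_cds("ATGCACTACC CAACTTGGAC AAGCGCTGTG"))  # MHYPTWTSAV
```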