Gene Haur_3073 details

Gene Information

Locus tag: Haur_3073
Symbol:
ID: 5734945
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3880872
End bp: 3882305
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 45%
IMG OID: 641280217
Product: alpha amylase catalytic region
Protein accession: YP_001545839
Protein GI: 159899592
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
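
The start and end coordinates, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3880872
end_bp = 3882305
gene_length_bp = 1434
protein_length_aa = 477

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons = span_bp // 3
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # span matches Gene Length
print(span_bp % 3 == 0)                          # whole number of codons
print(expected_protein_aa == protein_length_aa)  # matches Protein Length
```

Under these assumptions the 1434 bp span corresponds to 478 codons, that is 477 amino acids plus one stop codon.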


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACCA GCGATTGGCA AACGCCCGAT TGGGTCAAAC ATGCCGTCTT TTATCAGATT 
TTCCCTGAGC GCTTTGCCAA TGGTGATCGG ACAAATGATC CGGCAAATGC GCAACCTTGG
GGTACAAGCC CAACCTTGTA TAATTATATG GGCGGCGATC TACAAGGAAT TATCGATAAG
CTTGATTATT TAGTGGATTT GGGCATTAAT GCGCTGTATC TCAACCCAAT TTTTCAAGCC
ACCACCTCAC ATAAATATAA TACCTTCGAT TATTTTAAAA TCGATCCCCA TTTTGGTACG
CTAGAAACGT TTAAAACCTT ATTGAATGAA GCGCATCGAC GTGGCATTAA AGTTATTCTC
GATGCGGTGT TTAATCATTG CGGTCGCGGC TTTTTTGCCT TTCACGATGT AATTGAAAAT
GGTGTGCACT CGCCCTACAC CAATTGGTTT CATATCTCAC GCTTTCCAAT TCATCCCTAT
GAATCGCGCT ATGCCGCTAA TTATCGCACG TGGTGGGATT TTCGCGAGTT GCCCAAATTC
AACACCGATA ATCCGGCGGT ACGCAAATAT TTGCTTGATG TAGCTCGCTA TTGGATTGAA
TTGGGTATTG ATGGTTGGCG CTTGGATGTG CCAAATGAAA TTGATGATCA TAATTTTTGG
CGTGAGTTTC GCACAATTGT CAAAGATATC AATCCTGAAG CCTATATTGT GGGCGAAATT
TGGACTGACG GCTCAGCTTG GCTGCAAGGC GATCAATTTG ATGCCGTGAT GAATTATCTA
TTTCGCGATT TATGTACCGA TTTCTTTGCT AGCTATCGGG TACGTGCCGC TGATTTTGCG
GCTGGAATTG ACCATTTAAT TGTGCGTTAT CAGCCCCAAG TGACCTATGT CCAATTTAAT
TTGCTTGGTT CACACGATAC TGCGCGGTTT TTGAGTGTGG CTGAAGAAGC TGGTAAATGG
GCTTTAGAGC GCATGAAATT GGCGGTTTTG TTCAAATTAA TCTTTCCTGG TGCGCCATGT
ATCTATTATG GCGATGAAAT TGGCTTGCAT GGCGGCAAAG ATCCTGATTG TCGGCGTTGT
TTCCCGTGGG ATCAACCGCA AACCTGGCAG ACCGATCTCC AAGCTTGGAC CAAACGCTGG
GTTAAGTTTC GCCATGAGCA TACAGCCTTG CGCACGGGCC ATTATGCGAC GCTGTTTGCC
GACAACGATA TGAATATTTT TGCTTGTGCC CGTTGGGATG AGCAAAGCCA ATTTGTGATT
GTGCTGAATA ATAACGAAAC ACCTTGGACA CTCGATTTGC CGTTGCATGC CCAATTACCA
AGCGTCACTC ATTATCGCGA TGTGCAAACT GGCGAGTTGT ATAGCGTGGC CGAGGGTAAA
ATTCGCGAGG TAGCATTGGC TCCGTGGAAG CATTTGGTAT TACAAGCTGA ATAG
 
Protein sequence
MTTSDWQTPD WVKHAVFYQI FPERFANGDR TNDPANAQPW GTSPTLYNYM GGDLQGIIDK 
LDYLVDLGIN ALYLNPIFQA TTSHKYNTFD YFKIDPHFGT LETFKTLLNE AHRRGIKVIL
DAVFNHCGRG FFAFHDVIEN GVHSPYTNWF HISRFPIHPY ESRYAANYRT WWDFRELPKF
NTDNPAVRKY LLDVARYWIE LGIDGWRLDV PNEIDDHNFW REFRTIVKDI NPEAYIVGEI
WTDGSAWLQG DQFDAVMNYL FRDLCTDFFA SYRVRAADFA AGIDHLIVRY QPQVTYVQFN
LLGSHDTARF LSVAEEAGKW ALERMKLAVL FKLIFPGAPC IYYGDEIGLH GGKDPDCRRC
FPWDQPQTWQ TDLQAWTKRW VKFRHEHTAL RTGHYATLFA DNDMNIFACA RWDEQSQFVI
VLNNNETPWT LDLPLHAQLP SVTHYRDVQT GELYSVAEGK IREVALAPWK HLVLQAE
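
As a quick consistency check between the two sequences, the first 30 bases of the gene sequence can be translated and compared with the first 10 residues of the protein sequence. This is a minimal sketch, not a full translator: the codon table is deliberately partial and holds only the nine codons that occur in this fragment, and the function name `translate_fragment` is hypothetical. For these particular codons, translation table 11 uses the same amino acid assignments as the standard genetic code.

```python
# First three 10-base blocks of the gene sequence and the first block
# of the protein sequence, copied from this record.
dna_blocks = "ATGACAACCA GCGATTGGCA AACGCCCGAT"
protein_block = "MTTSDWQTPD"

# Partial codon table: only the codons needed for this fragment.
# A complete translator would need all 64 codons.
CODONS = {
    "ATG": "M", "ACA": "T", "ACC": "T", "AGC": "S", "GAT": "D",
    "TGG": "W", "CAA": "Q", "ACG": "T", "CCC": "P",
}


def translate_fragment(blocks):
    """Translate space-separated sequence blocks, ignoring a trailing partial codon."""
    dna = blocks.replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


translated = translate_fragment(dna_blocks)
print(translated == protein_block)
```

For the full 1434 bp sequence, a complete codon table (for example from an established bioinformatics library) would be the safer choice over extending this dictionary by hand.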