Gene Haur_1418 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1418 
Symbol 
ID5733326 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1635363 
End bp1638146 
Gene Length2784 bp 
Protein Length927 aa 
Translation table11 
GC content50% 
IMG OID641278556 
Productcellulose 1,4-beta-cellobiosidase 
Protein accessionYP_001544190 
Protein GI159897943 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCTG ACCACAGCCG TACCGTGATT GGTATCCGTT CGACGAGGAG CTTTTGGGCA 
CGAGGATTTT TCGTCTTTTG GTTGATTATT GCTTTAGGCT GCCAACAAGC GATTGTTCCA
GCCACCTCTC AGATTTATGC CCAACCCCCG CAAACTGTTG GTAATTTGCT GCAAAATGGC
GATTTCAGCG CTGGTTTTGC GCCATGGTGG GCCACAAGCT CGGTTGCTAC TGATACTGCG
AGTGGCGCAT TAGTCGCAAC CATCAATAAT GCTGGTAGTA ATCCATGGGA TGCAATCGTT
GGCCAAAATG GAGTTACGCT TCAATCAGGC CAAACCTACA CCGTCACGTT TCGCATCCGT
GCCTCAACCA CTGGTACTGT GGTGATGAAA CTGCAAAAAG AGGCCTCTCC CTATACCAAC
TATTTTAGCC AAGACGTGGC TTTGACTACT AGCGATCAAG CCCATCAATT TGTCTTTACC
TCAGGCTTTG ATGATGCTGG GGCTGCTTTT CAATTTCAAA TGGGCGGCCA AGGCACGAAT
GTACTGACAA TTGACGATGT GCAAGTGTTG GGCGAAACTG GCCCGGTCGA GCCATCGGGC
AATTTGGTGC AAAATGGCGA GTTTGTTGGT GGCCTACCGC CTTGGTGGAC TGGCGGCGAT
GTCAGTGTGA ACACCGATGA TGGCGCGTGT TTGACGATCA ACACTCCTGG CACAAATCCT
TGGGATGTGC AGCTTGGTCA ACATACCATT GCGATTGAAG CTGGTGTGAG CTATCAGCTT
AAATTTGCTG CCAAATCAAC TGTGCCAGTC ACTTTGCCCG TGCGTTTGCA AAAAAATGCC
GAGCCATACA CTGGCTATTT TTCGGCTGAT CCAGTGCTCA ACTCGGCCTG GACAGAATTT
GTCTATAATT TTACATCGGC CTATAGCGAT GCAGCTTCGT TGTTGTTCCA AATGGGTGGC
ATCGGAACCC CGACGATCTG TATTGATCAG GTGAGTTTGT ATATGTTAGA AACCGGAATT
CGGGTCAACC AAGCCAGCTA TTTGCCCACG ATGAGCAAAG TGGCAAGTTT GGTGCATCCT
GCGACTGAGC CATTGGCTTG GCAATTGCAT AACACTGCTG ATGCAGTGGT GGCAAGCGGC
CAAACCACGG TGTATGGCGA AGAATCAGCC TCGGCTGAAC ATGTGCATTT GATCGATTTT
TCAAGCTACC AAACGGTTGG TGAAGGCTAT TATCTGAGCA TCGGCAGCGA AACGAGCTAT
CCATTTGCCA TCGAGGCAGG TTTGTATTCG CGGATGAAAT ATGATGCTTT GGCCTATTTC
TACCATAATC GCAGCGGGAT TGCGATTACC ATGCCCTATG CTGGTGGCGA GCAATGGACG
CGACCAGCTG GCCACATCGG GGTTGCACCC AACCAAGGCG ATACCTCGGT AACCTGTTTT
ACTGGCACTG ACACCCAAGG TCAAAGCTGG CCCGGGTGCG ATTATCAACT TGACGTTTCA
AAGGGTTGGT ACGACGCTGG CGACCATGGC AAATATGTGG TTAACAGTGG TATTTCGGTC
TGGACGTTGC TTAATCAATA TGAACGAGCG CAGCAACGTG GCCCTGCCAG CCTAGCCCAA
TTTGCCGATG GCACGATGAA TATTCCCGAA AATAACAATG GCGTGCCCGA TTTGTTGGAT
GAAGTGCGCT GGAATATGGA GTTTATGCTA GGAATGCAAA TTCCTACGAC TGCGCCAGTT
TCCAAAACGG GTATGGTGCA CCATAAAGTT CACGATGCGA ACTGGACTGG TTTGCCAATG
GCTCCACACG AAGATAGCCA AATGCGCTAT TTGTATCCAC CAAGTACCGC CGCAACCTTG
AACTTGGCTG CGACCGCTGC CCAATGTTCA CGGATTTGGC GTGAGATCGA TGAAGCCTTT
GCTGATCGAT GTTTGGTGGC GGCTGAACGC GCTTGGCAAG CAGCCTTGGC CTACCCTAAC
GAAATTGCCC GCGATAACTT CAATGGCGGT GGTGGCTATG GCGATAGCAC CTTGAACGAT
GAATGGTATT GGGCCGCCGC CGAATTGTAT ATCACAACTG GCTCGGCCAC CTATCGCGAA
GCAATTGAAG AATCGAGCTA TTACTTGCGG CTTGATCTTG GTGGTGGAAG CGCCATGAAC
TGGGGCGGCG TGGCAAGCCT TGGCACCTTG TCGTTGGCCT TGGTTCCCAG TGATTTGAGT
AGCGCGAATC GCCAAACTGC CCGTGCAGCA GTGATTGCAG CGGCTGATCA ATTTGTGGCA
GCGCAGCAAT CCAGCGGTTA TGGCATTCCA TACAATCCTG GTGCGCAGTA TCCTTGGGGT
TCCAACTCAT CAATTCTGAA TAATATGATT GTAATGGGTT TGGCAGGCGA CTTTACTGGA
AATGCCAACT ATGCCGATGC AATTAGCCAA GGCATGGATT ACTTGCTGGG GCGTAATCCA
CTCAATCGCT CGTATATTTC GGGCTATGGC TCGGTCTCAT TGACCAATCC ACATCATCGC
TTCTGGGCTA AGCAAATTAA TCCAGAGTAT CCTGGTACGC CGCCAGGTGT GGTGGCTGGG
GGGCCAAATT CAAGCATTCA AGACCCTTAT GCTCAAGTTG AATTAGCTGG TTGTGCGGCC
TTGAAATGCT ATGTTGATCA TATCGATTCG TGGTCAACCA ATGAAGTGAC GATTAACTGG
AACTCGCCGC TGGCATGGGT TGCGGCCTAT CTTGATGATT ATGATACAAG TTCGGTGCAG
TATCTGCCGT TGATCAGTAA GTAA
 
Protein sequence
MSADHSRTVI GIRSTRSFWA RGFFVFWLII ALGCQQAIVP ATSQIYAQPP QTVGNLLQNG 
DFSAGFAPWW ATSSVATDTA SGALVATINN AGSNPWDAIV GQNGVTLQSG QTYTVTFRIR
ASTTGTVVMK LQKEASPYTN YFSQDVALTT SDQAHQFVFT SGFDDAGAAF QFQMGGQGTN
VLTIDDVQVL GETGPVEPSG NLVQNGEFVG GLPPWWTGGD VSVNTDDGAC LTINTPGTNP
WDVQLGQHTI AIEAGVSYQL KFAAKSTVPV TLPVRLQKNA EPYTGYFSAD PVLNSAWTEF
VYNFTSAYSD AASLLFQMGG IGTPTICIDQ VSLYMLETGI RVNQASYLPT MSKVASLVHP
ATEPLAWQLH NTADAVVASG QTTVYGEESA SAEHVHLIDF SSYQTVGEGY YLSIGSETSY
PFAIEAGLYS RMKYDALAYF YHNRSGIAIT MPYAGGEQWT RPAGHIGVAP NQGDTSVTCF
TGTDTQGQSW PGCDYQLDVS KGWYDAGDHG KYVVNSGISV WTLLNQYERA QQRGPASLAQ
FADGTMNIPE NNNGVPDLLD EVRWNMEFML GMQIPTTAPV SKTGMVHHKV HDANWTGLPM
APHEDSQMRY LYPPSTAATL NLAATAAQCS RIWREIDEAF ADRCLVAAER AWQAALAYPN
EIARDNFNGG GGYGDSTLND EWYWAAAELY ITTGSATYRE AIEESSYYLR LDLGGGSAMN
WGGVASLGTL SLALVPSDLS SANRQTARAA VIAAADQFVA AQQSSGYGIP YNPGAQYPWG
SNSSILNNMI VMGLAGDFTG NANYADAISQ GMDYLLGRNP LNRSYISGYG SVSLTNPHHR
FWAKQINPEY PGTPPGVVAG GPNSSIQDPY AQVELAGCAA LKCYVDHIDS WSTNEVTINW
NSPLAWVAAY LDDYDTSSVQ YLPLISK