Gene Haur_2820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2820 
Symbol 
ID5734701 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3586898 
End bp3588799 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content48% 
IMG OID641279963 
Productglycoside hydrolase family protein 
Protein accessionYP_001545586 
Protein GI159899339 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000391085 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCAGA AACAACGTTC GTTTCGCACC CGCCTAGCCA TGATCGGCGG GCTAGTTACC 
CTACTCTTGG CTGGTCAGCC CGCTACTACC AAGCCAACCG CTGCCGCCGC TATCTGCGAG
GTCACTTACA CAATCTCGAA TCAGTGGTCT ACTGGCTTCA CTGCTAATGT GAGTGTTAAG
AATCTTGGGA TCGGTCTCAA TAATTGGCAG GTTGGCTGGA CATTCGCGGG CAATCAGGCA
ATTACCAATC TTTGGAATGG GGTGCTCACC CAAACTGGCG CTCAGGTCAG TGTGTCAAAT
CCAGCATGGG CCGCCAGTTT ACCCAGCAAT GGTACTGCCA GCTTTGGTTT CCAAGCTTCG
TATACTGGTA GTAATGCGAT TCCAAACGCA TTTACCTTGA ATGGCGTTAG TTGCAACGGC
GATCAGCCAA GCCCAATGCC TACAAATACT GCTATTCCGA GCATTCCACC TGCCACCAAC
ACCCCTAATC CGCCAACCAA TACGCCAATT GCTACCACAA CCGGAACCCC TCGCCCAACC
AATACGCCAA CCAGCGTCAT TCCAACGGTC ACGAATACAC CTCGCCCAAC CAATACTCCA
GTTCCAACCA CGGTCAATCC AACGGCTACG AGTACCCCAA CTGGTAATAA CAATAATGAT
GATTGGCTCC ACACCAATGG CAATCAAATT GTTGATAGCG CAGGTCGCCC AGTTTGGTTA
ACTGGAGTCA ATTGGTTTGG CTTCAATGCA ACTGAGCGGG TGTTTCATGG CTTGTGGTCG
GCCAATTTGA CCAGCATGAT GCAAAGCATT TCGCAACGTG GATTGAACAT TATTCGCGTA
CCAATCTCAA CTGAATTGAT TTTGGAGTGG AAAGCCGGGG TTTTCAAAAC ACCAAATGTC
AACACTTACG CCAATCCTGA ATTAGAAGGC TTAACCTCGT TGCAAATATT TGATCGCTTC
GTGATGCTTT CAAAGCAATT TGGCATTAAG GTGATGATCG ATGTGCATAG CGCCGAAGCC
GATAATTCAG GCCATTATGC GCCACTCTGG TACAAAGGTT CGTTTACCAG CGAGCAGTTT
TATCAGGCTT GGGAGTGGAT TACTGATCGT TACAAAAATG ACGATACGGT GATCGCAATG
GATATTAAGA ATGAGCCACA CGGCACGGCC CACGATAATC AAACCAGCAG TCAATTTGCC
AAATGGGATA ACTCGACCGA TATCAACAAC TGGAAATACG TTTGCGAAAC TGCTAGCAAA
CGAATTTTGG CGATTAACCC TAATGTCTTG GTGCTATGCG AAGGCAACGA GGTTTATCCA
AAGGCCGGCG CAAGCTATAC CTCAAGCAAC AAAAATGATT ACTACTTTAC CTGGTGGGGC
GGAAATTTAC GTGGCGTGCG TGATTATCCG GTCAATCTTG GCAGCAACCA AGATCAATTG
GTCTACTCGC CACACGATTA CGGCCCGTTG GTCTTCAATC AATCGTGGTT CTACCCTGGT
TTTACCAAAG AAACGCTTTA CAACGATGTT TGGTATCCTA ATTGGTTTTT TATCCATGAA
GAAAATATTG CGCCATTGTT TATTGGCGAA TGGGGTGGCT TTTTGGATGG TGGCGCAAAT
GAACAATGGA TGAAGGCGTT GCGCGATTTG ATCAAAGAGC ACTATCTACA CCATACCTTC
TGGGTACTCA ACCCCAATTC TGGCGACACT GGCGGTTTGC TCGGATACGA TTGGGCCACT
TGGGATGAGG CTAAATATGC CTTGCTCAAG CCAGCCTTGT GGGCAGATCG CAATGGTAAA
TTTGTCAGCC TCGATCATCA AATTCCGCTC GGTGGCACAG CTACTGGCAC AACCATTACC
CAATATTATC AACAGGGCAA CCAAGCTCCA AGCAATCCCT AA
 
Protein sequence
MSQKQRSFRT RLAMIGGLVT LLLAGQPATT KPTAAAAICE VTYTISNQWS TGFTANVSVK 
NLGIGLNNWQ VGWTFAGNQA ITNLWNGVLT QTGAQVSVSN PAWAASLPSN GTASFGFQAS
YTGSNAIPNA FTLNGVSCNG DQPSPMPTNT AIPSIPPATN TPNPPTNTPI ATTTGTPRPT
NTPTSVIPTV TNTPRPTNTP VPTTVNPTAT STPTGNNNND DWLHTNGNQI VDSAGRPVWL
TGVNWFGFNA TERVFHGLWS ANLTSMMQSI SQRGLNIIRV PISTELILEW KAGVFKTPNV
NTYANPELEG LTSLQIFDRF VMLSKQFGIK VMIDVHSAEA DNSGHYAPLW YKGSFTSEQF
YQAWEWITDR YKNDDTVIAM DIKNEPHGTA HDNQTSSQFA KWDNSTDINN WKYVCETASK
RILAINPNVL VLCEGNEVYP KAGASYTSSN KNDYYFTWWG GNLRGVRDYP VNLGSNQDQL
VYSPHDYGPL VFNQSWFYPG FTKETLYNDV WYPNWFFIHE ENIAPLFIGE WGGFLDGGAN
EQWMKALRDL IKEHYLHHTF WVLNPNSGDT GGLLGYDWAT WDEAKYALLK PALWADRNGK
FVSLDHQIPL GGTATGTTIT QYYQQGNQAP SNP