Gene Haur_3462 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3462 
Symbol 
ID5735323 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4356928 
End bp4359249 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content53% 
IMG OID641280609 
Productcellulose-binding family II protein 
Protein accessionYP_001546226 
Protein GI159899979 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0281342 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACTATC GCTGGCGAAT ATGCCTGCTA TTAATTGCAA CAATACTGAG TACATTTAGT 
TACACCAATA CCAATGCCCA AAGTGGCCTA AGTTGCCAAG TTAATTATGC GCTTACCAAC
CAATGGGGCG GCGGGTTTCA AGCCGATGTG GTGGTGCGCA ACACCGGAAC CAGCGCGATC
AACGGCTGGA CGGTTGCTTG GAGTGCCGCC AGCGGCCAGC AAATTGGCCA AATGTGGAAT
GCAACCTTTA CCCAAAGTGG CACGCAAGTT AGTGCCAAAA ATGTTGATTG GAATGCCAAC
ATCGCCGCTG GTGGCAGCCA AAGCTTTGGC TTTACCGCTA CCACGACTGG TAGTTTGGCC
GTGCCTAGCA GTTTTACGGT CAATGGTGTT GTGTGTGGCG GTAGCGTTAG CCCTACGGCA
ACCACTCCAG CAGCAACGGC AACGCGCACG CCAACCAGTG TGCCAACCGC AACCCGTATT
GCGACTAGCA TACCAACCGC AACAAATGCG GCAACCAGCG TGCCGACGGC AACGCGTGCA
CCAACCAGCG TGCCCACGGC CACCCCAAGC ACAACCACAG GCCGCCAAAT GGAAAAACTC
AATCGCGGGA TCATCAGCGT GCGCCAAGGC AGCAATAATT TTGTGAGCTG GCGCATGTTT
GGCACTGATC TTAGCGCGAT TGGCTTTAAC CTCTATCGTG GCACCACCAA AGTTAATTCC
AGCCCAATTA CCAATGCTAC CAGTTATTTG GATAGCGGCG CGGCGGCCAA TAGCGTTTAC
ACCGTGCGGC CTGTGATCGA TGGCGTTGAA CAAACTGCCT CAGAAAACTC GCTCAATTTT
GCCAATGGCT ATCTCGATGT GGCCTTGCAA ATTCCGGCTG GTGGCACGAC ACCTGATGGC
GTGGCCTATA CCTACACCGC CAACGATGCC AGCGTCGGCG ACCTCGATGG CGATGGGCAA
TACGAAATTG TACTGAAATG GGATCCAACC AATTCCAAAG ATAATTCGCA ATCTGGCTAT
ACTGGCAATG TTTATCTCGA TGGCTACAAA TTGAACGGAA CCCGTTTATG GCGCATCGAT
TTGGGCCGTA ATATTCGGGC TGGGGCGCAT TACACCCAAT TTATGGTCTA CGATTTGGAT
GGCGATGGGA AGGCCGAAGT TGCCGCCAAA ACTGCCGATG GCACGCGTGA TAATTCTGGC
ACCGTGATTG GCAACGCCAG CGCCGATTAT CGCAATTCCA GCGGCTACAT TCTTTCTGGC
CCCGAATATC TGACGGTATT CAATGGCCAA ACGGGCGTGA TTCGCTCGAC CGTCAATTAT
GATCCTGCAC GGGGCACGGT TTCGTCGTGG GGCGATAGCT ATGGCAACCG CGTTGATCGC
TTTTTGGGTG GAATTGCCTA CCTCGATGGT CAACGCCCGA GCCTGATTAT GAGCCGTGGC
TACTACACCC GCAGCGTAAT TGCCGCTTGG GATTTCCGCA ATGGCAGTTT GACCAAGCGT
TGGACGTTTG ATAGCAATGT GTCGGGCAGC CAATATGCTG GGCAAGGCAA CCATGGCCTT
TCGATCGCCG ATGTTGATCA AGATGGCAAA GATGAGATTA TCTTCGGAGC CATGACGATT
AATGATAATG GCCAACCACT GTGGAACACT CGCAATGGTC ATGGCGATGC GATGCACGTC
GGCGATCTTG ACCCAAGTCG GGCTGGCTTG GAAGTGTTCA AAGTCAGCGA GGATTCATCA
AAGCCTAGCT CGTGGTTTGC CGATGCCCGC ACAGGCCAAA TTTTGTGGCA AACAGCGGCA
GGTGGCGATA ATGGGCGCGG CGTTTCGGGC GATATTTGGT CGGGCAGCCC GGGCGCTGAA
TCGTGGTCAT CGATGGATAG CAATTTGCGT AGCGTCAGCG GGGCAACTCT TGGCCGCAAA
CCATCAGCAA CCAACTTCTT GATTTGGTGG GATGGCGATC CAATGCGCGA ATTGCTTGAT
GCCACCCGTA TCGACAAATA TGGCACATCA GGCGATACGC GCTTGCTGAC TGGCAGCAAT
GTTAGCTCCA ACAACAGCAC TAAATCGACC CCAGCGCTCA GCGGCGATAT TTTGGGCGAT
TGGCGCGAAG AGGTGATTTG GCGCACCAGC GATAACACCG CGCTGCGGAT TTATTCAACT
AGCACCAGCA CCAACCGCCG CATCTTCACC TTGATGCACG ATGCCCAATA TCGAGTGGCA
ATTGCTTGGC AAAACACCGC CTACAATCAA CCACCGCATC CTAGCTTTTT CTTGGGCGAT
GGCATGAGCA ATCCACCGCA ACCGAATATC TACTTGCGCT AA
 
Protein sequence
MNYRWRICLL LIATILSTFS YTNTNAQSGL SCQVNYALTN QWGGGFQADV VVRNTGTSAI 
NGWTVAWSAA SGQQIGQMWN ATFTQSGTQV SAKNVDWNAN IAAGGSQSFG FTATTTGSLA
VPSSFTVNGV VCGGSVSPTA TTPAATATRT PTSVPTATRI ATSIPTATNA ATSVPTATRA
PTSVPTATPS TTTGRQMEKL NRGIISVRQG SNNFVSWRMF GTDLSAIGFN LYRGTTKVNS
SPITNATSYL DSGAAANSVY TVRPVIDGVE QTASENSLNF ANGYLDVALQ IPAGGTTPDG
VAYTYTANDA SVGDLDGDGQ YEIVLKWDPT NSKDNSQSGY TGNVYLDGYK LNGTRLWRID
LGRNIRAGAH YTQFMVYDLD GDGKAEVAAK TADGTRDNSG TVIGNASADY RNSSGYILSG
PEYLTVFNGQ TGVIRSTVNY DPARGTVSSW GDSYGNRVDR FLGGIAYLDG QRPSLIMSRG
YYTRSVIAAW DFRNGSLTKR WTFDSNVSGS QYAGQGNHGL SIADVDQDGK DEIIFGAMTI
NDNGQPLWNT RNGHGDAMHV GDLDPSRAGL EVFKVSEDSS KPSSWFADAR TGQILWQTAA
GGDNGRGVSG DIWSGSPGAE SWSSMDSNLR SVSGATLGRK PSATNFLIWW DGDPMRELLD
ATRIDKYGTS GDTRLLTGSN VSSNNSTKST PALSGDILGD WREEVIWRTS DNTALRIYST
STSTNRRIFT LMHDAQYRVA IAWQNTAYNQ PPHPSFFLGD GMSNPPQPNI YLR