Gene Haur_3461 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3461 
Symbol 
ID5735322 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4355100 
End bp4356560 
Gene Length1461 bp 
Protein Length486 aa 
Translation table11 
GC content52% 
IMG OID641280608 
Productcellulose-binding family II protein 
Protein accessionYP_001546225 
Protein GI159899978 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3866] Pectate lyase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.560967 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATTAT CAATTGCCCG ACGTTGGGTT ATTGCCACCG GCTTAGCCAC TGCCTTGATT 
GGCGCAATCC AAAGCCCAAT CACCAACGCC CAAACTGGCC TAAGCTGCCA AGTCAATTAT
ACGCTTACTA ACCAATGGGG CAGCGGGTTT CAAGCCGATG TGGTGGTGCG CAACACCGGA
ACCAGTGTGA TCAACGGCTG GACGGTTGCT TGGAGTGCTG CCAGCGGCCA GCAAATTGGC
CAAATGTGGA ATGCGACGTT TACCCAAAGT GGCAGCCAAG TCAGTGCCAA AAATGTTGAT
TGGAACGCCA GCATCGCTGC TGGTGGCAGC CAAAGCTTTG GCTTTACCGC TACCACGACT
GGTAGCTTGG CCGTGCCCAG CAGCTTTACG GTCAATGGCG TTGTTTGTGG CGGTAGTGTT
AGCCCAACCG CAACCCGCAC GCCAGCCGCG ACTGCCACGC GTACTCCAAT TGCCACTGCA
ACCCGCACGC CAGCTGTAAC CGCCACGCGC ACCCCTGTGG CAACTACTAC CCGTACTCCA
ATTGCCACGG CTACCGTCGT TCCAACCAAT CCACCAGTCA GCAATGGCTT GATTGGCTGG
GCCACGGTTG CTGGTTCGGG CTTAAGCACA ACTACTGGCG GCACTGGTGG TAGCACAGTT
ACCGCAGCCA ACTTTACTGA ATTGCAAAAC TACGCCAAAT CGTCATCGCC CATGATTATC
AAGTTCTCGG GTACGATGCA AGGCACACTG ACGGTTGCCT CGAACAAAAC GATTATCGGC
AGCAATGGAG CCTTGATCCA AGGTAATGTC AAAATCTCAG GCGCTCAAAA TATTATTTTG
CAAAATTTTG CGATCAACGG CAATAGCTGC TCAAGCTACG ATAACTGCCG CGCTGGAAGC
GATGCCTTGG GGATTAGCAA TTCGCACCAT ATTTGGGCCG ACCACTTGAC GATTACCAAT
GGCCAAGATG GCAATTTCGA CATTAACAAT GGCTCTGATT TCATTACGGT TTCGTGGAGC
AAATTCGGCT ATACCACCAA CAAAGAGCAT CGTTTCTCGA ACTTGATTGG TAGCTCAGAC
GATGCAGCCT CGACCGATAG CGGTAAATTG AACGTGACCT TCCATCATAA CTGGTGGTTT
GGTGGGGCAA TGCAGCGCAT GCCACGTACG CGCTTCGGCA AAATTCACGT ATTCAATAAT
TTGTACACCA CCACTGGCAA CGATTATTGT GTTAGCTCAG GCTATCAATC CAAAGTGTTG
CTCGAAAATA ATGCCTTCAT TGGGGTCAAC ACGCCGCACC GCTTGCACGA TGGCGATCTC
AAGGCGGTGG GCAATCTCTA CCAAAACACC AGCGGCGATC AAATTAGTAC TGGCGTTGCC
TTCACGCCGC CCTACAGCTA TAGCGCCGAA GCTGCTAGCT CACTCAGCAG TTCAGTCCAA
GCTGGCGCAG GAGCGAAGTA G
 
Protein sequence
MKLSIARRWV IATGLATALI GAIQSPITNA QTGLSCQVNY TLTNQWGSGF QADVVVRNTG 
TSVINGWTVA WSAASGQQIG QMWNATFTQS GSQVSAKNVD WNASIAAGGS QSFGFTATTT
GSLAVPSSFT VNGVVCGGSV SPTATRTPAA TATRTPIATA TRTPAVTATR TPVATTTRTP
IATATVVPTN PPVSNGLIGW ATVAGSGLST TTGGTGGSTV TAANFTELQN YAKSSSPMII
KFSGTMQGTL TVASNKTIIG SNGALIQGNV KISGAQNIIL QNFAINGNSC SSYDNCRAGS
DALGISNSHH IWADHLTITN GQDGNFDINN GSDFITVSWS KFGYTTNKEH RFSNLIGSSD
DAASTDSGKL NVTFHHNWWF GGAMQRMPRT RFGKIHVFNN LYTTTGNDYC VSSGYQSKVL
LENNAFIGVN TPHRLHDGDL KAVGNLYQNT SGDQISTGVA FTPPYSYSAE AASSLSSSVQ
AGAGAK