Gene Haur_3384 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3384 
Symbol 
ID5735245 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4267435 
End bp4268661 
Gene Length1227 bp 
Protein Length408 aa 
Translation table11 
GC content50% 
IMG OID641280531 
Producthypothetical protein 
Protein accessionYP_001546148 
Protein GI159899901 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3693] Beta-1,4-xylanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAAAT CACGCTGGTT AAGTGCCGCC ATGGTCTGCT TGCTAGCAGT TAATCTTTTG 
GCGGCCTGTG GTGGCGATAG TGCGCCCACT ACCCAACCAA CCAATCCTGA AGCAGCCACG
GCTACGCCCG AAACTGCTGC TCCAACCACT GATAGCAATC TACCAGTAAC GACGGGCAAC
CCGTTGCAAT TGCCCTATTT GCAATATGGT GCAGCCGCGC AACTGTACTA TACTGATCGT
AATCGCGCCT TGACCTTGAT GAACAACGCT GGGTTCGATT GGGTGCGCCA ACAGATTCAA
TGGAAAGATA TTGAAGGCCC AAAAGGTAAC TTTGGCTGGG GCGAACTCGA TGCAATTGTT
GCTGATGCCA ACGCCAAAAA TATCAAAGTG CTGTTGAGCA TTGTACGCTC ACCATCGTGG
GCACGGGCCG ATGGAACCAA CGGCATGCCC GATAACATCA AAGATTTTGG CGATTTTGTT
GAGGCCTTGG TGGTACGCTA CAAAGGCAAA GTCCAAGCCT ACGAAATTTG GAACGAACAA
AATCTTGATC ATGAAAATGG CGGCTCACGT GATTCGATCG ACGCTACCAA ATATGTTGAT
CTCTTGGTCG AAGCCTACAA TCGGATCAAA CCGATCGATC CTGAAGCCTT TGTGATTTCA
GGAGCATTGA CTTCAACTGG CGATTCACCA GCGGCGATCG ATGATATGAC CTACTTTGAG
CAAATGTTTA GCTACAAAGA TGGCATTTTC AAAGATCACA TCGATGGTGT GGGCTTCCAT
CCTTCGCCAT CGTACAATCC GCCAGCAACC TTATGGCCCG ACCAGCCCGG CCCAGGCCCA
GGTTGGCTCG AAAGCCCAAC CCACTACTTC CGCCATATCG AAAATCTCAA AATCTTGATG
GATAAATATG GCATGCAAGA TTATCAAGTG TGGGTGACTG AATTTGGCTG GGCGACCCAA
AACACCAGCC CAGGCTATGA GTATGGCAAC GAAATTAGCT TCGAGCAACA AGGTCAATAT
GTGCTCGATG CGCTGCAAAT GACCCGCCGC GATTACCCAT GGGTTGCCAC CATGTTTGTG
TGGAACCTCA ATTTTGCGGT AACTTCGCCT GATCCGCTTG ATCAAACCGC CTCATTCGGT
ATTCTCAACC CCGATTGGAG TCCACGGCCA GTCTTTGAAA AAATTCAAGG CTTTGTTAAC
GCCGTCAAAA CCGAGGAAGG TCGCTAA
 
Protein sequence
MPKSRWLSAA MVCLLAVNLL AACGGDSAPT TQPTNPEAAT ATPETAAPTT DSNLPVTTGN 
PLQLPYLQYG AAAQLYYTDR NRALTLMNNA GFDWVRQQIQ WKDIEGPKGN FGWGELDAIV
ADANAKNIKV LLSIVRSPSW ARADGTNGMP DNIKDFGDFV EALVVRYKGK VQAYEIWNEQ
NLDHENGGSR DSIDATKYVD LLVEAYNRIK PIDPEAFVIS GALTSTGDSP AAIDDMTYFE
QMFSYKDGIF KDHIDGVGFH PSPSYNPPAT LWPDQPGPGP GWLESPTHYF RHIENLKILM
DKYGMQDYQV WVTEFGWATQ NTSPGYEYGN EISFEQQGQY VLDALQMTRR DYPWVATMFV
WNLNFAVTSP DPLDQTASFG ILNPDWSPRP VFEKIQGFVN AVKTEEGR