Gene Haur_3251 details

Gene Information

Locus tag: Haur_3251
Symbol:
ID: 5735119
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4112784
End bp: 4114154
Gene Length: 1371 bp
Protein Length: 456 aa
Translation table: 11
GC content: 54%
IMG OID: 641280397
Product: hypothetical protein
Protein accession: YP_001546016
Protein GI: 159899769
COG category:
COG ID:
TIGRFAM ID:
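
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information record.
start_bp = 4112784
end_bp = 4114154
gene_length_bp = 1371
protein_length_aa = 456


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons in a CDS, excluding one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


computed_length = span_length(start_bp, end_bp)
computed_protein = coding_codons(computed_length)

# Both comparisons hold for this record under the stated assumptions.
print(computed_length == gene_length_bp)
print(computed_protein == protein_length_aa)
```

Under these assumptions, 4114154 - 4112784 + 1 gives 1371 bp, and 1371 / 3 - 1 gives 456 codons, matching the listed gene and protein lengths.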


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0229094
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGCAG ATATCGTCCA GCATAAGTCT GATGATATGG AACAAGTTTC CAGTCGGTTC 
CAAAAATTGC ATGATCACCA AGTGCAAATG GAAGCGATGA TTCGCCGCAT GTATGAGGAA
CTTCGAGGTG GTGCTTGGCA AGGGGATGCA GCCCAAGCAT TCTTTGCCGA AATGAACGAG
CATATCTTTC CTGCTATGAA GCGATTTCAA AATGTGCTAG CAGTGAGCGG CGAAGTTACC
AAGCAAATCT CGCAAATCTT CCGTCAAGCC GAAGAAGAAG CCGCCAAAGG CATTACCTTT
GATGGCAGCG GCGCACCATC TGGTGGTGGC AATGGCACGG CTAGTGCTGG TGGCGGCGGC
GGTGGCAACG CCGATGGTAG TGGCAGTGGT GGCAGTAATG CTGGTGCTAC TGGTGGTGGT
GCAGGTATTG CTGCTGGCGG CGGCGGTTCA TCGTCAGGTG GCGGTGGCGG CGGTTCATCG
TCAGGCGGCG GTGGTGGCGG CTCATCGTCA GGTGGCGGTG GTGGCGGTTC ATCGTCAGGC
GGCGGTGGTG GCGGCGGTAG TAGCACTGCT GGCGCAGGCG GCGGCGGTGG CGGTGGCGGC
GGTGGTGGCA GCAGCCAAGA ACCAATGTCA ACCGAGCAAG TCTTCAATGA TAAATACATG
GGCGATTTGG TCGGCCAACA ATTCCAAGGT GCTGGCAACC CTGAGTTGAA CTCGGCCATG
GAATTGCTGA CCAGTGGCAA TGCAACCCCT GAGCAAATTG AAGAAGCACT CAAGAAGATT
GCCGCAGCTC GTGGCGTGCC ATTAGAAAAA ATTCAAGCCG ACTATGGCAA GTTCCTTGAA
TTGCGCGAAC AAGCTGCCAA AACCGGCGCA GCCAACGGCC AATCGGCTGT TGAAGCAATC
AACCAAACCT TCCATGGCGA TTTCATGGGC AGCACCTCAA GCTTGCGCTA TGGTAAAGTC
GTCGGCGATG TCTTGGGCAT CGACCCAGTA TTTGGTTCAA TGCTCAATCC AAGCGGTGGC
TTGGTTGGCC CTGGCAACAA AGCAATCGAC TTAGGCGATT CACCAGTCAG CTATCACGGT
GCTGTCCACG ATGCTGCTGG CTACCTCTTC AACTACCACG ATATGGGCCC AGGCTATAAC
TACCTTGGCT TGGAACGCCG CGACACGGCC AACCCATTGA CTGGCCAAGA ATCTGGTATT
CGCTACTGGA ACGAAAAAAT GGGCAACACT GGCATCGGCG CAACCATTAG CAACGGCGCT
GGGAGCTTGA TTGGTAAAGC TCAAGATGCT GTCAACTGGT TTGGCGATGT CAAGCAAGAT
GTCCAAAATA CCTGGAGCGG AGTCAAAGAT TGGTTCTCCA AAACCTTCTA G
 
Protein sequence
MTADIVQHKS DDMEQVSSRF QKLHDHQVQM EAMIRRMYEE LRGGAWQGDA AQAFFAEMNE 
HIFPAMKRFQ NVLAVSGEVT KQISQIFRQA EEEAAKGITF DGSGAPSGGG NGTASAGGGG
GGNADGSGSG GSNAGATGGG AGIAAGGGGS SSGGGGGGSS SGGGGGGSSS GGGGGGSSSG
GGGGGGSSTA GAGGGGGGGG GGGSSQEPMS TEQVFNDKYM GDLVGQQFQG AGNPELNSAM
ELLTSGNATP EQIEEALKKI AAARGVPLEK IQADYGKFLE LREQAAKTGA ANGQSAVEAI
NQTFHGDFMG STSSLRYGKV VGDVLGIDPV FGSMLNPSGG LVGPGNKAID LGDSPVSYHG
AVHDAAGYLF NYHDMGPGYN YLGLERRDTA NPLTGQESGI RYWNEKMGNT GIGATISNGA
GSLIGKAQDA VNWFGDVKQD VQNTWSGVKD WFSKTF
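
The sequences above are printed in blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The sketch below shows one way to strip that layout and compute a GC percentage for a nucleotide string. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(grouped: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, exactly as laid out in the record.
first_line = "ATGACAGCAG ATATCGTCCA GCATAAGTCT GATGATATGG AACAAGTTTC CAGTCGGTTC "

bases = clean_sequence(first_line)
print(len(bases))
print(round(gc_percent(bases), 2))
```

Passing the full gene sequence through the same two functions gives a figure that can be compared with the GC content listed in the Gene Information section; a single line is not expected to match the whole-gene value.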