Gene Haur_0312 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_0312 
Symbol 
ID5732207 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp370945 
End bp372111 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content50% 
IMG OID641277436 
Producthypothetical protein 
Protein accessionYP_001543092 
Protein GI159896845 
COG category[R] General function prediction only 
COG ID[COG4106] Trans-aconitate methyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0938935 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACTAA CACCAGAAAA TTATCATGAG CAGTTGATCG CGTTGGCGTT AACTGACGAT 
TTTGTGCGCC TGACCATGAG TGGTGCTGCC CGTGCCACCG ATCTACGTTG GCAACGGGTG
GTGGTGCGGC CTGTGCAATT GAAACAAGGC CGCGCTTGGC AAGCAGCCTA TTTTGACCAG
CGTCAAAATA TCACCAAAAA TTACGCAATC GAGCAAGCCA GCAGCGCCCT CGGTGAGATT
ATTGCGATTC CGCTGAGCAA TATTACGCTG GAAACCACCA GCGAACGCAT CCAAATCCAA
CGGAGCAAAA AGGGCAAAGT GATTATTAGT CGAGTCCGCA ATCAAGCGGC GGCTCCAGAT
TTGCGCCATA ACCACGTTAA AGCCTTGCCT TTGCCCAGCG ATAGCCCTGA TGCCTATTTA
CAAAAAACGG GCATTATGAC CAACGATGGG GTGATTCGTG CCAGCATGAG CAAAAAATAC
ACCCAAATCA ATGAATTTTT GCGGGTGTTC GATGAGCTTG ATCTCAAACC CAGCCCTGAG
CAACCGTTGC GAATTCTTGA TGCTGGCTGT GGCTCGGCCT ATTTGACCTT TGCGGCCTAT
CACTATTTGG TCAACATTCG TGGCTTAGCG GCGGTGGTGA TTGGGGTTGA TTCAAACGAA
TATTTAATTG CTAAATGTCG CGCTCAAGCA GAAGAATTGG GCTACACCGA TATGCAATTT
ATCGCCATGC CCTTAGCCGA TTGGCAGCCT GAGCAGCAGC CAGATGTGGT GTTTTCGTTG
CATGCCTGCG ATACCGCCAC CGACGATGCC TTGGCTTTGG CGATTCGCAG CCAAGCCCAA
GCAATTTTGA GTGTGCCTTG TTGCCATAAA CATCTGACGC ATCAAATTCA AGCCGAGGTG
CTCAACTCCA TGTTGCGCCA TGGCAGCATT CGCCAGCGCA CCGCCGATTT AGTGACCGAC
AGCCTGCGGG CGCAACTGTT GCGGATCAAT GGCTATCGTA GCGAGATTAT TGAGTTTGTT
GATGCAGAGC AGACTGGCAA GAATTTAATG ATTCGGGCTA TTCGTAGCAA AAAACCTGAT
TCCAAAGCCG TGGCCGAATA TCAGGCACTT AAGCAATTTT GGGGTGTTAC GCCCTATTTA
GAGCAGTTGT TGGGATCAGG CAACTGA
 
Protein sequence
MQLTPENYHE QLIALALTDD FVRLTMSGAA RATDLRWQRV VVRPVQLKQG RAWQAAYFDQ 
RQNITKNYAI EQASSALGEI IAIPLSNITL ETTSERIQIQ RSKKGKVIIS RVRNQAAAPD
LRHNHVKALP LPSDSPDAYL QKTGIMTNDG VIRASMSKKY TQINEFLRVF DELDLKPSPE
QPLRILDAGC GSAYLTFAAY HYLVNIRGLA AVVIGVDSNE YLIAKCRAQA EELGYTDMQF
IAMPLADWQP EQQPDVVFSL HACDTATDDA LALAIRSQAQ AILSVPCCHK HLTHQIQAEV
LNSMLRHGSI RQRTADLVTD SLRAQLLRIN GYRSEIIEFV DAEQTGKNLM IRAIRSKKPD
SKAVAEYQAL KQFWGVTPYL EQLLGSGN