Gene Haur_4399 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4399 
Symbol 
ID5736249 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5622285 
End bp5623490 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content48% 
IMG OID641281561 
Producthypothetical protein 
Protein accessionYP_001547159 
Protein GI159900912 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTGAATC TGAACAAAAC TAGCAATGCA ATTTTGCAGC AATTACTTGA TCAACACGAG 
CAGCCAGAGC GTCAGCGAGT CAATCGGGTG CAGATTAAAG CGGCCAAATT TTCGCGTTAT
TTTGATGATA AACAGATTGA CGAACGCCAA CAAACCAATA ACTACTTGGT TGAGCTAGCA
AAAAATCAGA TAATCAAGTT GTATTGGCGT AAATGGGAAG AAGGCAATTG GCTTGAAGCG
GTTGATTTGC TTGATGCGGC GGCGCTATAT CGTTTGCTGA AACGCCAACC CTTGGCCGAG
CAACAGCAGG CCTTGCGTAC ATTATTGGCT GAATATGTGC CAGTTCAAGG CTGGCTGGCT
GATTGGCTGG CGTGGCTCGA ACAGCAATTA ATCCAACAGC GCTCGATCCA ACCGCTTGAT
TTAACCGACC CTGCTTGGAA TCGCGATGTA CTACGGGCGA TTTATGGCCT GACTCAGCTA
GAAACGCCAA TTTTAGAGCG TTTATTTAGC GTAGGTTGGC TGGGTCAGAG CAAACGATTT
AGCGAGCTAG AAGGCGCAGT TTTGCGGGTT TTGCGCCAAT TTGCCCCGCA AGCCAAGCAA
TTTGGCGACA ATGATCGGGC TTTGCTGCAA GCCTTTAATC TCGAAAAAGT GCCAGAATAT
GTGCTACTTG CTGGCGATTT ACAACTGGAA TTGCATGGGA ATCGGCTGGA ATTAGGCGCG
TTTCGGCCAA GTTTGGGCTT GCCTAGCTCG ATGTTGCGTC AAGCTCAGGT GCTGGATTCA
GCCTGTACTG AAATTATTAC GATTGAAAAC TTGACCAGCT TCCACAGTAT GCTTGCCCGC
CAACCACAGG CGTTGTTGAT CTATACCGGT GGCTTTGCTA GCCCCAGCCT CTGCCAATTT
TTGAGCAAAC TAGCCATGGC TTTGCCCAAT TTAACGTGGT ATCACTGGGG CGATTACGAT
GTAGGTGGTT TGCGAATTTT GGCGCATCTA CGCCAGCATG TTGCCAAGAT TCGCCTGTGG
CAGCCTGATC CAGCGATTTT TCAGCGGGCT GGCAAAGCTA CCCAAAGCTT GAATTCTAAA
GAACGCCAAA GCCTCACTGA ACTCCAACAA CACCCATTGC TTTATGATTG CCAAGCCTTG
ATCGGGGCAA TGCTGGAACA GAATATCAAA CTTGAGCAAG AGCAACTTGA TCTTTTGGGG
CACTAA
 
Protein sequence
MLNLNKTSNA ILQQLLDQHE QPERQRVNRV QIKAAKFSRY FDDKQIDERQ QTNNYLVELA 
KNQIIKLYWR KWEEGNWLEA VDLLDAAALY RLLKRQPLAE QQQALRTLLA EYVPVQGWLA
DWLAWLEQQL IQQRSIQPLD LTDPAWNRDV LRAIYGLTQL ETPILERLFS VGWLGQSKRF
SELEGAVLRV LRQFAPQAKQ FGDNDRALLQ AFNLEKVPEY VLLAGDLQLE LHGNRLELGA
FRPSLGLPSS MLRQAQVLDS ACTEIITIEN LTSFHSMLAR QPQALLIYTG GFASPSLCQF
LSKLAMALPN LTWYHWGDYD VGGLRILAHL RQHVAKIRLW QPDPAIFQRA GKATQSLNSK
ERQSLTELQQ HPLLYDCQAL IGAMLEQNIK LEQEQLDLLG H