Gene Haur_4309 details


Gene Information

Locus tag: Haur_4309
Symbol:
ID: 5736168
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5502197
End bp: 5504134
Gene Length: 1938 bp
Protein Length: 645 aa
Translation table: 11
GC content: 50%
IMG OID: 641281469
Product: hypothetical protein
Protein accession: YP_001547069
Protein GI: 159900822
COG category:
COG ID:
TIGRFAM ID:
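
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record; it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5502197
end_bp = 5504134
gene_length_bp = 1938
protein_length_aa = 645

# Assumption: coordinates are 1-based and inclusive, so both end points count.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not appear in the protein.
total_codons = span // 3
coding_codons = total_codons - 1

print(span, span % 3, coding_codons)  # 1938 0 645
```

Under these assumptions the span equals the listed gene length, is a whole number of codons, and yields the listed protein length.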


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTGGCGG CGTTTTCGTG GACTAATTTT TTGCTTGCAG CCAATGACCT AACGCTGGCA 
GCAATTGTGG TCGTGGGGTT TTCGTTATTT GCCTATATTG CGCTGCACAA TTGGCGCAAT
GGGGTAGCTC GTTCGTTTTG TTTGTTGATT GTGGGGTTGA TGATTGTGCT GGGGGGCGCA
ATTCTCCAAC GTCAAGCGCA AACCGAAGCT ACTCGTCATG TGCTTTGGCG CATTCAATGG
GCAGGTATTA GCCTTGTGCC TGCGGCCTAC TACCATTTTG CCGAATCTCT GCTTCGTAGC
ACGGGTGATC CACGTATGTG GACGCGAGTG CTCTTGCCTT TATTCTATAC ATTTAGTGTT
GGCTTTTGGC TGGTCGCGCT GACGAGTAAC ATTTTGGTGA TTGATGTGCC AAGTCAGCCG
TATGTTGGCT TTGGCAAAGG GCCATTGTTT TGGTTCTTCA TTAGCTATTT CGTCACCGTG
TGTTTGCTCG GGGTTTGGTG TATTCGCCAA GCCCATCGGC GCTCGATTAC TCCGGCCAAT
CGGCGGCGCT TATGGTATTT AAGTACTTCT TTTTTAGCCC CATTTCTGGG TGTATTTCCC
TATTTGATTA TCGCCGCCAA TACTAAGGGC GTGCCATCCT GGCTTTCGTT GATGCTACTA
GGCGCAAGCA CCACCGGCGT GGGAGTGATG ATGACCCTCA TGACCTATAG TGTGGCTTTC
CATGGGGTGA TTGTGCCAGA TCGTTTGGTT AAGTATAATT TCTTACGTTA TGTATTATAT
GGGCCATTCG TTGGCGTAGC CTTGATTATT TGTTTGCAAT TGGTTGAGCC GATCAGTGCC
GCCACCGGCT TGCCGCGTGC AACGATCACG ATTTTTGGGG TAATGTTGAT GACGGTGATG
ATGCCAATTT TTATTGGGCG AATTCGGCCA ACCGTCGATA CCTTGATTTA TCGCCAAGAT
AGTGATGAAG TGCGTTGGAT GCGCCGCTTC GAGGAGCGAG CTTTCACCCG CCAAGATTTA
CGCCAATTGC TCGAAAATAC CTTGGTGGCG GTTTGTGGCT CGTTGCGGGT TGAATCAGGC
TTTGTGCTGG CTCCCAATGA CGAGCATTTT ACCGTTCAAG CATCGTGTGG CCCGCGCCGC
ACCATCAAGC AATTTTTGAA TGTTCATGAT ATCAACGAGC TATTGCAAAA TCTGCCCCAC
TTGGCCTTCA GCAACGATCG CATTCCCGAA GTTGAGGATT TTAGCATTCG CGATGGCTTT
TGTTTGTTGC CATTGTACAA TAGCCAGCAG GAATTATTGG GCGCAATTGG GATTGGTTGT
CGGCCAGAAC AACTGACCAT TCCCACTCGC CAGTTAATTG CAACCTTGGC GCATCAGATG
GAATTGGCGC TAACCCATAT GCAATTGCAG CAAAACCTAT TTAGCTCGTT GCGTGGGCTA
GCCCCCCAAA GCGCTTCGCT GTTACAATTA ACCAGTGAAA TTGAAACGCC AGTCACCGAA
AAAAACGATG CGCTGGCTGA TGTGGCCTTG CACCCTGAGT TTCCACAGTT GGTCAAGGAT
GCACTTTCAC ATTATTGGGG TGGCCCAAAA CTGAGCGATA GCCCATTGCT CGATTTGCGC
ACAGTGCGCC AATTGCTCGA TACCCAAGGT GGCAGCCCAA CGCGGGCTTT GCAAGGCGTG
TTACGCCAAG CGATCGAAAA CATTCGGCCT GAAGATCAAC TTGATCCAAC GGCTCCCGAA
TGGATGATTT ACAATATTTT AGAATTACGC TTTCTCAAAG GCTTACGGAT ACGCGAGATT
ATAGATAAGC TCGCAATGAG CGAATCGGAT TTTTATCGAA AACAACGGGT GGCGGTGGAA
GAAGTGGCCC GTCAGTTGGC GCTGATGGAA GACCAAGGCG ATCGCCCTTC CGGCTCGGTT
GAGCGACAAC GTCCCTAA
 
Protein sequence
MVAAFSWTNF LLAANDLTLA AIVVVGFSLF AYIALHNWRN GVARSFCLLI VGLMIVLGGA 
ILQRQAQTEA TRHVLWRIQW AGISLVPAAY YHFAESLLRS TGDPRMWTRV LLPLFYTFSV
GFWLVALTSN ILVIDVPSQP YVGFGKGPLF WFFISYFVTV CLLGVWCIRQ AHRRSITPAN
RRRLWYLSTS FLAPFLGVFP YLIIAANTKG VPSWLSLMLL GASTTGVGVM MTLMTYSVAF
HGVIVPDRLV KYNFLRYVLY GPFVGVALII CLQLVEPISA ATGLPRATIT IFGVMLMTVM
MPIFIGRIRP TVDTLIYRQD SDEVRWMRRF EERAFTRQDL RQLLENTLVA VCGSLRVESG
FVLAPNDEHF TVQASCGPRR TIKQFLNVHD INELLQNLPH LAFSNDRIPE VEDFSIRDGF
CLLPLYNSQQ ELLGAIGIGC RPEQLTIPTR QLIATLAHQM ELALTHMQLQ QNLFSSLRGL
APQSASLLQL TSEIETPVTE KNDALADVAL HPEFPQLVKD ALSHYWGGPK LSDSPLLDLR
TVRQLLDTQG GSPTRALQGV LRQAIENIRP EDQLDPTAPE WMIYNILELR FLKGLRIREI
IDKLAMSESD FYRKQRVAVE EVARQLALME DQGDRPSGSV ERQRP
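
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch under stated assumptions, not the tool that produced this record: it assumes translation table 11 assigns the same amino acids to codons as the standard genetic code (differing only in permitted start codons), which does not matter here because the gene begins with ATG. The names `translate` and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
# Assumption: table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        c1, c2, c3 = seq[i:i + 3]
        index = 16 * BASES.index(c1) + 4 * BASES.index(c2) + BASES.index(c3)
        residue = AMINO_ACIDS[index]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases and last 18 bases of the gene sequence listed above.
print(translate("ATGGTGGCGG CGTTTTCGTG GACTAATTTT"))  # MVAAFSWTNF
print(translate("GAGCGACAAC GTCCCTAA"))               # ERQRP
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above. Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow the whole protein and the GC content field to be compared against the record.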