Gene Haur_5286 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5286 
Symbol 
ID5737244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009974 
Strand
Start bp75342 
End bp76469 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content52% 
IMG OID641282450 
Producthypothetical protein 
Protein accessionYP_001548041 
Protein GI159901796 
COG category 
COG ID 
TIGRFAM ID[TIGR02391] conserved hypothetical protein TIGR02391 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000747572 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTAACAC CACCCGATCC GCAACACCAG TATGTCGTGG CTTTATCGCG TCTTATCCAT 
CAAGGCAATG CCTTGTTTAA ACAGTATGAT CGACTTTCTT TTATCATTTG GAATCGTCAT
CACCACGATA AAGCCGTGCA GAGCCACCGT GACTTTCAGG CGTGGGTCGA TCATGCAAAG
AGCGACTTGC ACCCAGCCGA TCATGACGAA TTCTGCACGA TTCTGGCCGA AGGGGATAGT
ATCAGCTGGG AGCGTGCATG GGACATTGTC ACAGGCACGA TAGAGCATGT GTGGGAGAAT
CAGCAGGAAG CATATAGTAT TGGCCAACGG TGCATTATGA AGCGCTTTCT GCGGCTGCGT
GATTATCTCA AAGCGCTGCG GATTCGCCTC GCCCCACCCT CGCCCTTGTC TGCTGATTTT
TATCAGCAAT TTGCCGACCA ATTCCACCGG ATGAGTCCCG ATATGATTGA TGAATTGTTT
ATCACGAGTG GCTGTATGAT CCAGTATTGG ATACCACCCT ATAAGCCGCA AACGAAACAA
AGTGCCGATC GGGCGTATGG CTGGCTGGAC GGATTAAATC TGTATATGCC TGATTCGCTG
GTCGCGATGG TCGAGACGGT CTACCAAGCC TACCGAGACT ATAAACGGTT CCCGCGTGAT
CAGACCCTCG ATCCAATTCA AGGTGCGCTG CGGGATCTGA AACTGGCAGA CACGAAGCCA
TCACTACTTA CCCGCTACGA GCTGCACCCG CGCGTGGTCG AGCGAGCAAC CCTGCTTTGG
CAGATTGGCG AATATGACAC CGCGTTATCG CAGGTGTGTA TCGAACTTGA TAATGCGGTT
AAAGCGAAAT CGGGTCTCAA GGAGGACGGC ACCACCTTAA TGCGAACAGC CTTTTCACCC
AAAAAAACAC GGTTGGCCAT CGATCCCCGC TTTGGCAATC AACAAGGCTT TATGGATCTG
TTTGCGGGGG TGATGGATGC CATTCGCAAC CCACGCGCCC ATCACCACAA AAGCAATTTA
AGCGCGGATG AGGCTATCGA ATGGCTGGCA TTCCTGTCAG CCCTGTTTCG GGTGCTTGAT
GCGACAATCA TCAATACCCC TGATGAAACC GAGGCAAGGG GTACCTAA
 
Protein sequence
MLTPPDPQHQ YVVALSRLIH QGNALFKQYD RLSFIIWNRH HHDKAVQSHR DFQAWVDHAK 
SDLHPADHDE FCTILAEGDS ISWERAWDIV TGTIEHVWEN QQEAYSIGQR CIMKRFLRLR
DYLKALRIRL APPSPLSADF YQQFADQFHR MSPDMIDELF ITSGCMIQYW IPPYKPQTKQ
SADRAYGWLD GLNLYMPDSL VAMVETVYQA YRDYKRFPRD QTLDPIQGAL RDLKLADTKP
SLLTRYELHP RVVERATLLW QIGEYDTALS QVCIELDNAV KAKSGLKEDG TTLMRTAFSP
KKTRLAIDPR FGNQQGFMDL FAGVMDAIRN PRAHHHKSNL SADEAIEWLA FLSALFRVLD
ATIINTPDET EARGT