Gene Haur_2258 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2258 
Symbol 
ID5734145 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2888574 
End bp2890166 
Gene Length1593 bp 
Protein Length530 aa 
Translation table11 
GC content51% 
IMG OID641279399 
Producthypothetical protein 
Protein accessionYP_001545026 
Protein GI159898779 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTACGGC GTAAAAAAAT TCTGTTAAGT GGATTGATCA TGCTTGGTTT AGGGCTAGTT 
TGGGATAGCC GCCCAGTTGC TGCCGATAGC GTGGTGGTAG GCACTGGCAC ACCTGCTAGT
TGTAACGAGG CGGCCTTTGA TGCTGGCTTG GCTCAGCTTT TTCCAGGCGA ACAAGCCCCT
GGCGGCACGC TGACCTTTAA TTGTGGGCCG AATCCGCATA CGATTGTTTT AACCAGCCAA
AAATTTTTGC ACGATGGCTC GGTGATTGAT GGTGGCGGCA AAATTACGCT CTCTGGCGGC
AATACCACGC GAATTTTTTG GGTCAGCCAA CAAGCGCGGG TCGAAATTCA GCGCATCATC
CTGACGAATG GCAATGCTCA GCATAGCGGG GCGATTTTTG CCGAGCCAAA TTGGAGCGGC
GAGTTTACCA ATTTGGCCCT CAACCAAGTA ACAATTAAGC ATAGCCAAGC GACAACTTTT
GGTGGTGGGA TTGGGGCGCA ACATACCAAC CTGAGCCTGA TTGATAGCCT GATTGAAGCC
AATCGATCGA GTGGCAGCGG CGGTGGCGTA AGTTTTAATA CTGGCAATCT AACAATTCGT
AATAGCAAAT TTAGCACTAA CAAAGCCGAG ACCGAAGGCG CTGGGCTTGA GGCATGGACG
GCAAATTTAG ATATTAGCCA AACGAATTTT GAGCTAAACG AATTACAAGG CCGTGAACAT
ACCGATTTTG GTGGTGGCCT CGTGATTCAG CAAAGCTACG GGGTGTTTCA AGGCGGACGT
ATCTGGAGTA ATATTGCTGG TCAAGGCGGT GGCATTTATC TGCGCGGAGG CAGCACAATT
GAATTTAACG CCAGCAAAAT TGCCGATAAT GTGGCTTTTA ACGAAGGCGC TGGGGGCTAT
ATTACCGCTA ATTCAAGCTT GACCTTCAAA AATGGCATTA TTGATCAGAA TTTGTCGGCA
GTAGCTGGTG GCGGCATTGC CAACCAGGGT GGCCTGTTGA TCGAACGCTC GACCCTAACT
AATAATGAAG CTTTACAGAG CGATGGCGGA GCACTTGATA ATACAGGCGT GGCCGTCTTG
AGGTATAGCA CTCTCGCCAA AAACAAGGCT CAGCGTGGGG CTGGGCTGAA TAATCGCCCG
AATAGCACAC TGGTAATCGA TCGTGTGACC ATGACCGCCA ATAATGCAGA GATCGCTGGC
GGTGGAATCT ATCATGCTGG CACTCTATTT ACGGTCGATA ACAGCATTCT TACCTATAAT
AATGCCCCGG CAGGAGCGCA ATGTGGCTAT GCCAGCCAAG TGCCGAGCAT GAGCTTTAGT
ATGTGGAGTG ATGGAAGTTG TGGCACGCAA ACCATCGATG GTAATAAACC ATTTACTGGG
CCAAGCTTGC GACCATTGGG CTGGTATGGT GGCCCAACCC CAACCTACTT GCCACTCAGC
CATAGCGCAT CGACCGATGC TGGCTCATGC TCAAGCTCTG CTGTGACCGA TCAACGTGGT
TTGGCAGGCT TTGTGGGTGC GGCCTGCGAT ATGGGCGCGG TCGAAAGTGG CTCGTTATGG
TATCAAGTGG CGTTGCCAAT GACGATTAAG TAA
 
Protein sequence
MLRRKKILLS GLIMLGLGLV WDSRPVAADS VVVGTGTPAS CNEAAFDAGL AQLFPGEQAP 
GGTLTFNCGP NPHTIVLTSQ KFLHDGSVID GGGKITLSGG NTTRIFWVSQ QARVEIQRII
LTNGNAQHSG AIFAEPNWSG EFTNLALNQV TIKHSQATTF GGGIGAQHTN LSLIDSLIEA
NRSSGSGGGV SFNTGNLTIR NSKFSTNKAE TEGAGLEAWT ANLDISQTNF ELNELQGREH
TDFGGGLVIQ QSYGVFQGGR IWSNIAGQGG GIYLRGGSTI EFNASKIADN VAFNEGAGGY
ITANSSLTFK NGIIDQNLSA VAGGGIANQG GLLIERSTLT NNEALQSDGG ALDNTGVAVL
RYSTLAKNKA QRGAGLNNRP NSTLVIDRVT MTANNAEIAG GGIYHAGTLF TVDNSILTYN
NAPAGAQCGY ASQVPSMSFS MWSDGSCGTQ TIDGNKPFTG PSLRPLGWYG GPTPTYLPLS
HSASTDAGSC SSSAVTDQRG LAGFVGAACD MGAVESGSLW YQVALPMTIK