Gene Haur_4543 details


Gene Information

Locus tag: Haur_4543
Symbol:
ID: 5736939
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5813662
End bp: 5814759
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 46%
IMG OID: 641281705
Product: hypothetical protein
Protein accession: YP_001547302
Protein GI: 159901055
COG category:
COG ID:
TIGRFAM ID:
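
The start, end, and length fields above are related by simple arithmetic. The sketch below checks that relationship, assuming the coordinates are 1-based and inclusive (an assumption, though it agrees with the reported 1098 bp) and that the last codon is a stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 5813662
END_BP = 5814759


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1098
print(protein_length(length_bp))  # 365
```

Both results match the Gene Length and Protein Length values reported in the record.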


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTTAACCC AAATTCGCTC ACGCAAACGT TTATTGGCGA TTGGCGCTGC TGTCTTGGTG 
TTCTTTTTTA TCGTGACAAC GTTGTTAAAT GGTTCGGGTA ATTCAGCAAG CGATAAATTC
CAAACTGTTT CCGATGCTTT GGATCAAGCT GCAATGCCGG AATCTGCGCC AGCCGCTGAG
ATGGAGCGTC AAGTAACTAC CGATGGTTTA ATGGATGAAT CCGATGTTGC TGCTGCGCCG
GCACTTGGCG GGGCTGCTCC TGCCGATGCT GAAAATTCAC AAGAACCAAG TGCCGCCCCC
AACCAAGCTA CTGATCGTTT GGTAATTAAA AATGCTGATG TTGAAGCCTT AATTGATTAT
AAGCAAATGC GTTTGGCCAG TACCCAAATT GAAAATATGG TGCTACGCTT GGGTGGCTAC
ATTGTTTTGA CTGACGATGC TAGCAGCAAT GACGAAGATC AAGCCTATAT TTCGCTGGCC
TTTCGGGTTC CGGCTGATCA ATTTGAAAAA GCCTTGAATG CCTTTGAAGA AAATAAACTC
GAAGTTGTGC GCCGTGAAGT TTCTGGCCAA GATGTTACCG AAGAATTTGT CGATAATCAA
TCACGATTAA CCAACTTAGA AGCCACTGCT GCACGCATTC GTGAATTGCT GGCCAAAGCC
GAAACCATCG CCGACACGAT TAAAATCAAT GAAACTTTGG CGCAATACGA AAGCCAAATC
GAAATGATTA AAGGCCGCCA AAAATATCTA AGCGATAGCG CTTCGATGAG CATGATTACC
TTGTTGATTC GGCCCAAAAC CGCTGATTAC AGCATGTTTA CCAAAATTGA TATTGGTCAA
AATATTCGCA ATGCCTTAGC TAAAGCCGAA CGTCCAGGCT GGACACCGCT CGCCGCCGCA
ACTGGTGCTT GGGACGATGT GTTGGAAATT GGCAAAGATG TTGCTGAAAC CTTGGTTGTT
TGGGCGGTTT GGCTTCCAAT TTGGTTGCCG TTGGTTTTGG CCGCATGGTT TGGCTGGCGC
AAATGGCGTA AATATAGCCA AAACCAAAGC CAAAATTCCC CAATCACTAA TCAAAATCCC
CCAGTTAATC AACCCTAA
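
The GC content field can in principle be recomputed from a sequence laid out like the one above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases, with the blank-separated 10-base blocks joined first. The function name `gc_content` is hypothetical, and the example uses only the first block of the sequence.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence.

    Whitespace (spaces between 10-base blocks, line breaks) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence above: one G and three C.
print(gc_content("ATGTTAACCC"))  # 40.0
```

Passing the full gene sequence as a single string should give a value comparable to the GC content field in the record, subject to how that field was rounded.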
 
Protein sequence
MLTQIRSRKR LLAIGAAVLV FFFIVTTLLN GSGNSASDKF QTVSDALDQA AMPESAPAAE 
MERQVTTDGL MDESDVAAAP ALGGAAPADA ENSQEPSAAP NQATDRLVIK NADVEALIDY
KQMRLASTQI ENMVLRLGGY IVLTDDASSN DEDQAYISLA FRVPADQFEK ALNAFEENKL
EVVRREVSGQ DVTEEFVDNQ SRLTNLEATA ARIRELLAKA ETIADTIKIN ETLAQYESQI
EMIKGRQKYL SDSASMSMIT LLIRPKTADY SMFTKIDIGQ NIRNALAKAE RPGWTPLAAA
TGAWDDVLEI GKDVAETLVV WAVWLPIWLP LVLAAWFGWR KWRKYSQNQS QNSPITNQNP
PVNQP
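
The protein sequence corresponds to the gene sequence read codon by codon. The sketch below illustrates this for the first 30 bases, using the amino acid assignments of the standard genetic code, which translation table 11 shares (the two tables differ in which codons may act as start codons). It assumes the reading frame starts at the first base and stops at the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered with T, C, A, G at each position.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; codons with unexpected characters become 'X'.
    """
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - len(bases) % 3, 3):
        residue = CODON_TABLE.get(bases[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGTTAACCC AAATTCGCTC ACGCAAACGT"))  # MLTQIRSRKR
```

The output matches the first ten residues of the protein sequence shown above.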