Gene Haur_2215 details

Gene Information

Locus tag: Haur_2215
Symbol:
ID: 5734102
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 2814179
End bp: 2815711
Gene Length: 1533 bp
Protein Length: 510 aa
Translation table: 11
GC content: 47%
IMG OID: 641279356
Product: hypothetical protein
Protein accession: YP_001544983
Protein GI: 159898736
COG category:
COG ID:
TIGRFAM ID:
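
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends with a single stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(2814179, 2815711)
print(length)                     # 1533
print(protein_length_aa(length))  # 510
```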


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATGAAC TTGCCCAACA TCCTCCAACA GCTGAATTGG CGATTGATAA AATTGACAAA 
GCTGAATTAA TTCGTGCGTT TCAGCAAAAA ATGCCAGGCT ACGATCCGGC TGTCGTGGCG
CAGGCAATTT GTGATCTGCT TGAGCAAAAA CCAACTATGA TCGATGGGCA AGCCCTGAGA
TTAACTATTA ATGACCAAAC TGCTAACTTA GCCAGCTATA ACGTCGTTGG TGGCAACCTC
AATCAAACAA CCATAACGGT TGTCATGCCA AATCCGATCG CAAGCGCAAA CACAGCTAAA
CTAAAACGTG GCCTGACTCG CCGAATTATT GCGCTAATCG TTGGTTTCAT CGGTTTGCTG
ATGCCTTTAC ACGCTTCCGA TCAATTTCAA GCCTCGCAAC AACCGTCATC ACAGAGCATG
ACAGGCGATG TCAATATTGT GATCGCTGAA TTTACCCAGC TTAATGAAGA AATTCCCAAT
CAATGGAATG ATCTGTTGGC GCAGGATCTT GATAATTTGT ATGGAATGAG TAGCTTCAAT
CGGGTCAAAA TTCAGACTGA TCCGCGTTTT ATTCCATCGA TTGAGGCCGC TCAGCAGCTA
GCCCAAACCA CAAATGCGCA TATGGTGATC TACGGCGATA CCCAATCGAG CAGCACGGGG
GCGGTTGTAA CGCTGCGTTT TTATGTGATC GATCCACATC GGATTAATAT TGGCGAGCTG
ATCAATGGCG AACACCAAAT TGGCGCTCAA CTGCAACTCT CAGAGGCTGA TCTTTTAGCA
GGTCATTCGT TGTTGGATCT ACCTGATGCA TTGATTCTGA TTGAATTTAC CAAAGCCTTG
GTCTACTTAG GGCTTGAGGG AACAACCAAT CTTGATGCAG CGCTGGCATC GATCAATCAA
GCGCTTGCCG CCGCTGAAAT TCACCCAAAT ATTAAAGGCA AAGAGGTACT TTTATTATTT
GCCGCTGTGA TCACTACTGA GCAATGTGCG CTCGAAGTCG ATGATCACAG CGCTTGTTTT
GCCCGCGCCC AGCGCTACCT TGACGCAATT CTCCAGCTTA ACCCAGCTTA TGGCCGCGCT
TACCTTGCCA AAGCCAATAT TTTTTATGCC CAAGGCAATT TGTTTCAGGC CATGGAATTC
TTTAACCAAG CCAAAGCCTT GCCAGATCAG CCATTTGGCA GCTATATCGA GGAAAAAGCC
ACCTTTGGAA TCGGCAACGT TTGTACCTTG CAATTGCAAT ATGTTCAGCA ACGGCTTGGT
GGAAGTGTAG CCGCAAATAG CGAGGCAACC AAACTAGCAA ATTGCGCCTT ACAAGCCTAT
CAAACCCTAA TTAATAACTA TGATCCAGTG CGGAACGACC CGATTTTGCA AGAACTTACG
GCTTGGGCCT ACTATTGGCA AGGGGTAGTT TATGTCGAGG CAGGTCAAAC CAACGCTGCT
CAGTGGTCAT TTGAGCAAGC CCAACGTTTA GCCACAAATG CCGATTTGCA GCAGCGCGTC
CAACAACGGC TCACCCAAGG AGCAGCCAAA TGA
 
Protein sequence
MDELAQHPPT AELAIDKIDK AELIRAFQQK MPGYDPAVVA QAICDLLEQK PTMIDGQALR 
LTINDQTANL ASYNVVGGNL NQTTITVVMP NPIASANTAK LKRGLTRRII ALIVGFIGLL
MPLHASDQFQ ASQQPSSQSM TGDVNIVIAE FTQLNEEIPN QWNDLLAQDL DNLYGMSSFN
RVKIQTDPRF IPSIEAAQQL AQTTNAHMVI YGDTQSSSTG AVVTLRFYVI DPHRINIGEL
INGEHQIGAQ LQLSEADLLA GHSLLDLPDA LILIEFTKAL VYLGLEGTTN LDAALASINQ
ALAAAEIHPN IKGKEVLLLF AAVITTEQCA LEVDDHSACF ARAQRYLDAI LQLNPAYGRA
YLAKANIFYA QGNLFQAMEF FNQAKALPDQ PFGSYIEEKA TFGIGNVCTL QLQYVQQRLG
GSVAANSEAT KLANCALQAY QTLINNYDPV RNDPILQELT AWAYYWQGVV YVEAGQTNAA
QWSFEQAQRL ATNADLQQRV QQRLTQGAAK
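
The sequences above are printed in space-separated blocks of ten, so they need cleaning before any analysis. The following is a minimal sketch, not the database's own tooling: the helper names are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs only in its permitted start codons, which this sketch ignores).

```python
from itertools import product

# Codon-to-residue map in NCBI order (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(blocked)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(blocked: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(blocked)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence in this record.
print(translate("ATGGATGAAC TTGCCCAACA TCCTCCAACA"))  # MDELAQHPPT
```

Passing the full gene sequence to translate should yield the listed protein sequence, and gc_percent can be compared against the GC content field of the record.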