Gene Haur_4051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_4051 
Symbol 
ID5735909 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp5168589 
End bp5169791 
Gene Length1203 bp 
Protein Length400 aa 
Translation table11 
GC content52% 
IMG OID641281202 
Producthypothetical protein 
Protein accessionYP_001546811 
Protein GI159900564 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAAAGC TTAAAAATTA CGTAGTCTCC TTGGATGATG TGTCCACCGT CGAAGAACTC 
AAGGCCGCCT TGCGTGAACG GCGTTTGGTT GGCGTGACCC AACTAACCCT CAATGGGCGC
AATTGGGGTG ATGCTGTCGC GCATAGCTTG GCTCGCTCAC CGCATATGGC CACCGTGCGC
GTGCTTGACC TCAGCGAAAA TCAATTAACC GATGAAGCTG TGATTGCCTT GGCGCAATCG
CCCCACTTAA GCCAATTGCG CACGCTGCGC TTGCAACGCA ACCAAATTGG CGATGCTGGC
TTGATTGCCT TGGCGCAATC GCCCTACCTG AGCAACCTCA ACGAATTAGA GCTGTATCAA
AATGCAATTG GCGCGGCGGG TGCTCAAGCA CTCAGCCATT CGACCAGCCT AACCCAATTA
ACCAAACTAC GGCTTTATCG CAATCAACTG GGCGCAGCTG GCGGCGCGGC CTTGGTCGAA
GGTCAAGCCT TGGTTAATCT CCAGTTGCTC GATATTGATC ACAACGATTT GGGCGATGCT
GGAATTAGTG CTTTGGCCGC CGCCCAACAT TTGACCCAAC TCACCTCGCT CAATTTAAGC
AACAACAATA TTGGGCCAGC AGGGGTGAAG GCGCTGGCCG AAGATGAACA ATTGGGCAAC
GTAACCCAGC TTGATCTCAG CGATAATGAT GTGGGGTTTG CTGGAGCCAA AGCCTTAGCC
GAATCGCCCT ATCTGCGCAG CATTGTGACG CTCAATTTGG CCTCAACCAA TATCGATGCT
GCTGGCATTA GCGCCCTTGC CAATTCGCCA GTGCTTGCCA ACACCGAACG CCTTGATTTG
CGCTCGAATG CAATTGGCGA TGCTGGGTTG AGTGCCCTAG CTCAATCGAG TCATGTCACT
AATCTCAAGG CCTTGTTGCT CAACGATAAT GCGATTGGCG ATGCTGGAGT GCAAGCCTTG
ACCACCAGCA CCAGTCTTGG CAATTTGGCG GTGTTGTATC TCGCCGATAA CGCAATTGGT
GATGCTGGCG CAACCAGTTT GGCCAACACA ACCAACTTGG GCCAATTGCA AGAGCTTGAT
TTATGGGGCA ATCAGCTAAC CCGTAGCAGC CAAGAAGCCT TTAGTCAATC GCCAACGCTG
AATAATTTGG TGCTGCTCGA TTTAGGCATC AGCGACGACG AAGACGAGGA CGATATCTTC
TAA
 
Protein sequence
MAKLKNYVVS LDDVSTVEEL KAALRERRLV GVTQLTLNGR NWGDAVAHSL ARSPHMATVR 
VLDLSENQLT DEAVIALAQS PHLSQLRTLR LQRNQIGDAG LIALAQSPYL SNLNELELYQ
NAIGAAGAQA LSHSTSLTQL TKLRLYRNQL GAAGGAALVE GQALVNLQLL DIDHNDLGDA
GISALAAAQH LTQLTSLNLS NNNIGPAGVK ALAEDEQLGN VTQLDLSDND VGFAGAKALA
ESPYLRSIVT LNLASTNIDA AGISALANSP VLANTERLDL RSNAIGDAGL SALAQSSHVT
NLKALLLNDN AIGDAGVQAL TTSTSLGNLA VLYLADNAIG DAGATSLANT TNLGQLQELD
LWGNQLTRSS QEAFSQSPTL NNLVLLDLGI SDDEDEDDIF