Gene Haur_3007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3007 
Symbol 
ID5734894 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3797709 
End bp3799439 
Gene Length1731 bp 
Protein Length576 aa 
Translation table11 
GC content51% 
IMG OID641280151 
Producthypothetical protein 
Protein accessionYP_001545773 
Protein GI159899526 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGGA TTGTGATCCT GCTGGGCTTG ATTTTGAGCC TGATTCTCAT CCCCTCGAAT 
TCGCCCTCAA CCAGTGCCGC CGAGCCTTCG CGTCCGGTTT TTGCCTACTA CTATGGCTGG
TGGGGCAGCC AAACTTGGTA TCTCGACAAG ATTCGCGACC GCCCGCTGGA ACTCTACGAG
AGCGACCGCG ATAGCACCAT GCTCAACCAT ATTCGCCAAG CCAAAAATGC TGGCATCGAT
GGTTTTATCT GTACATGGCG CTATACCTGT GCCCGCTTGT TGCAACTGGC TGAGCAAGAA
GGCAATTTCA ACGTGGTTTT CAGCGTTGAT CCGGTAGCTG ATGGCACGCT CAACTCAACT
CAGGCAATTG TCGATAATAT GCGCGAAATG GCTGGCCTCG CCAGCAGCAA TGCCTATTGG
CGCTGGGATG GCAAGCCAGT TTTCGTGTTT TGGAATGATA CAATTTTGCC TGGTGGCCGT
GGTTCGCTCA GCGATTGGAC CGATCTGCGC AGTCGGGTTG ATCCCAATCG CAATCAATTT
TGGCTCGGGG GCGGGGTAAA CTTCAGTCTG CTCGATGTGT TCGATGCGAT TCACTTTTTC
GATATTACCT GGGAACGCAA GCAAGGCGAT GCGATGATTT CGTATAGCCG CAATTTGCGC
GAATATAACA GCAGCCGCAA TAGTAACAAA CCATTTGTCG CGACTGTTAT GCCTGGTTAT
GATGATTTGC TCTATCGCAA CGGCCACTTC CGCGACCGCG AAAACGGCAA CTACTATCGC
GCTGGCTGGG ATACGGCGAT CAATTATCAG CCCAAGGCGA TCATTTTAAC GAGTTGGAAT
GAATGGTATG AAGGCAGCCA GCTTGAGCCA AGCCAAGCAT ATGGCAACCT CTACCTCGAT
ATTTCACGCG AAAAAATTAG TGCCTTCAAA AATAATGTGC CGCCAATTCC CGATGGCTTT
GCCGATACCA ACTTTGAAAA AACCTGGCAA CGCACTGATA AGCCCGTCCA AGATGGCCGC
ACCAGCCGCT CATGGGTTTG GGGGCCAGCA ATCGCCAAAG GCCAGTACGA GCCATATGGT
GGCAGCACGC GGCTCGTCCA ATATTTCGAT AAATCACGTA TGGAAGTCAA TAATCCCAAT
GGCGACCGCA ACCAGCCATG GTTCGTTACC AACGGATTAT TGGTGGTCGA GATGATTCGC
GGCCAAGTCC AAATTGGCGA TAGCTCGTTT GAGCAACGTA TCCCTGCTGA TGAAGCTTTG
GGCGGCGATC CACGGGCAAT TAACGATGTT GCGCCTGGCT ATAGCTCACT GCGCAATGTA
TTGGCCAGCC AGCCCGATAA AACTGGCCAA ACTATCAGCA CCCAACTCAA GCGCGATGCG
ACGACCAGTG GTATTACTGC GCCAAGTGTT GTCACCAACG CCCATTATGT CAGCCAAACC
CAGCATAATA TTCCCGATGT CTTCTGGAAT TTTATGAACC AAACGGGCAC GGTTTATGAA
AATGGCCGCT TTGTCAACGA TAAACAAGTT GTTGATTGGG TGTTTGCTTT CGGCTACCCA
CTAAGCGATG CCTACTGGAT GCGGGCCAAA CTTGGTGGCA CAGAAACATG GATGTTAGTC
CAAGCCTTTG AACGGCGGAT ATTGACCTAT ACCCCAACCA ATGCACCTGC CTTTCAGGTT
GAGATGGGCA ATGTAGGCCA ACATTATCGC CGCTGGCGCT ACGGCAACTA G
 
Protein sequence
MRRIVILLGL ILSLILIPSN SPSTSAAEPS RPVFAYYYGW WGSQTWYLDK IRDRPLELYE 
SDRDSTMLNH IRQAKNAGID GFICTWRYTC ARLLQLAEQE GNFNVVFSVD PVADGTLNST
QAIVDNMREM AGLASSNAYW RWDGKPVFVF WNDTILPGGR GSLSDWTDLR SRVDPNRNQF
WLGGGVNFSL LDVFDAIHFF DITWERKQGD AMISYSRNLR EYNSSRNSNK PFVATVMPGY
DDLLYRNGHF RDRENGNYYR AGWDTAINYQ PKAIILTSWN EWYEGSQLEP SQAYGNLYLD
ISREKISAFK NNVPPIPDGF ADTNFEKTWQ RTDKPVQDGR TSRSWVWGPA IAKGQYEPYG
GSTRLVQYFD KSRMEVNNPN GDRNQPWFVT NGLLVVEMIR GQVQIGDSSF EQRIPADEAL
GGDPRAINDV APGYSSLRNV LASQPDKTGQ TISTQLKRDA TTSGITAPSV VTNAHYVSQT
QHNIPDVFWN FMNQTGTVYE NGRFVNDKQV VDWVFAFGYP LSDAYWMRAK LGGTETWMLV
QAFERRILTY TPTNAPAFQV EMGNVGQHYR RWRYGN