Gene Haur_3752 details

Gene Information

Locus tag: Haur_3752
Symbol:
ID: 5735616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 4716715
End bp: 4719036
Gene Length: 2322 bp
Protein Length: 773 aa
Translation table: 11
GC content: 50%
IMG OID: 641280904
Product: hypothetical protein
Protein accession: YP_001546516
Protein GI: 159900269
COG category:
COG ID:
TIGRFAM ID:
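
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4716715
end_bp = 4719036
gene_length_bp = 2322
protein_length_aa = 773

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not code for an amino acid.
codon_count = span_bp // 3
coding_codons = codon_count - 1

print(span_bp, codon_count, coding_codons)
```

Under those assumptions the span matches the listed gene length, and the number of coding codons matches the listed protein length.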


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000190776
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGTG CGCGTTTATT AATGTTCGTG GTGCTAGCTG GCTTGTTAAC GCCGCTGGGT 
GTATGGGCGC GGCCTGTAAC GCCACCCCAA GCCCAGCCGC AAGCCCCGAA TACAGTCAAT
TTTTTCGTAG ATCCTAATTC CTTTGCAGAT ACCATGGTTC CTGGGCAATC GCGCCAATTT
GCATTTCAAA TTCAGAGCCT AAGTAGCTCT AATGAAAGTG CAAATTTTAC ATTTGCTCCA
GAGGGATTAC CACCCCAAAT TACAGTTAAT GCAATTGCTC CTGTGACAGG GGTAAATCCA
AGTGATCAGG TTACAGTTTT GGCTACAATT AATATTGCAG CTACTGCTCC CTTGCAAACG
TATACATTTA GGATTCGTGT TACTGCTACT GGCAATACTA CAGGTGTTGC CAGTTCAAGA
ATTGTTGATT TTCTTATTAA TGTCGTAGCG CCAACAGCAA CTCCAACTCG TACTAATACA
CCTACCGCCA CTGCTCCAAC GGCTACCAAT ACTGGCACGC CTGGGCGGAT TTGCGATGGT
AATGGTGGCA CGGTTAACGA TAAATTTGAG CCAAATAATA CCCGCAGCCA GGCTCGCCGA
ATTGAAGTTG ATGTGCCGCA AGTGCACGCG ATTTGTCCAG TTGGCGACGA AGATTGGCTC
TTATTTGGCG GTTTAGAGGG CAAAGTCTAT ACGATCGATG TTTCGCAAAT GGTCGATGGG
CTGGATCTTT CGTTGACGCT TTACGATAGC AACGGCAATC AATTGGCCTT TAACGATGAC
TTTCCGCGCA ACAATGATCC CAGCGATATC AAGCCACGAA TTCAATCGTG GCGTGCGCCC
GCCAACGGCC AATACTACAT TAAGGTGCGC GATTCCGCCG GACGCGGCTT TATTGATGCG
CTCTACACAG TGGTGTTGAA TAGTGAAAGT TATGGCCCCA CGCCAACGCT CATCCCTGAA
ATTTGTAAAG ATCTCTACGA GCCGGATGGC TTGCCTGAAA TTGCACCGTT GATTGTGGTT
GGCGAAGTTC ATCCCGACCA TCGGTTGTGC CCACGCGGTG ATGCCGATTG GGTCAAATTC
TTTGGCAAAG CTGGCAAAGT CTATTCGATC TTTACCTCGG AACTCAGTGT TGGTGCTGAT
ACAGTGATGG TCTTGGCTGA TCGCGATGGC ACAACGATTA TCGATTTCAA CGATGATTAT
GAATCAGGCT TGGATTCGCG AATCGATTTC GCGCCGTTCG TCGATGGCTT CTATTTTGTG
CAGGTCAAGA ATGTTGGCGA TGTTGGCAAT CAGTTTATCG ATTACACCTT GACCTTCCAG
ATCAAAACTA ATGCCAACCA AGGCGAACCA ACCATGCAGC CAACCGCAAC CTTCGAAGAT
GATATTACGC CAACTTTCGA GGATGATGTT ACGCCAACTA GCGATCCTAA TCGCACGGCA
ACCGCGACTG GCACGGCGAC CCGCACGCCA ACTTCGGCCT ATCCAACCCC AACCACTTCA
TCGAGCAACA AATTGCCCAA CTTCGATACG CGCAGCAATG GCAAATTTGC CGACCCAGCC
TTTAACAAGG TTTGGGCTTA TGCCGATGCT CCAGTGGCGA GTGGCCAAGC AGTGCGCTCT
TGGTTGTGGG GGCCGAGCAG CGGCCAAGCC CGCGCCGAGG TTTACGATCA AGCGCCTGGT
GGTTTGCGTC AAGTGCAATA TTTCGATAAA TCGCGCATGG AAATTAGCGA TTTCGAGGCC
GACCGCCAAA GCCAATGGTT TGTGACCAAC GGCTTGCTGG CGAAGGAATT GATTCAGGGC
CAGATTCAAA TTGGCGATAG CAATTATGTG CAGCGTAGTC CGGCCCAAAT TAATATTGCT
GGCGATTTAG GCGCTGCTTC GGCTCCAACC TACGCCAGTT TTAGCAATTT GCTTGGCGCA
ACCAGCGATC GTACTGGTCA ATTCGCCGAT CAGCAATTAG CGCGTAGTGG CAAAGTGAGC
GCTTATGCTG GCGCTGCAAC TGATGCAGCT AAGTTGGTGC ATTATGTGCC ACAAACTGGC
CACAACATCC CTAGCGCCTT CTGGGATTTT GTCAATCGTC AAGGCTTGGT TAGCCAAAAT
GGCCGTACCC AAAACGGCCA AGTGATGGAT TGGGTTTTTG CTTTGGGCTA CCCAATTAGC
GAAGCCTACT GGGCCAAGGT CTATGTTGGT GGTGTTGAGC AAACTGTGTT GGTGCAAGCC
TTCGAGCGCC GCGTGCTGAC CTACACTCCC AGCAATCCCG CCGATTGGCA AGTCGAAATG
GGTAATGTCG GCCAACACTA CGAACAATGG CGCTACCGCT AG
 
Protein sequence
MKRARLLMFV VLAGLLTPLG VWARPVTPPQ AQPQAPNTVN FFVDPNSFAD TMVPGQSRQF 
AFQIQSLSSS NESANFTFAP EGLPPQITVN AIAPVTGVNP SDQVTVLATI NIAATAPLQT
YTFRIRVTAT GNTTGVASSR IVDFLINVVA PTATPTRTNT PTATAPTATN TGTPGRICDG
NGGTVNDKFE PNNTRSQARR IEVDVPQVHA ICPVGDEDWL LFGGLEGKVY TIDVSQMVDG
LDLSLTLYDS NGNQLAFNDD FPRNNDPSDI KPRIQSWRAP ANGQYYIKVR DSAGRGFIDA
LYTVVLNSES YGPTPTLIPE ICKDLYEPDG LPEIAPLIVV GEVHPDHRLC PRGDADWVKF
FGKAGKVYSI FTSELSVGAD TVMVLADRDG TTIIDFNDDY ESGLDSRIDF APFVDGFYFV
QVKNVGDVGN QFIDYTLTFQ IKTNANQGEP TMQPTATFED DITPTFEDDV TPTSDPNRTA
TATGTATRTP TSAYPTPTTS SSNKLPNFDT RSNGKFADPA FNKVWAYADA PVASGQAVRS
WLWGPSSGQA RAEVYDQAPG GLRQVQYFDK SRMEISDFEA DRQSQWFVTN GLLAKELIQG
QIQIGDSNYV QRSPAQINIA GDLGAASAPT YASFSNLLGA TSDRTGQFAD QQLARSGKVS
AYAGAATDAA KLVHYVPQTG HNIPSAFWDF VNRQGLVSQN GRTQNGQVMD WVFALGYPIS
EAYWAKVYVG GVEQTVLVQA FERRVLTYTP SNPADWQVEM GNVGQHYEQW RYR
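
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the two ends of the gene by hand. The following is a minimal sketch, not a full translator: `PARTIAL_CODON_TABLE` and `translate_fragment` are hypothetical names, and the table deliberately contains only the codons that occur in the two fragments (these assignments are the same in translation table 11 and the standard code).

```python
# First 30 and last 12 bases of the gene sequence above, with spaces removed.
dna_prefix = "ATGAAACGTGCGCGTTTATTAATGTTCGTG"
dna_suffix = "CGCTACCGCTAG"

# Partial codon table: only the codons present in the two fragments.
# "*" marks the stop codon.
PARTIAL_CODON_TABLE = {
    "ATG": "M", "AAA": "K", "CGT": "R", "GCG": "A",
    "TTA": "L", "TTC": "F", "GTG": "V",
    "CGC": "R", "TAC": "Y", "TAG": "*",
}

def translate_fragment(dna):
    """Translate an in-frame DNA fragment codon by codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    codons = [dna[i:i + 3] for i in range(0, usable, 3)]
    return "".join(PARTIAL_CODON_TABLE[c] for c in codons)

print(translate_fragment(dna_prefix))
print(translate_fragment(dna_suffix))
```

The first fragment translates to the opening ten residues of the protein sequence, and the last fragment gives its final three residues followed by the stop codon.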