Gene Haur_3993 details

Gene Information

Locus tag: Haur_3993
Symbol:
ID: 5735854
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5093823
End bp: 5096441
Gene Length: 2619 bp
Protein Length: 872 aa
Translation table: 11
GC content: 50%
IMG OID: 641281143
Product: hypothetical protein
Protein accession: YP_001546753
Protein GI: 159900506
COG category: [S] Function unknown
COG ID: [COG1572] Uncharacterized conserved protein
TIGRFAM ID:
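
The coordinates and lengths in this record are mutually consistent, which can be checked with a short sketch. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumptions: coordinates are 1-based and inclusive, and the CDS ends
    with exactly one stop codon that is not part of the protein.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the terminal stop codon
    return gene_len, protein_len


# Start bp and End bp taken from the record.
gene_len, protein_len = cds_lengths(5093823, 5096441)
print(gene_len, protein_len)  # 2619 872
```

The result matches the listed Gene Length (2619 bp) and Protein Length (872 aa).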


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAACAC GTCTGCTGCG GTGCTGCTTG CTGATCGGTC TATTGTTAAG TGCCATGCTT 
CCGCCAATCG TGGAGGCAAC ACCGAAAGCA GCGCCATTGG TGGCCATTTA TATTTTGGCC
TACGATAATC GGCTTGATAG CACGATGAAT TTAACCCCGT ATTATGATGC AACCCTCACT
AGCATCACTA ATGCCACTGT TGGTCAGCCC GATCTAACGG CGATTGTGTT GGCTGATTTG
GCGGGAATGA ATGATACCCA TGTGCGGGTT GTGCAGAATG GCAATGTCAA TACCTTGATT
GGCTTGCCCG ATATTGATGG AATTATTGAT AGCAACCTTA AAGAATATGA TGTAACTGAT
GGCCGAACCT TGGGCGGGTT TTTATTGTGG GCCAAAAGCA GTTACACTGG CCAAAACTAC
ACCTTGAGCT ACATTGGCCA TGGTGTGCCA ATTATGCCCG ATATCGAGAT TTCAAACCTC
AGCCAGCCCG AACGCCCAGT CAGCAGCGTC AATCTGCCAC CGTTGCCCAC CCGCATTGGG
GCCAATGCCG ATGTGACCGA TCACACGCTG ACCACGAGTA TCAGCGGCTA TAATGCGCTT
TCGCCCAATG ATTTGGCCTT GGCTTTGGCA ATTGCGGCTC CGATTGGCCC TCGCTTGGCC
GTGCTCGATG TGTTGATGTG CTTTAGTGGC TCGATTGAGG CCTTATACCC ACTTGCACCG
TATGCCGAAT ATCTGACTGC CTCGCCCAAC TATGCCTTTT TCGACCCAAC CATGCCCGGT
AATGCCTTGC TTGGCTTGAA CAGCAACCCC AACCCGCTAC AAATGGCCCG ACATATTCTC
GATAGCTACC ATAATCAACT GCCAAGCAGC GATCATCCAC GGATTTTAAG CGTGATCGAT
GCCGATCAAT TAAATAATCT CAAAACCACG TGGGATAGTG CTTCGAACGC CATCTATACC
AATTTGCTCA ATCCCAGCCA GCGCGAAGTT ACCCGCACAG CCTTGTTTAA TGCCTACCTT
GAGAGCCGAA AATACGATCT GACCTATTGT GAGCCAAGCG ATTGGGAGCT GAATGCGCCC
GATGGCTTGG TCGATTTGCG CAGTTTTGCT CATGGTTTGA GTCAAAGTTT CGCCAATCTC
AATCCGCAAG TTGCTAGCTT TGCCGCCCAA ACCCGCGATC GGATTCGCAA TAATAGCGGT
AATCCGATGG TCGTGGTTTA TCGCTTGGCG GATAACGATT TTCCGTGGTT CGACCCAACT
CCAACCCAAT GGATTTTTGA TGGACTGAAT CCACTTGGGC TTGATGATGA TGCCGCTGGC
TTGAGTTTAT ATGCCGATTT GCAAGGCCTC TCGGTGGCGG GAGCAACCGA ATTAAGCTGG
CAGGCCCATT GGTATCACGA TGACGACACC CAGCCAGATA ATCCGCATCC GTTAGCATTT
TTAGCCGATG TGACCCATCG CAACGGCTGG GATGAAGTGT TTCAGGAATA TTGGCGTGAT
ATTGAGGTAC AAACAGCGTT GTGTACGCCG AGCTTACCTG CTGCCCGCGA TCAATCGACT
CCTCGCGCCG ACATCAGCCT AAGCCAATTT AACCCTGCCG ATAGCAACTT GGCGGTCAAT
GAATCGATTC GTTTGAGTGT TATGCTGAAT GTCACTCGCG CAGTGCAACG TAGCGATCTC
TGCTTTGAGG TTGTCTTGAA TGGTACAGTC GTATTTACTG ATAGTTTGAT GTTGACCAAA
CTTGAGGCTG GTAGCCAACG AATCTATGCG CAAAAAATTT GGCAACCAAC GACTGCTGGA
GTTTATAGCT TGCGGGTCGT TGGCGATGGC GGCCAGCATG TGCAAGAAAG CAACGAAAAT
AATAATGTGC TTACCCGCTC GCTGAATATT GCGCCAACCG TGCCGCGTCG CCCAATGCTT
ATTGTCAAGA CCTCGAACAA TCTGCAATTA TCCAATAGCC CAACCGTTAC GCTGAATGTT
CAGCAGCAAG CGGGCAGCGG TACACCAGTT TCAAGGGTGA TTATCCAAGC CTATCAATAT
CAGGGCAATG CTGCCAACCC GCGCTTGCAA ACGCCAGTGT TGCGGGCTAC TACCACGATC
AATCAACCAA CTTTACCAAC AGTGCAATTA AGCGTAGCAG GCTTAGATCC TGGTGCGGTG
GTCTTGTATG TTTGGGGCTA TTCGAGCAAT GGCTATAGCT TGATTCCGGC CATTGTGCGG
CTAAATTATG CGCCGTTGCC CGCAACGATC AGCCAAAACC AAAAACATAT CTATCGATTT
AGCCTCAAGC GCGACCAAGC CCAAGCCTTT CGTTTACAAA GCCAGTTTGG CAATAGCAAT
TTGCACGCAT GGGAACCATA TATCTGGACT GCACCAACTC AACAATCAAC CAGCCTTGGA
TTGGATCAAA TTAGCTATAA TCCTACACCG CTAGCTGGCG AATACATCGT GCAAGTAAGT
AGTAGTGAGG CTGGTCGCTA TCTGTTTACG GCGATACCTA ATCCGCCAGC AGGTCGCAAC
TTTGAAACGA TCACCAGTAA ACAACCACGG CCAATCTTTG AGGAGCCAAT CCCATTCTTG
CCAATCGACC AACTGTTTAT CCCAATAGTC CAACGTTAA
 
Protein sequence
MGTRLLRCCL LIGLLLSAML PPIVEATPKA APLVAIYILA YDNRLDSTMN LTPYYDATLT 
SITNATVGQP DLTAIVLADL AGMNDTHVRV VQNGNVNTLI GLPDIDGIID SNLKEYDVTD
GRTLGGFLLW AKSSYTGQNY TLSYIGHGVP IMPDIEISNL SQPERPVSSV NLPPLPTRIG
ANADVTDHTL TTSISGYNAL SPNDLALALA IAAPIGPRLA VLDVLMCFSG SIEALYPLAP
YAEYLTASPN YAFFDPTMPG NALLGLNSNP NPLQMARHIL DSYHNQLPSS DHPRILSVID
ADQLNNLKTT WDSASNAIYT NLLNPSQREV TRTALFNAYL ESRKYDLTYC EPSDWELNAP
DGLVDLRSFA HGLSQSFANL NPQVASFAAQ TRDRIRNNSG NPMVVVYRLA DNDFPWFDPT
PTQWIFDGLN PLGLDDDAAG LSLYADLQGL SVAGATELSW QAHWYHDDDT QPDNPHPLAF
LADVTHRNGW DEVFQEYWRD IEVQTALCTP SLPAARDQST PRADISLSQF NPADSNLAVN
ESIRLSVMLN VTRAVQRSDL CFEVVLNGTV VFTDSLMLTK LEAGSQRIYA QKIWQPTTAG
VYSLRVVGDG GQHVQESNEN NNVLTRSLNI APTVPRRPML IVKTSNNLQL SNSPTVTLNV
QQQAGSGTPV SRVIIQAYQY QGNAANPRLQ TPVLRATTTI NQPTLPTVQL SVAGLDPGAV
VLYVWGYSSN GYSLIPAIVR LNYAPLPATI SQNQKHIYRF SLKRDQAQAF RLQSQFGNSN
LHAWEPYIWT APTQQSTSLG LDQISYNPTP LAGEYIVQVS SSEAGRYLFT AIPNPPAGRN
FETITSKQPR PIFEEPIPFL PIDQLFIPIV QR
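
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of how one might normalize the gene sequence, estimate its GC fraction, and translate it. The helper names (`normalize`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard one, which translation table 11 shares for amino acid assignments; table 11's alternative start codons are not handled here, which is harmless for this gene because it starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; '*' = stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def normalize(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in an already normalized DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGGGAACAC GTCTGCTGCG GTGCTGCTTG"
print(translate(normalize(first_blocks)))  # MGTRLLRCCL
```

The printed peptide agrees with the first ten residues of the protein sequence listed above. Passing the full gene sequence through the same functions would be the way to compare against the listed protein and GC content.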