Gene Haur_1784 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1784 
Symbol 
ID5733686 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2072262 
End bp2074331 
Gene Length2070 bp 
Protein Length689 aa 
Translation table11 
GC content50% 
IMG OID641278927 
Producthypothetical protein 
Protein accessionYP_001544555 
Protein GI159898308 
COG category[S] Function unknown 
COG ID[COG3299] Uncharacterized homolog of phage Mu protein gp47 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGACGATTC ATAAAAAACA ATTTAATGAT ATTTATGCCA GCATGGTTGC CGATTCGCGC 
CAGCGCTTGC CACGGCTCTC CGATTTTGAA GAAGGCAGCG TAGTTCGTTC GCTGTTTGAA
TCGTTTGCCT ATGAGCTAGC CGTGCTGTAT GAGCAGATGG ATCTGGTTTA TCAGGCTGGC
TTTATCGATA CCGCTGAGGG GGCACAACTT GATCGGGTGG TGGCGATTTT AGGCATCAAG
CGCAACGAGC CAGATTTTGC AACAGGTAGT GTTACCTTTC AACGCGATAG CACCAGCGAA
GAAACCTTGA TTCCAATTGG CACGCTGATT ACCACCAAGG AAGATCCCAA ACAAACGCCC
TCCAAAAAGG CCTATATCAC AACTGAAGAG GGGCGAATTG CCCCAGGTAC AGCCACGGTT
GAGGTGCATG TACGGGCCGA AGAGCGCGGC AAACACATGA CTACCGCTGA GCAAACGGTG
ATCGTGATGC CCCGCCCATT GCCTGGCATC AAATCGGTGA CCAATCCTAA AGCAATTGGC
TTTCGTGGGC GTGAGCGCGA AACCGATCAA GAATTGCGTG AGCGTGCCAA ACAAACCTTG
CTGGCTTCAG GCCGCGCCTC AAGTTTATCG ATTGAAAATG CCTTGTTGAG TTTGCCCGAA
GTCCGCGAAG TGCGGATTCA CGAAGATTTT CATAATAATC CTGAAGGCCG CCCAGGCTCG
ATCGAAGTCT TTGTTGATGG CATGAACGAG CATAACGAGC GCCAATTGCG TCAACGTTTG
GATGAGGTGC GGGCCGCCGG GATTTATGTG GTGCTTAAGC CAGCAGCACC GATTAATGTT
GATGCAGTGA TTCATATTGC AGCGAACGAG CAGATTGCGA ATGCTGAAAA AGGCCTGCTT
GAAACCCAAG TGCGCGAAGC AGTTGAGGCC TACTTTGGCC GTTTGGGTAT GGGCCAGCCC
TTGCTTTTCT CGCAAATTAC CAGCGAAATT TTGCAAGTCA AAGGCGTAAA CGATCTGATT
AATTTGGAAT TGCAGCTTTA TCGTGAGGCC GAAAGCAAAG CCCAAGGCCT TGTGCAGTTG
ACTCGTAGCA CTAATTTGAG CCAAACCTTA TTGATCGAGT TGCCATGTGA ATTGCGCACG
CTAGATAATC AACGCTTTTT GGCAACCAAT CCTGCAAGTT TTGCTCCCAA CCAAGCGATG
ATTGAGCTTC AAGTGCAGGC GATGTTGGCT GGACGCAGCG GCGAACTAGC AACCAGTGCC
GCCTTGTGGG AAGAATTGGT GATCAACCAA ACTAAATTGA CGATTAGCAA TCCCCAGCCA
ATTTTATTGC AACGCACCAA ACACACGCCC CAAGATAAGC GTTTGGAAAG CGCCATTTCC
GAGCTTTTCA ACGCTGCTAG CATTCGAGTA GCGGCTGAGC CAAAGGATTT GCCAATTTTA
ATCTTTATTA AATTAGTTGA GCCAGGCTTG GAGCGCGACG AGAAACGTCG CAAAATCGAA
GGCGCAATTC AAGAATATAT TGCTGGTTTG ACGCTTGGCT CAGCCATCCA AGCTAGCGAA
ATCGAGAAGA AAATTCGGCT CTATCATCGC AAAGATTTTA GCTTGCGCAT GTTGGCAAAG
CCCTTTCAAA GCAACGAACG CCCCGAAGAT AGTGCGATTC AGGTCAGTAT TATTGAACGG
CCTGTGTTGG CTAACCTGCT GATCTATAGC GAGCGCGTCG AATTAACTGG CAAACTTGAG
TTGACGGTTG CGCCCACGAC CAGCGATAGC CAACGCCGCC AAATTTGCTA TGAAGTGCGA
CGGGCAATTA TGGCCTATCT CGATGATTTG GCTCCAGAGC AGGATTTAGA GCTAGCCCAA
ATCGAAGCGC TTGCCAAAGC CCAGCTTCAA GTATTGCAAG TTGCTTTCAA CCCTAAACAC
TGCCAGCTCT ACAATGTCTC GACCAATCCA GTTGGTGTGC TGGCTGAGCG CAATAACGGC
AAAGTCGTCA AAATCGCCAG CTTCGAGAAG GTTTTTCTTG CCAGCGATAG TGCCAGCGAT
CCGGCGCTGC GCTTTGTGAT ACAGGCTTAA
 
Protein sequence
MTIHKKQFND IYASMVADSR QRLPRLSDFE EGSVVRSLFE SFAYELAVLY EQMDLVYQAG 
FIDTAEGAQL DRVVAILGIK RNEPDFATGS VTFQRDSTSE ETLIPIGTLI TTKEDPKQTP
SKKAYITTEE GRIAPGTATV EVHVRAEERG KHMTTAEQTV IVMPRPLPGI KSVTNPKAIG
FRGRERETDQ ELRERAKQTL LASGRASSLS IENALLSLPE VREVRIHEDF HNNPEGRPGS
IEVFVDGMNE HNERQLRQRL DEVRAAGIYV VLKPAAPINV DAVIHIAANE QIANAEKGLL
ETQVREAVEA YFGRLGMGQP LLFSQITSEI LQVKGVNDLI NLELQLYREA ESKAQGLVQL
TRSTNLSQTL LIELPCELRT LDNQRFLATN PASFAPNQAM IELQVQAMLA GRSGELATSA
ALWEELVINQ TKLTISNPQP ILLQRTKHTP QDKRLESAIS ELFNAASIRV AAEPKDLPIL
IFIKLVEPGL ERDEKRRKIE GAIQEYIAGL TLGSAIQASE IEKKIRLYHR KDFSLRMLAK
PFQSNERPED SAIQVSIIER PVLANLLIYS ERVELTGKLE LTVAPTTSDS QRRQICYEVR
RAIMAYLDDL APEQDLELAQ IEALAKAQLQ VLQVAFNPKH CQLYNVSTNP VGVLAERNNG
KVVKIASFEK VFLASDSASD PALRFVIQA