Gene Haur_2537 details

Gene Information

Locus tag: Haur_2537
Symbol:
ID: 5734415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 3242462
End bp: 3244876
Gene Length: 2415 bp
Protein Length: 804 aa
Translation table: 11
GC content: 50%
IMG OID: 641279677
Product: hypothetical protein
Protein accession: YP_001545303
Protein GI: 159899056
COG category: [S] Function unknown
COG ID: [COG5427] Uncharacterized membrane protein
TIGRFAM ID: [TIGR03662] Chlor_Arch_YYY domain
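
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record itself, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3242462
END_BP = 3244876
GENE_LENGTH_BP = 2415
PROTEIN_LENGTH_AA = 804


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of residues, assuming one trailing stop codon is not counted."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 2415
print(expected_protein_length(GENE_LENGTH_BP))  # 804
```

Under these assumptions the span, gene length, and protein length reported in the record agree with each other.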


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGCTT GGTGGCTAAG CATCGTTTTA ATTGGAACTC TCGCGCTGCC TTTGAGTATG 
CGCCTATTTG GGCATTTGCC AGGGCGGGGC TTGGCATGGA GCAAAGCGCT AGGCTTGTTG
GTTGTTGCAT GGATCGCTTG GATGGGCGCT ATGCTCAATC TTTCGGGCTT CGATGGCGTG
ACGGTTGGCT TGGGCTTAAT TGGCTTGGGC ACACTCGGTT GGTTTGTGCA ACAACCTTTT
GATAAAGCCC GTTTGTTGGC AGCAATTCGC CAATATTGGC CGCAATGGCT GGCCTATGAG
TTGTTGTTTG CTTTGGTTTT TTTGCTGGGC ATTCAATTGC GCTTCCATGG GATGTTTGGC
TCAGGGATTC ATGGCACCGA AAAACCAATG GAATCAATGC TGTTTAGCGC TGTGCTGAAT
AGTCCAAGCT ATCCACCAAG CGATTTGTGG TTGGCCGGAT TTAGCGTTAA TTATTATTAT
TTTGGCTATG TGCTGATCAG CGTATTGAGC GTGCTCAGTG GCGCGACGCT TGGCGAAACG
TTTAATCTTG GTTTGGCCAC GATTATGGGC TTGGCCAGCC TTGGGATTGT TGGTTTAGTC
ACAACTATGG CGGGTTTGTG GTGGCGTGAG TATTTGCTAT CCCGTCTCCG ATTAATCGCA
ATTGCTGCGC TTGGTTTGTT TGGTGGCGTA TTGGTGCTGT TTGCAGGCAA CCAAATTGGC
GCTTTACAGA AGATCGTCAA TTCTGCGGAA GTTAATCGAC TAACCGATAG CCAACGGGTT
TCAGTCTTGT GGCAAGCAAT TCAGGGGGTT GAGCCAGCCA CGCTTGATCC CGCCACGCTA
AAATCAGAAA ATACTGGCAT TTCTAAAAGC TCGACCTTGC CACCAATGGG CGAGAAATTT
GAAGCTTGGC CATCGTCGCG TGCAATCTAC GATGATCACG AAGAAACCAT GATCATTGTC
GATAATCAGC AACGTATGGG GATGTCTCAG CGTGAGATTA TCACCGAGTT TCCCTTCTTC
AGCTTTTATC TTGCCGATAT GCACCCGCAT GTTTTGTCGA TTCCGCTGAC GTTGTTGGCG
ATTGCTTTGG CCTTGGCGAT TTTTGTCCGG CCAGCGATGG TCCGTTTCCC AAAGCACGAT
TGGCTTGAAT TGGCGATAGC TGGCTTGGTG ATTGGCGGCT TATATGCAGC CAATTCGTGG
GATGCCCCGA CCTTTGGTGT GTTGTATGCC TTGGGTTTGG TTGGCTTGTG GCGTGGGCAT
ACACCGCAAC CAACCCGCCG CGATTGGCTA CAGCTTGCAG GTCAAGTTGG TTTAGTGGTC
TTGGCGGCGG CATTGTTGTA TATGCCGTTC CTGCTCACAT TTAGCTCATT TGCAGGCCGC
GATACCGTGC CCGACCCATT TGCCAGTATT CCAATTATTG GCAGCTTGGG CAAAATTATG
GCTCCCGCCC GTGATCACTC TGGCTGGACT GATTTAGTGG CGATTTTTGG CTTGTTCTTA
GTGCCGATTA TTGCTTGGCT CAGCCGCACG ATCAAGGTTT GGCAATTATG GGCTATGACT
GGCGCGGTGC TGCTGATTGG CTTAATTGCC GGCATTCCGG CGATTGTCTT TTTGCCGATT
GCCGTGATCT GTTGGCAAAC AGCTTGGCAA CGCAATCAGC GCGATGTGCA AAACTTTAGC
TTGATCGTGG TTGGTTTGGC AGCCTTATTG ATTGTAGTTG TCGATTTTCT GTATCTGCGT
GACATTTTTG ATAATCGCAT GAACACGGTT TTCAAGGTCT ATTATCAGGC TTGGATGCTG
TTGGGAATTG GTGCTGCTGC TAGTATTTGG GGCTTGTTGA GCAATGCCCA ATGGCGACGC
TGGACGAATG GCATTTGGTT GCCATTATTT GGGCTTTTGT TGGCTGGCGG CTTAGTCTAC
CCAATTTCAG TGCTTAACCC TACAACTTCG CCCTCGTGGG ATGCAAGTGG CTCGAAGCTT
GATGCAGTAG AAAGTTCCCA ACACTTTTCC GAGCCAATGC GCAAAGCCGC TGCCTGGCTT
GAAGCCAACA CACCAAGCAA CAGCGTTTTG GCGACTGCAC CTGGCAGCAG CTATCAAGAT
GGCGGTGAGT TAGCAACCTT GAGTGGTCGG CCAACCTTAT TAGCTTGGCC CGGCTCGCAT
GAAGGTTTAT GGCGCAGCAA ACAGCCTGAT GCAAATCAGC AAGTGGCGCA ACGCCAAGGC
GATATCAGCG CAATTTACAA TGCCACCGAT ATCAATCAAC TGCGCGAAGT TTTGGCTCGC
CAGCGGGTCG ATTATGTGGT GTGGGGGCCA AACGAGCAAA AAGCCTATCC ACAGGCCAAT
ATTGGCTTGC TCGAACAGGT TGCCAGCAAA GTTTACGAAG CCGATAGCTG GATCATCTAT
CAAGTACAAC CATAG
 
Protein sequence
MIAWWLSIVL IGTLALPLSM RLFGHLPGRG LAWSKALGLL VVAWIAWMGA MLNLSGFDGV 
TVGLGLIGLG TLGWFVQQPF DKARLLAAIR QYWPQWLAYE LLFALVFLLG IQLRFHGMFG
SGIHGTEKPM ESMLFSAVLN SPSYPPSDLW LAGFSVNYYY FGYVLISVLS VLSGATLGET
FNLGLATIMG LASLGIVGLV TTMAGLWWRE YLLSRLRLIA IAALGLFGGV LVLFAGNQIG
ALQKIVNSAE VNRLTDSQRV SVLWQAIQGV EPATLDPATL KSENTGISKS STLPPMGEKF
EAWPSSRAIY DDHEETMIIV DNQQRMGMSQ REIITEFPFF SFYLADMHPH VLSIPLTLLA
IALALAIFVR PAMVRFPKHD WLELAIAGLV IGGLYAANSW DAPTFGVLYA LGLVGLWRGH
TPQPTRRDWL QLAGQVGLVV LAAALLYMPF LLTFSSFAGR DTVPDPFASI PIIGSLGKIM
APARDHSGWT DLVAIFGLFL VPIIAWLSRT IKVWQLWAMT GAVLLIGLIA GIPAIVFLPI
AVICWQTAWQ RNQRDVQNFS LIVVGLAALL IVVVDFLYLR DIFDNRMNTV FKVYYQAWML
LGIGAAASIW GLLSNAQWRR WTNGIWLPLF GLLLAGGLVY PISVLNPTTS PSWDASGSKL
DAVESSQHFS EPMRKAAAWL EANTPSNSVL ATAPGSSYQD GGELATLSGR PTLLAWPGSH
EGLWRSKQPD ANQQVAQRQG DISAIYNATD INQLREVLAR QRVDYVVWGP NEQKAYPQAN
IGLLEQVASK VYEADSWIIY QVQP
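
The protein sequence follows from the gene sequence through translation table 11, the bacterial, archaeal, and plant plastid code named in the record. The following is a minimal sketch rather than the database's own method. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical, the input is assumed to be the forward coding strand in frame, and the special handling of alternative start codons in table 11 is not implemented (this gene starts with ATG, so it is not needed here).

```python
BASES = "TCAG"
# Amino acid assignments of NCBI translation table 11, with codons
# ordered TTT, TTC, TTA, TTG, TCT, ... ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last rows of the gene sequence listed above.
first_row = "ATGATTGCTT GGTGGCTAAG CATCGTTTTA ATTGGAACTC TCGCGCTGCC TTTGAGTATG"
last_row = "CAAGTACAAC CATAG"

print(translate(first_row))  # MIAWWLSIVLIGTLALPLSM
print(translate(last_row))   # QVQP
```

The two translated rows match the first 20 and the last 4 residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content reported in the record.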