Gene Haur_5210 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5210 
Symbol 
ID5737168 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp301391 
End bp305734 
Gene Length4344 bp 
Protein Length1447 aa 
Translation table11 
GC content58% 
IMG OID641282374 
Producthypothetical protein 
Protein accessionYP_001547965 
Protein GI159901719 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.124073 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCACAGC AGCCGACCGC CGCCTATCAT CGTGAGAGTG AACGCTTGCT CCACCTGCGC 
CATACTGGGT TTGTCGGGCG GGTCGCTGAA CTCGCCGATC TTCAGTCACT GATCGACGAT
CTCCGCCCGA CGGGTGGCTA TGTCGTCGTG ACGGGCGACG CAGGGGTCGG CAAAAGCAGC
CTGCTCGCGC ATGCCATTAT CCAGGCAGGG ATTGACCAAA CGCCCTATCA TTTTATTGCC
CTCACCCCTG GTCGTGCCTA TCAGGTGGAC TTGCTGCGCC ATGTGATTGG CCAATTGGTG
CGCAAGTACC CGGAAGCTGA GGCCGCGTTT TCCGCCGAGA CCTATCCCGC GCTACGCCTC
ACCTTCCTGC GCATCGTGCA TGATCTTGCG GCGCAGGGTG TGCAGGAAAC GATCTATCTT
GATGGGCTTG ATCAGTTGCA GCCCGACCTT GATGGACTGC GGGACATCAC CTTCTTGCCA
CTCCAGCTCC CGCCCGGCAT CGTGATCGTG CTGGGGTCGC GCCCGGATGA GGCGCTTGAT
CGCTTGGGGC TTGACCACGG GGTCGTCTAC GCAGTTCCCC CCTTGCGTGA GGCCGATGTG
CTCCAGCGGT GGCAGCAGGT GCAGCCGATG CTGACCCCGG CCAGCGTTCA GCCGCTGGCC
CGTTCGGTCG CAGGGAATGC CTTGCTGGTC GCGCTTGCCG CCACGCTCCT CGGCCACGCA
GCGCCGACCG TTGTCCCGCC ATGGCTGGCC GAGGCAAGTC GCGACCCAAG CACCCTCTTT
CGGGTGAGCC TCGACCGGAT CAAGCACCAC GACCCAATCC CGTGGCAGCG CATCATTCGC
CCACTGCTCG CCGTCTTGGT CGTGACCCAA GAACCGTTGC TCCCATCGAG TATCGCAGCC
CTGATTGAAC AACCCGTGGA GCAGGTGCAG GCCGTGCTGG GTCTGGTCAG TGAATGGGTG
AGTGTCGCTG CTGATCAGCG GCTAGCGTTA CGCCATTTAA TGTTCTATGC CTATTTAGCA
CAGCACGAAT TCGCTGCCGC CGACGTGCGG GTATGGCATC AGCGGTTGGC GACGTGGTGT
GGGATAGGTC TGGCGACGAT CTGGGAGGAG AGCGGCGATC CAGCGGAACA GGCCCGACGC
TGGTATGCGC GACATCACTA TGTGACCCAT CTAGCCTTGG CGGAACACTG GGCAACGCTG
TGGCAGGTGA TTGATGCAGG CACGTATGGT GCGCACAAGG TGCGGTTTGA TCCCACGCGG
CGACTGTATG GATTGGATAT GGATCGTGCC CGTGAGAGTG TGATCGCCGC GGGTGTCGGT
GCAGCGATTG ACGAGCAGCT GGCCGAATTG CCCCGACTGT GGCGGTATAG TCTGCTGCGC
ACGAGCCTGA CCGCCGATGC GGATGGATGG CTCGATGACA TGTTTGTGAT TGTCGCTGCC
AGTGGCCGAC TGTCGGAAGC CGAGGCCCAG ATTGCACTCT GTTCTGACCC GAAACGCCAG
GTGCAGTTGT GGTCGCGTAT CCTACCATAT GCGGAGCTGG AGCGACGGGT GGCTCTCATC
CAACAGATGG AACTGGTCGC ACGGAATCTG CGCGATGCCG ATGACCGTGA TCGTGCGCTT
GGGACGGTCG GGATGGCCTA TATCGACCAT GGGATGTTTG CGGTGGGGTA TCCGCTGATC
CGTGCGCTCG CACATAAGCG TGATTCACGG TTGTATGCGG CCGTCGTGGA TCGGGTTGAT
CATGGCGATA GTGCGCACGC CCAAGAGGTG ATGGCAGCGA TTCAAACACC AACATACCAA
GTCCAAAGTG CCCTGCTCAT CGCCAAGGCC TTGCTGACGG GTGGAGCCTA CGAGGAGGCC
CATGCGCTTC TGATGCAGAT TCGTCCCATG GCTCGGGATG GCGATCTGAA CACACTCCTG
TGCCTGCTTG CTGATATCCA GTGGGCTTGG GGCAATCTCT CCCAGTCGCA GCTCCTGCTT
GCTGAGGCCG AAGGATTGAT TCCGTCGATT GCTCATTGGG AGCATCCTGC TCGGTTTCGT
ATGATCCTGG ATGGGTATTT AAGGCATGGG GATCGGGTCA ACGCCCTGCG GTGCACGCAG
CAAGGTACTG ATGATAGCGT GCGTTATGAA ATCGTGAAAC TCTATCTTAC GCATGCCGAT
GCGGTAACGG CTTCCCAAGT CGCTCCCATG ATAGCCCATG ATGGGTATCG TGACGGAGCC
TATAAGGCCC TCATTCTCTG GTATTGTCAA CAGCAGGATC TCACGACGGC CCACGCGCTG
CTGGCCTTGA TCGTTGAAGA TCAGCAACAC ATTGAAGGTG CCTATGTCGT GGCAAAGGCC
TATGCCGAAC GCCAGCAATT TGACGCAATG GATGCATGCT TAACCATGGC ACTCGACGAT
GATTTGGGAG AACTTAACTT TTTTGATTTT GATTGTGTCT TCGATATCGT CGAATTGTAC
GCACGTTTTG GGTTTCATGC GCGTGCATGC GCGATTTTAG CGCGAGTCTT TCCCCTGATT
CGTGAGCTTC CTCGCGATTG GCCCGAAGAG TTGCCGGAAT GGATTCGGTT TGCGGAAATC
ACCAACCGCT ATGCCTATGC AGCCTTCTAT GATGACCTTG TCCGGGCCGT CTTTATCCCA
AAAGAACCCC ATTCCTATGC GACGGCGGTT GGCACCATGG CCCAAGGCTA TGCCATGCAT
GGTGATTTTA CGCGTGCTGA GCAGCTCGCT TCTGCACTTA CGTCCCCCGA TATGGCGATA
ACGACGCTGC ATGCCTTGGC GAGGATCGCC CAGGCGGCTC AGAGACCACC CCTTGCTGCC
CACTATTTAA GCGCGGCGCA TGCCCGTTTG AACACGATCG CCGATGCGAC TGCGCAGATT
GCATGGTGTG GGCGGCTTGC AACGACGGCA TGGACAATGG GGTTGACGGA CGCGGCACGC
ATGCTGATCG ATGACGGTCG GCAGCGATGG GCACAGCTCC CCATGCACCA GCAAGGGGTG
AGTTCGCAGA GACTCGTTGA AGGGTATCAG GGGCAGGAGA ACCTTGGTGA AGCCCTCGCG
ATTACCCAAT TAATGAAGGA CTCGCCGTAC TATCAGCAGT TGATCGAGAC GTTGATCGCT
GGGTGTATAA AGGCCCAGGA TCTCGATCAG GCGTATGCCA TCCTGAAACA ATCCCATATC
GCTGTCCCAC AGTACGTCGC CGCACTCTGT ACGATTGCGA TCACGGCCCG TGAACAGGGA
AACATGGCGC TTGCGGCGCA TACATGCGCC GAAGCACTGC TTGCCTGTGA CCGCTATCCT
GTTCGGCGGG AGCGATTGAA TTTCGTAAAA GACTTGGCGA TAAGCCAGTT AACCCATGGG
AGCGATGCCT GCTTACCGCA ACTCTTGGCG ATGCTGCGAT CCAATGACCC ATGCTGGGAT
CTTCCCGCCG ATGTTGCGAT GCGGTGTGCG ATCGCTGCAG TCTATGCGGA TCAGGGTGAT
CGTGCGTCGT TTGCGGAGTG GCTGGGTGCT GCGTATGACG GTACACACCA CGTGTTGCAA
TCAGAAGATG CTAGTCGCTG GGTGAGCATT GCGTATGAAA CGCTTGCCAA AACGTATCTG
GCGTATGCTG ATGAAGCCGC CATCCAGGCG TTTCTTGCGG ATTTCGCGGC GGTCGTTCCA
CGCTGTGTGG AGCGGGGTGA TGCGAGTGTT GCGATGAAGG TCTTAAGTCA AACCTATGCC
CACTATGCCA TCCAATACCA GCCGTCATGG CGTGAAAAAG CCTATCAGGT GGCTCGCGCC
ATTCCAGACC TGTGGGAGGG TGAGAATACG CTTGCAGTGG TGGCAACAGC CTATGTTCGC
GTCCATGATC ATGCGGGGGT GAACATGATC ATCCATGAAA TGCGACAACG GAAATTCACG
CGCCATGAGG CCATGATTCA ACAACACCAT AATCTGTGTA CGATTGCCTT GGCCTATGCC
GAACAGGGCG ATCATGCCCA CGCCGCTGCC CTGATTGCGC CACTTGCCCC ATCGTCTCAT
CGCGATCGCG TGCTGAAGGT GCTGATTGAG GACCATCGTC AGGCTGACCA GTGGGATGCG
GTCTATCAAC TTGTGGATAC CTATTATGAT CGCACCGAAC GGGTAGCGGT TGTTCAACAG
ATTATCACCG CCTATCGTGA ACGACAGCGT ATCCGTGAAA GTATCGCGCT TGTCCACACC
ACGTGGCGCA GATGTAACCA TGCCGATGAA TTATGGAGTA TGAGCGCAAG CATCCTGCCA
TTCCTGCTGA ACGATCCATC GCTTGGCCTA GCCTTGCTTG ATGCGGTGCC ATGGGTTGAG
CAGCAACTGC GACGGTATGT CTAA
 
Protein sequence
MSQQPTAAYH RESERLLHLR HTGFVGRVAE LADLQSLIDD LRPTGGYVVV TGDAGVGKSS 
LLAHAIIQAG IDQTPYHFIA LTPGRAYQVD LLRHVIGQLV RKYPEAEAAF SAETYPALRL
TFLRIVHDLA AQGVQETIYL DGLDQLQPDL DGLRDITFLP LQLPPGIVIV LGSRPDEALD
RLGLDHGVVY AVPPLREADV LQRWQQVQPM LTPASVQPLA RSVAGNALLV ALAATLLGHA
APTVVPPWLA EASRDPSTLF RVSLDRIKHH DPIPWQRIIR PLLAVLVVTQ EPLLPSSIAA
LIEQPVEQVQ AVLGLVSEWV SVAADQRLAL RHLMFYAYLA QHEFAAADVR VWHQRLATWC
GIGLATIWEE SGDPAEQARR WYARHHYVTH LALAEHWATL WQVIDAGTYG AHKVRFDPTR
RLYGLDMDRA RESVIAAGVG AAIDEQLAEL PRLWRYSLLR TSLTADADGW LDDMFVIVAA
SGRLSEAEAQ IALCSDPKRQ VQLWSRILPY AELERRVALI QQMELVARNL RDADDRDRAL
GTVGMAYIDH GMFAVGYPLI RALAHKRDSR LYAAVVDRVD HGDSAHAQEV MAAIQTPTYQ
VQSALLIAKA LLTGGAYEEA HALLMQIRPM ARDGDLNTLL CLLADIQWAW GNLSQSQLLL
AEAEGLIPSI AHWEHPARFR MILDGYLRHG DRVNALRCTQ QGTDDSVRYE IVKLYLTHAD
AVTASQVAPM IAHDGYRDGA YKALILWYCQ QQDLTTAHAL LALIVEDQQH IEGAYVVAKA
YAERQQFDAM DACLTMALDD DLGELNFFDF DCVFDIVELY ARFGFHARAC AILARVFPLI
RELPRDWPEE LPEWIRFAEI TNRYAYAAFY DDLVRAVFIP KEPHSYATAV GTMAQGYAMH
GDFTRAEQLA SALTSPDMAI TTLHALARIA QAAQRPPLAA HYLSAAHARL NTIADATAQI
AWCGRLATTA WTMGLTDAAR MLIDDGRQRW AQLPMHQQGV SSQRLVEGYQ GQENLGEALA
ITQLMKDSPY YQQLIETLIA GCIKAQDLDQ AYAILKQSHI AVPQYVAALC TIAITAREQG
NMALAAHTCA EALLACDRYP VRRERLNFVK DLAISQLTHG SDACLPQLLA MLRSNDPCWD
LPADVAMRCA IAAVYADQGD RASFAEWLGA AYDGTHHVLQ SEDASRWVSI AYETLAKTYL
AYADEAAIQA FLADFAAVVP RCVERGDASV AMKVLSQTYA HYAIQYQPSW REKAYQVARA
IPDLWEGENT LAVVATAYVR VHDHAGVNMI IHEMRQRKFT RHEAMIQQHH NLCTIALAYA
EQGDHAHAAA LIAPLAPSSH RDRVLKVLIE DHRQADQWDA VYQLVDTYYD RTERVAVVQQ
IITAYRERQR IRESIALVHT TWRRCNHADE LWSMSASILP FLLNDPSLGL ALLDAVPWVE
QQLRRYV