Gene Information |
Locus tag | Haur_4077 |
Symbol | |
ID | 5735935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 5204113 |
End bp | 5208009 |
Gene Length | 3897 bp |
Protein Length | 1298 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641281228 |
Product | TPR repeat-containing protein |
Protein accession | YP_001546837 |
Protein GI | 159900590 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
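The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is only illustrative: it assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS length counts the terminal stop codon (the gene sequence listed in this record ends in TAA). The function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; if the stop codon is
    counted in the CDS length, it is subtracted from the residue count.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record for locus tag Haur_4077.
start_bp, end_bp = 5204113, 5208009

length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 3897
print(protein_length(length_bp))  # 1298
```

Under these assumptions the computed values match the Gene Length (3897 bp) and Protein Length (1298 aa) fields of the record.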
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCAAC CAAACATCCT CACCCTCGAA TGTACCGCTG CCGCCGCTGA TTCGCCCGAT CTAGCCCAAT GGACGTTGCG ACTCAATCAA TTGACTGCCA GTGGCCTGTT GCCTGCGCTG CCAACCAAAG AGCAACGCGC CGATTTTCAA TGGTATTTTG AGCGCTATTT AGATTGGCCG TTTTTGGAGT TTCGCGAACG CGCCATGCGG GTTGAAGCCG AGTTAGCGAC AGTTGGCAAG GCACTCTTTG CCGCAATTTT TGAATCAAGC GCCAACGCTG CCAAAATCTA TGCTGAATGG ATGCAGGTTG ATGCTGGTGT GCCAACCTTG CAAATTTTAA GCCTGATTCC CGCCGTGCTG AGCGTGCCAT GGGAGTTGCT GCACGATGAT TTTGGCTTTT TGAACCAACG GCGGATCAAT CCAGTCGCGA TCTATCGCAC CGTTTCCAGC GTTCGCTCAA AAATTGTAGC CCAGCAATTT GCCATGCCGT TGCGGGTGCT GGTGGTTGTG GCACGGCCTG ATGATCAAAA TTTTCTCGAT CCGCGCAGCT CAGCCCAAAC CATTTTTGGC CAGTTACAGC AATTAGAAAA CGATCAAAAA CTCAAGCCTG GCATGATCGA GCTGGAGTTT TTGCGCCCGC CAACCTACGA TCAATTGGCG CGTCGCCTGC AAGATGCCAC AAAACCTGTA CATATATTGC ATTTCGATGG CCACGGCGGC TTTCCCAAAA TCCAGCCAAC CAGCACGCTC TATAAATCGG CCAACGTGCC ACAAGGGGTT TTATCGTTCG AAAAAGCCGA TTATCAGGTT GATACGGTTG AGGCTCGCCG CTTTGCCGAA TTGTTGAACG GCGCTGGCGT GCGTTTGGTC TTGTTGGATG CCTGCCAAAC CAGCGTGATG GATACCAGCG CCAGCGACGA TGCCGAGTAT CAACGCCAAC AGGCTTTGAG CAGCGTGGCA ACCCAGTTGT TGACAGCGGG CGTGCCTGCG GTGGTAGCGA TGAGCGCCAG CGTGATTGTG CCGACCACGG CGCTCTTTTT TGGCGAATTG TATGGCCTGA TCGCCGAGGG CCAATCTGTG CCCGTGGCGT TGGAGCGAGC ACGCCAAGCC TTGCAAAGCC AGCCCGTGCG CCTATACCTC GCCCGTAACG CCGAACAAGA ACCTGATCCA ATTAGCTTGG CCGATTGGTG GTTGCCGCAT TTTTATCAGC AAGCAGCGAT CAATCTCACC CCAACCGGCA AGCCAGCGCG ATCCAAGCCG ACCAAGCTTT CGGGCTTCAT CGAAAACCCA CATCAACGCC CATTTGTTGG CCGCGCCAAG GAGCTATTGC AGCTTGAACG GGCTTTGCTC AAGGGCAAAA TTGCCCTGCT GCATGGCTTT GGTGGCATTG GCAAAACCCG CCTTGCCAGC GAAGCCGCCG CCTGGCTGAC CCAAACCAAG CTCTATCAAG GCGCATTGCT GCTCTCGTTT GAACATGGCG GCAACCAAAT TGCCCTGCTC AGTGCAATTG CCCGCCATTA TGAGTTGCCC GAAACCGATT CGCACGATTT GCAGGCAGCA TTAAAGCGCT TCAAGCCACA CCTACACAAC AAGCCATTGC TAATTATCGC TGATAATTTG GAGAGCATTT TGCCCAACGC GGGTAATGTT GAGCTGAATG TGTTGGAAGC CCAAGAACGC CAAGCCTTGT GGGATTGTGT GCTAGGCTTG CGGCAAGCAG GCGCAGGCAT CATTTTGACC TGCCGCGATT ATGATTTGCG TGATAGCCGC TTGCAGCAAG GCCAATACAC CAGCGTGATT 
GCCATGCGCG GCTTGGATAC CCCCTCGGCT TACAGCTTTG CCACAGCCTT GCTCGATGAT TTGGCGATTG ATCGGCGGCG TGCGCCAAAA TTACAACTAA GCAGCCTGTT GGCGCGGCTC GATCATCACC CCTTGGCCAT GGGCTTGACT CTACGCGCCT TGCGCGACCC AGCCTTGAGT ATCGAGCAAC TGCTGAGCGA CTATAGCAGT GCCTTGCTGC AATACACTGA TGAAACCAGC CCCAACCAAC GCCATAGCTC ACTCGAAGCC TCGCTTAACT ACTCGCTGCA ACGCCTCAGC CCTGAACAGC GCGAGTGGCT GGCTAAGCTC GCACCATTTG AAGGCGGTGC AAGTGAAGAT GATCTTTTGG TGATCACCGA GATTCCTGCC GAGCAATGGG CGCAACTCCG TGCCTCCTTA GAGCATGCGG CATTAATTGT GCCTGTAGCA ATTCCAGGTT TCGAAGCACC ATTTTTGCGT TTTCACCCTA CCCTCACACC CTATTTACGC CAGCAGCACC CCGCTAGCAC CGAGCTAGAG CAACGCTTTG CCGTGCGTTA TTATGGGTTA TCAGGCTATT GTTATCAGCA AGATCGCCAA AACCCGCAAG CAGTTCGCGC CTTGGTACGC TATGAACTGC CCAACCTACA ACGGGCAATC CAAGGCTTAT TGCGCTTCAA GGAGATTCCT GCGGCGGTGA ATATGTCTGA TAACCTAAAT AAGTTTTACA ATAATTTTGG CATGCAACGC GAGCGCAATC TGCTTAATCG GCAAATTGCT CCGTATATTT CTTCTAGTGA ACAGCTTTCG GAAGCCGAAT ATTTGTATGA AAGTGAGCTT GGTGAGGTTG AATATAGCCA AGGTCATTAT GAGTTAGCAC TTAAACGTTT TCAGCAGTTA CTAGCACGGA TTGAGCAACA ATTACCCGAT CAACTGCTTA ACTTTCAGCA TTGCATAACC CTTACCCACA TTGGGCGATG TTTGCAGCGA ATGGGGCAAT CTACTCAATC GCAGCGAACT TATGAACAAG CCTTAGCGGT GATTGAGCAA TTGCTCAATC ACCGCTTGGA TGATCGAGAC TACCTCAAAC AACAAGCGCT GCTAATAACA GCGCTTGCTG ATTGCCACAC CGATCAAGGC CAATTTGCCC AAGCCAAAAG CTCCTATGAG CAAGCGCTAG AAATCAAAAA AACCATTGAC GATCGGAGGG GTCAAGCGGT AAGTTTAGGC CAACTTGGGC TGTTAGCGCT ACGCCAACGT CAGTATGCTG AGGCCACTCA AAAATACAAT GCAGCGCTAG CAATCTATCA GCAGTTGAAC GAGCCTAGTA TCGTAGCTTC TATCTATCAT CAACTTGGCA GAGTTGCCGA AAAGCAAAAG AACTGGACTA TAGCAGAGAA TTACTATCGC CAGAGTCTTA CAATCAAAGA AGGTTTACAC AATAACATTG GAGTCGCAGA AACATGTAAT CAACTAGCAC TTTTGGCAAA AAGTGCTGGT AGATCTATTG AGGCAGAGGG CTGGTTTAAA CGGGCTTTAC AAAATCCTGA TTTACCAGTG TTAAATCGCG CCAAATGGCT CAATAATCTT GCCGATCTCC TTGCCGACCA AATTCAAGCG GGGACTTGGT CGCAGAGCCG CTTGGCTGAA GCTCAAAACT ATGCAGAACA AGCACTAAAT ATTCTTAAAG GGGTTGATCC CGCACAAGCT TCTTTATGGT TACCGCTTAA TATTCTTGCC AACTTAGCCG ATCCAGCAGG CCAAGCAGCA GAGTATTGCC AGCAAGCCCG CATTGCCTAT GCCGCCCATG 
CCGCCAATCG CTGGCATATC GATCAACAAT TTGGAGATTT GATCGCGGCG ATTGTCACTG CAACTCAAGG CAATCAAGAA GTACGCACAG CAGTCGAGCA AACCTTACCG CAATTAGAAG CAAATGGTTG GAAGATCAGC AAGGCTATTC AGCGCATCTG GGCAGGCGAG CGCGATTGGC ATGGGTTGTG TGCCGAGCTT GATAATCAAG ATTCATTGTT GATCTTACGG GTGCTTGAGG AGCTTGAAGC CAATTCAGAA GGCAAAAGTC AAAAGGATAA GGGGTAA
|
Protein sequence | MSQPNILTLE CTAAAADSPD LAQWTLRLNQ LTASGLLPAL PTKEQRADFQ WYFERYLDWP FLEFRERAMR VEAELATVGK ALFAAIFESS ANAAKIYAEW MQVDAGVPTL QILSLIPAVL SVPWELLHDD FGFLNQRRIN PVAIYRTVSS VRSKIVAQQF AMPLRVLVVV ARPDDQNFLD PRSSAQTIFG QLQQLENDQK LKPGMIELEF LRPPTYDQLA RRLQDATKPV HILHFDGHGG FPKIQPTSTL YKSANVPQGV LSFEKADYQV DTVEARRFAE LLNGAGVRLV LLDACQTSVM DTSASDDAEY QRQQALSSVA TQLLTAGVPA VVAMSASVIV PTTALFFGEL YGLIAEGQSV PVALERARQA LQSQPVRLYL ARNAEQEPDP ISLADWWLPH FYQQAAINLT PTGKPARSKP TKLSGFIENP HQRPFVGRAK ELLQLERALL KGKIALLHGF GGIGKTRLAS EAAAWLTQTK LYQGALLLSF EHGGNQIALL SAIARHYELP ETDSHDLQAA LKRFKPHLHN KPLLIIADNL ESILPNAGNV ELNVLEAQER QALWDCVLGL RQAGAGIILT CRDYDLRDSR LQQGQYTSVI AMRGLDTPSA YSFATALLDD LAIDRRRAPK LQLSSLLARL DHHPLAMGLT LRALRDPALS IEQLLSDYSS ALLQYTDETS PNQRHSSLEA SLNYSLQRLS PEQREWLAKL APFEGGASED DLLVITEIPA EQWAQLRASL EHAALIVPVA IPGFEAPFLR FHPTLTPYLR QQHPASTELE QRFAVRYYGL SGYCYQQDRQ NPQAVRALVR YELPNLQRAI QGLLRFKEIP AAVNMSDNLN KFYNNFGMQR ERNLLNRQIA PYISSSEQLS EAEYLYESEL GEVEYSQGHY ELALKRFQQL LARIEQQLPD QLLNFQHCIT LTHIGRCLQR MGQSTQSQRT YEQALAVIEQ LLNHRLDDRD YLKQQALLIT ALADCHTDQG QFAQAKSSYE QALEIKKTID DRRGQAVSLG QLGLLALRQR QYAEATQKYN AALAIYQQLN EPSIVASIYH QLGRVAEKQK NWTIAENYYR QSLTIKEGLH NNIGVAETCN QLALLAKSAG RSIEAEGWFK RALQNPDLPV LNRAKWLNNL ADLLADQIQA GTWSQSRLAE AQNYAEQALN ILKGVDPAQA SLWLPLNILA NLADPAGQAA EYCQQARIAY AAHAANRWHI DQQFGDLIAA IVTATQGNQE VRTAVEQTLP QLEANGWKIS KAIQRIWAGE RDWHGLCAEL DNQDSLLILR VLEELEANSE GKSQKDKG
|
| |