Gene Information |
Locus tag | Haur_5170 |
Symbol | |
ID | 5737128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009973 |
Strand | - |
Start bp | 248076 |
End bp | 252617 |
Gene Length | 4542 bp |
Protein Length | 1513 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641282335 |
Product | hypothetical protein |
Protein accession | YP_001547926 |
Protein GI | 159901680 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
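The coordinate, gene length, and protein length fields above are mutually consistent, which the short sketch below checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed gene length) and that the protein length excludes the terminal stop codon. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 248076
END_BP = 252617


def gene_length_bp(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length_aa(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length_bp(START_BP, END_BP)
print(length)                     # 4542
print(protein_length_aa(length))  # 1513
```

Both results match the Gene Length (4542 bp) and Protein Length (1513 aa) fields.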
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | TTGAGTGATA CTGGCGATAA GGTTGATGCC TATGGGATTG CGCAGACACC CCATCACTTT ATTGCGCTGA CTACGGGTCG TGAATATCAA TTAGGCTTGT TGCGGGCGGT GGTGGCGCAG CTGATTCTCA AGCACAATCT CACCGTTTCG TACTTTCCGG AAGAAAGCTA TCCGGCGATG AAGGGGGAAT TTGCGCGGAT TCTTGACGAG CTTTCTAAAC ACGGCATCGC GGAAACGATC TACCTGGATG GCCTTGATCA ACTTCAGCCG GAGATTGACG GCTCGCGGGA TTTATCCTTC CTCCCGCCGC AGCCGCCGCC AGGCATCGTG ATCGTGCTTG GTTCGCGACC AGATGAGACG TTGAAACCGC TGGAGATTCT GCATCGGGTG GATTACGACC TGCCGCCACT CAGCGAGCCA GATGCGTTGG CATTGTGGCG ATCGGTTCAG CCTGGCATGG CAGATGGCCT ATTCCATGAC CTCTATACCG CACTGAATGG TAATGCGCTG TTTGTTCATT TAGCGGCGGA TACGATGCAG GGTGCATCGG TGTTTGATGC GACCAGTTTG ATCCAGCAGA TTGAGCAGAA TCCAAGTAAC CTCTTTGGGA TTACCTTGGA ACGGATTAAA CGCATGCCAC AATCCAAGTG GGATGTGGTC TGGAAGCCCA TGCTGGCGCT CTTGCTGGTC GCCCAAGAAC CATTGCGACT GGATGTGCTG GGCGATCTGC TGGAACACGA CCACGACACC ATGCAGGATG CCGTATGGGT CTTAGGAGGC TTAGTGAGCC AGGGCATTGA TCAACGAGTT GCGTTGCATC ACTTGTTGTT TCGTGACTAT CTCGCTGTGT CAGTCTTCAA TCAGCGTGAG GTCAAGCGCT GGCAGCAACG ACTAGCCGAC TGGTGTGCGA TGGATCGGGA TGTGATTTGG ACTGATCATG CTGATATGGT TGAACAGTCA CGGCGGGTCT ATGCACGACA TCACTATGTT ACCCATCTTG CACTGGCCGA GAACTGGACA GTACTGTGGC AGGTCTTGGA TGCGGGCGAC TATGGCGAAC ACAAAACACG CTTCGATCCG AGTACAAGAC TGTATGCGTT AGATCTGGAT CGCGGGCGGG AGAGTGCCAT CAACGCAGGT CAATCAGTTG ACGAACATAT TCAAAACCTA CCACGCTTGT GGAAGTATAG TTTGTTGCGG ACGAGTTTAG CGAGTCGCGT TGATCAATGG CCCGATGATC TCTTTGTTGT CCTTGCGATG CTTGGGCGCA CGCAGGAAGC ATTAGGCCAT ATTGAGTTAT GTTCGAATCC AATGCGACAG ATCCAACTTT GGGAAAATAT TATTAAATGG TGTGATCCAA AGCAGCAAAT GATCGTTATT ACGAGGATGA TGCAATGTGC AGTTGGCTTC TCAGCTTTTT CTCAAACTTG TGCGTTGGCC ACGATTGCTA GAACACTTAC CACACTTGGC GATGTTAACG ATGCTATAAA AATTCTGAAC GAGGCTATTG TCATCACGTC CACCATTGAT GAGACAGAGA ACCGAGCCAA TGCGTTGATC GTTATTATTC AGGCTGCCGC AGCACTCAAC CACTCCGAAC ATAGTGTAAA GATTCTCGAT AGCGTCTTCT GTATCGTTTC TTCTGTATAT AACTTAAGGA ACCATACCAA TATCCTTATT GCTCTTGCAG AAACTGCTAC AACTCTTGGA GATCTCAGTC GTGCTTTTAC TATTATATCC ACCATCCATA CAGTAGAAAA CCGGATTAAC GCACTTGCTG TTATTGCCAG GGCTGCCGCA 
TCTCTTAATA ATCTCCAGCC TGCTATAAGG ATTTTTAATG ATATCCTTAG CCTTGCATCC TCTATCTACC CGAAAAGAAA TTGTACCAAT ATCCTTGTCA CCATTACTCA AACCGCAGTA ACCCTTAACC AACCCGACCG TATCGCAAGT TTTCTCGATA ATGCCCTAGT CATCGCCTCC ACTACCCAAC CTACAAGAGA TCGTACTAAT ATACTTATTG TTATTAGTCA TGCTGCTGCC ATTCTTGGCC ATTCTAACTA TACCATTAGT GTTCTTGATA ATGCCTTAGC TATTGCATCC ACAATCTATC CGACAGGCAA CCGTGCTGAT GCCCTTAGTA CAATTGCCAA AACTACCGCT ACTCTTGGGA ATATCGATCA GGCCCTCTTT ATTACCTCTA CGATAGATGC AGTAGCAAAC CATGTTGATG CCCTTAGTAC AATTGCCAAA ACTACCGCTA CTCTTGGGAA TATCGATCAG GCCCTCTTTA TTGCCTCTAC AATTAATACA CTAGCAAACC GTGCCGATGT TTTGACCAGT ATTGCCAAAA CTGCAGCCCT CCTTGATGAT TTCAACTATG CTACAAAGAT TCTCAACGAC ACCCTCGTCC TTCTCGCTAA CGTTGATACT GCATCAAGGC GTGTTAACGC CTTGACTGCG ATTGCTCAAA CAGTTGCATC ACTCGGCCAT TTTGACCAAG CCCTCGCCAT TGCATCCTCC ATTCATACCG CAGGAAGCCG GGACACTGCT TTGGCTGCGA TTGCTCAAAC AGCTGCATCA CTCGGTCATT TTGACCAAGC CCTCGCCATT GCATCCTCCA TTCATACCGC AGGAAGCCGG GACACTGCTT TGGCTACGAT TGCTCAAACA GCTGCATCAC TCGGCCATTT TGACCAAGCC CTCACCATTG CATCCTCCAT TCATACCGCA GGAAGCCGGA ACACTGCTTT GGCTGCGATT GCTCAAACAG CTGCATCACT CGGCCATTTT GACCAAGCCC TCACCATTGC ATCCTCAATT AACGTGGCGG AAAACCGTGC TAATACCTTA GCTACTATTG CTAAGACCGC TGCATCCCAG GGCGATCTCA GCCTTGCGGT CGCCATTGCA TCTACTGTCG ATGCCACAAG AAGCCGAGCT AAGGCTCTCG CCACTTTTGC CAAGCCTGAT GCCTCCCAGG GCGATCACGA TCATATCAGA ACTCCTGCTA TCGATGCGAC GAGGAACCGT TCCAATACAC TGATCTCCAT TGCCCAAACC GCTGCAACAC TTGGTCATTT TGACCAGGCT CTTGCCATTA TCTCTACGAT TAACGAACTC TGGCGGTATA ATAAAGCTTT AGTTACAATT GCCCAATCTG CCGCATCACT CAGCTATTTT GACCATGCGA TCGCTATTGC CTCATCTATT TATGTAGTAG AGAAACGTAC TCATGCTTTA GCCACTATTG CCCAAACCGC TGCATTATTT GGCCATTTTG ACCAAGCCCT CACTATTGTT CAATCCATCG ATAACAGCCT GGATCGTGCT CATGCTTTAG CCACTATTGC CCAAACTGCT GCAAAACTTG GTCATTTTGA CCAAGCCCTC ACTATTGTTC AATCCATCGA TAGGGGACTA TGGAGGCAAA TCAACGCACT ATCCGAAATT GCCCAAACCG CTGCAAAACT TGGTTATTTT GACCAAGCCC TTACTATTGT ATCCTCCATC GATTCAGTAG AACATCGCAA CAGTGCCTTC GTTGCAATCG CCCAAACTGC TACATCTCTT GGTGATTTCA ACCGTGTTCT GACTATGATC ATTCCCACCA 
TTAATAATGA ACAGTGGAGA CACACTAATG CCTTGGTATC TATTGCCCAA ACGGCTGCCT CCCTCGGCTA TTTCGACCAA GCCCTCGCCA TTGCTTCATT GATTGATGTG AAGAGAAGCA GCGCACATGC ATTAGCGACG ATTGCGAAAA CGGTTGCATC ATTCGGTGAT TTCGGTCGTT CACTTACCAT TACGCAGTCA ATTGATAACG ATTGGTATCG GGCTGATGCC CTTGCCACAA TTGCCCAAAC TGTCGCATCA TTCGGCCACT TCGACCAAGC ACTCACCATT ACATACGCTA TTGATTCAGT AGAAAATAGA AATATTGCCT TGACCACGAT TGCCCAAACG CTAATATCTC TTGGTCATTT CGACCAAGCC CTCCCTATTG TTCAATCCAT TGATAAAAGT TCGCATCGTA ATGATACCCT CGCCACGATT GCTCAAAGGG CTGCATCCCT CGGCTATTTT GACCAAGCCC TTACTATTGT ATCCACCATT GATTCAGTAA GAAATCGCAA TGCCGTATTG ACGACGATTG CGCAAATAGC TGCATCCCTC GGCTATTTTG ACCAAGCCCT CGCCGCTGCC GCTGCCATCA GAGATGATAT AGAAAATCAT ACCAATAACC TCTTGACGAT TGCGCAAAGG GCTGCATCCC TCGGCTATTT TGACCAAGCC CTCGCCGCTG CTTCTGCCAT CAGAGATGAA GATAAACGTA TTAATATTCT TAAAGGTATT TTACAAATCG TCCAACCTGC TAAACTTTTA TTACCCATTA TTCAAAAAGA ATGGTATACC AGCAAAACAT CGAATAAGAC ATGGTTCTGT CTCCCCTTTC TTAATTCCTT ATTTAAAAAT AATTTGTATT TAGTCGGGCA GATTCTGGAA GGGGAAGAAT GGGTCAATGC GCAACTCAAA CGACTGGGGT AA
Protein sequence | MSDTGDKVDA YGIAQTPHHF IALTTGREYQ LGLLRAVVAQ LILKHNLTVS YFPEESYPAM KGEFARILDE LSKHGIAETI YLDGLDQLQP EIDGSRDLSF LPPQPPPGIV IVLGSRPDET LKPLEILHRV DYDLPPLSEP DALALWRSVQ PGMADGLFHD LYTALNGNAL FVHLAADTMQ GASVFDATSL IQQIEQNPSN LFGITLERIK RMPQSKWDVV WKPMLALLLV AQEPLRLDVL GDLLEHDHDT MQDAVWVLGG LVSQGIDQRV ALHHLLFRDY LAVSVFNQRE VKRWQQRLAD WCAMDRDVIW TDHADMVEQS RRVYARHHYV THLALAENWT VLWQVLDAGD YGEHKTRFDP STRLYALDLD RGRESAINAG QSVDEHIQNL PRLWKYSLLR TSLASRVDQW PDDLFVVLAM LGRTQEALGH IELCSNPMRQ IQLWENIIKW CDPKQQMIVI TRMMQCAVGF SAFSQTCALA TIARTLTTLG DVNDAIKILN EAIVITSTID ETENRANALI VIIQAAAALN HSEHSVKILD SVFCIVSSVY NLRNHTNILI ALAETATTLG DLSRAFTIIS TIHTVENRIN ALAVIARAAA SLNNLQPAIR IFNDILSLAS SIYPKRNCTN ILVTITQTAV TLNQPDRIAS FLDNALVIAS TTQPTRDRTN ILIVISHAAA ILGHSNYTIS VLDNALAIAS TIYPTGNRAD ALSTIAKTTA TLGNIDQALF ITSTIDAVAN HVDALSTIAK TTATLGNIDQ ALFIASTINT LANRADVLTS IAKTAALLDD FNYATKILND TLVLLANVDT ASRRVNALTA IAQTVASLGH FDQALAIASS IHTAGSRDTA LAAIAQTAAS LGHFDQALAI ASSIHTAGSR DTALATIAQT AASLGHFDQA LTIASSIHTA GSRNTALAAI AQTAASLGHF DQALTIASSI NVAENRANTL ATIAKTAASQ GDLSLAVAIA STVDATRSRA KALATFAKPD ASQGDHDHIR TPAIDATRNR SNTLISIAQT AATLGHFDQA LAIISTINEL WRYNKALVTI AQSAASLSYF DHAIAIASSI YVVEKRTHAL ATIAQTAALF GHFDQALTIV QSIDNSLDRA HALATIAQTA AKLGHFDQAL TIVQSIDRGL WRQINALSEI AQTAAKLGYF DQALTIVSSI DSVEHRNSAF VAIAQTATSL GDFNRVLTMI IPTINNEQWR HTNALVSIAQ TAASLGYFDQ ALAIASLIDV KRSSAHALAT IAKTVASFGD FGRSLTITQS IDNDWYRADA LATIAQTVAS FGHFDQALTI TYAIDSVENR NIALTTIAQT LISLGHFDQA LPIVQSIDKS SHRNDTLATI AQRAASLGYF DQALTIVSTI DSVRNRNAVL TTIAQIAASL GYFDQALAAA AAIRDDIENH TNNLLTIAQR AASLGYFDQA LAAASAIRDE DKRINILKGI LQIVQPAKLL LPIIQKEWYT SKTSNKTWFC LPFLNSLFKN NLYLVGQILE GEEWVNAQLK RLG
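The gene sequence is listed in coding orientation and begins with TTG rather than ATG. Under translation table 11, TTG is an accepted initiation codon and is read as methionine at the first position. The sketch below is a minimal pure-Python illustration of that, not the database's own pipeline: it strips the 10-base spacing used in this record and translates the first 30 bases. The helper names `clean` and `translate_cds` are hypothetical. The amino-acid assignments are those of the standard code, which table 11 shares; the set of start codons is the one NCBI lists for table 11.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G); table 11 uses the
# same amino-acid assignments and differs only in its permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
START_CODONS_TABLE_11 = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spacing used in this record and upper-case the bases."""
    return "".join(seq.split()).upper()


def translate_cds(seq: str) -> str:
    """Translate a coding-strand CDS fragment that starts at the start codon.

    The first codon must be a table 11 start codon and is emitted as 'M';
    translation stops at the first stop codon, if one is present.
    """
    dna = clean(seq)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    if codons[0] not in START_CODONS_TABLE_11:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, copied from the record above.
first_30 = "TTGAGTGATA CTGGCGATAA GGTTGATGCC"
print(translate_cds(first_30))  # MSDTGDKVDA
```

The output matches the first ten residues of the listed protein sequence.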