Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_2491 |
Symbol | |
ID | 5734372 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 3183206 |
End bp | 3185176 |
Gene Length | 1971 bp |
Protein Length | 656 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 641279631 |
Product | hypothetical protein |
Protein accession | YP_001545257 |
Protein GI | 159899010 |
COG category | [S] Function unknown |
COG ID | [COG1479] Uncharacterized conserved protein |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGGCTT CAGAAACCAA ATTCCAGCCA ATCATCGAGG GTACCAAACA ATATGTTGTT CCATTATTTC AACGTGCATA TAGTTGGGAT AGACGTGAAT GGGATATTCT TTGGGAAGAT ATTACTGATC TATGTGAAAA CGAAGAACCA AAAAGTCACT TTATTGGATC TATTGTAACA ATGCCAACAC CCTCAGTACC TGAAGGTGTT GGTAAATATC TATTAATTGA TGGACAACAA CGACTAACTA CAATTTTTAT TCTGCTTTGT TTGTTACGAG ATAAAGCTAT TGACATAGGT AATGAAGAGT TGGGTAATGA GATTCAACAA ACAATGTTAG TAAATCCCTT TAAAAAGGGA AATGATTATT TCAAACTGCT ACCAACACAG GTTGATCGAT TAGCATTTCA ATCGCTGATT ACAAAACAAT CCTTTTCTGG TGATAGTCAA ATTGGTAAGT GCTATAAATT CTTTGAACGA AAAATTGATT CATCAAATAT TCTAGAAGTA AATAAAGTAT TAACAAGTAG ATTTTCAGTT GTAAGTATAT TATTAGATTA TGATGATAAT CCTCACCTCG TTTTCGAAAG TCTTAACGCG AAAGGAAGAC AACTAACACA ATCTGATCTC ATTAGAAATT ATTTTTTTAT GAGAATTCAT ATTAACGATC AGGAAGATAT TTATAACAGA TTCTGGAATC CGATGCAAAG TAATCTCAAA GATAATCTCA CTGAATGTAT CAGACATTAT ATGATGAGAA ATGGAGTTAT TGTAAAACAA GGTGATGTTT ATTTTACATT AAAAGAACGT GTTGAAAAAG GAGATGCATT AAAATCTTTA GAACAAATTG CTGTATTTGC AGATTATTAC CAAAAACTGA TCAATCCATC CATTGAACCA AATATTACTG TAAGGAATGC ATTATTCAGA ATAAATCGAC TAGAGATTAC TACATCTTAT CCATTTTTAT TAAGCTGCTA CCATGATTAT GCTAATAAAG ATCTTTCCCC CAATGATTTT ACTGAGGTTC TTCATATTAT TGAGAATTAT ATTCTACGAA GATTTATCTG TAACTATTCA ACAAATCAAT ACAATAAACT CTTTCCATTT TTATATGATT GGGCAAAATC AAACAATCCA TCTAATTTTA TTCAAGGCCT ACGTGAGGTT CTTCAAGAGC GAGGATATCC TAATGATGCA TTGTTCAAGT CACACCTGAT TGAATCAAAG CTCTATGGTA AGGGAGAGCG ATTAGTCAAA ACCAAATTGA TTTTAGAAAC TTTGGAATCG CATTTTCATC ATAAAGAACA AACATCATTT GAAACCCTAT CAATTGAGCA TATTCTCCCT CAAACATTAA CTGAATGGTG GAAACAACAT CTTGGAGAAG ACTGGCAAGC TGATTATGAA CTGGCTGTTC ATACCCTTGG AAACTTAACA CTCACAGCAT ATAATTCGGA GCTTTCTAAT GATACCTTTC CAAAAAAACA GATGTACTTT CAACAAAGTC ATATAGAATT GAATAAATAT TTTCAGAATA TTGAATTTTG GAATAGAGAT GCAATTGAGA CGCGGGCCTC TATCCTTGCG GACATTGCCT TGCTCTGCTG GCCATACTTT GGAAGCAATA AGCCTGTAAC ATTCGCCACC AGTGCCAATG ATGTAACGGG TAAAACGCCT CAAACACTTA TTTTCCTTGG TAGTCGGATT CCAGTTGATT CTTGGAGGCA GGTGATGCTT AAAACTCTCA ATACCATTGC AGACATTGAG CCAGATATGT TTGAGGTCAT AACCCGAGAA TATTCTCGAT TTATTAGCTC TGATAGCACA AGATTTAAGA GAAAGTCAGA GCTTGACAAT GGTCTTTTTG TTGATGTTAA TCTGCGTTCA CGAGATATAT ATCGTTTATG CCAACAAATT ATCATAACTA TCGGATTGTC CCATGAGAAT TGGGGAGTAG AAACAAATTA A
|
Protein sequence | MKASETKFQP IIEGTKQYVV PLFQRAYSWD RREWDILWED ITDLCENEEP KSHFIGSIVT MPTPSVPEGV GKYLLIDGQQ RLTTIFILLC LLRDKAIDIG NEELGNEIQQ TMLVNPFKKG NDYFKLLPTQ VDRLAFQSLI TKQSFSGDSQ IGKCYKFFER KIDSSNILEV NKVLTSRFSV VSILLDYDDN PHLVFESLNA KGRQLTQSDL IRNYFFMRIH INDQEDIYNR FWNPMQSNLK DNLTECIRHY MMRNGVIVKQ GDVYFTLKER VEKGDALKSL EQIAVFADYY QKLINPSIEP NITVRNALFR INRLEITTSY PFLLSCYHDY ANKDLSPNDF TEVLHIIENY ILRRFICNYS TNQYNKLFPF LYDWAKSNNP SNFIQGLREV LQERGYPNDA LFKSHLIESK LYGKGERLVK TKLILETLES HFHHKEQTSF ETLSIEHILP QTLTEWWKQH LGEDWQADYE LAVHTLGNLT LTAYNSELSN DTFPKKQMYF QQSHIELNKY FQNIEFWNRD AIETRASILA DIALLCWPYF GSNKPVTFAT SANDVTGKTP QTLIFLGSRI PVDSWRQVML KTLNTIADIE PDMFEVITRE YSRFISSDST RFKRKSELDN GLFVDVNLRS RDIYRLCQQI IITIGLSHEN WGVETN
|
| |