Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_1528 |
Symbol | |
ID | 5733415 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | + |
Start bp | 1780113 |
End bp | 1782224 |
Gene Length | 2112 bp |
Protein Length | 703 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641278668 |
Product | phosphoenolpyruvate-protein phosphotransferase |
Protein accession | YP_001544300 |
Protein GI | 159898053 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria) [COG1925] Phosphotransferase system, HPr-related proteins |
TIGRFAM ID | [TIGR01003] Phosphotransferase System HPr (HPr) Family [TIGR01417] phosphoenolpyruvate-protein phosphotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.611034 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGAGC TTGATTTGGT CATTCATAAC TCGGCAGGCC TGCATGCGCG GCCTGCCCGA GTCTTAGTTG ACCTTGCAAA ACAATTCAAA TCGACGATTT CGATTCGTGC TGGCGGCAAG CGGGTCAATG CCAAAAGCAT GATCGCGCTG TTGACCCTCG GCGTAGTCTG TGGTCAGGCG ATCCAAATCG AGATTAACGG CGAAGATGAA GCCGCCGCTG CCGAGGCGAT TACAACTGCG GTGCACGAAG GTTTGGGCGA AGGTCATGGC ACACCAGCCG CCAATAGCGT TAATGATGCT TTGGTTGCGG CTCGCAATGG CCATGCCAAT CTCAATGGCC ATGCCAAGCT TGAAACCACC GTTGCTGTGG CCGAACCGCC GCCTGCGCCA GCCCCAACTC GCGCTGAGCC ACTCAAAGCT GGCGCGATCA TCCAAGGGAT TGCTGGCGCA CCTGGCATTG CGGTTGGCAC AATTTTGCGC TACGAACGTG CGCGAATCGA AATTAGCCAC CGATTTAGTG GGGTGGATAA CGAACTCCAG CGTTTGCAAG CGGCCTTGAC CACCGCCCAA CAGCAATTGG TGGCGCTGCG CGAACAGGTA TTGTTACGGG CTGATGCCAG CGAAGCCGCC ATTTTTGATG TTCACCGCGA TATTTTGGCC GATCCTGCCT TGCTTGAGGC TGTGCATGGC TCAATTGGTG CTGGGCAAGG GGCCGAAGTC GCTTGGCAAC AGGTGATCAA TCAACAGGCT AACGCGATTG CTCAACTCAA CGATGCCCTA CTTGCCGAGC GTTCTAGCGA TATTCGTGAT GTAGGCGATC GGGTGCTGCG CCTTTTGGTG GGAGCCGAAG CTTCGACGCT TGATGCCCAC TGGGCAAAGG CCAATCAGCC GATGATTGTC GTGGCCTATG ATCTGACACC TTCAGAAACC GCTGCTTTTG ATCCAGCCAA AGTGCTGGGT TTTTGCACAG CGGTAGGTGG CCCAAATGCC CACACCGCGA TTTTGGCTCG TGCCTTGGGT TTACCAGCGG TGATTAGTGC TGGCCCAAGC GTCCTTGAAC TTGCCACCGG AACCGAAGTA ATTCTCGATG GTACGGCTGG CACCTTGTTG ATCTGCCCAG CGCCTGAAGC CATTGTGGCG GCCAAAACTG CCCAGCAGCG CGAGCGTGAG CATCAAGCAT GGGCTATGCG CAGCGCTAAC GAGCCAGCCA CAACCGTTGA TGGTCAGCAT ATCGAAGTGG TTGCTAACAT TGGTGGCCTG AGCGAAGCTC AGCAAGCCAC AACTTTAGGT GCTGATGGGG TGGGTTTGTT GCGCACCGAA TTTTTGTTCT TGGAGCGCAC CCAAGCCCCA ACCGAAGATG AGCAGTTTGC AACCTACCGC GAAATTGCCC AAGCCATGGG CGATGCCCCA GTGATTGTGC GCACCTTGGA TATTGGCGGC GACAAGCCAC TGCCCTATCT GGCCTTGCCT GCTGAAGAAA ACCCATTTTT GGGCGAACGT GGCATTCGCT TGTGCCTAGC ACACCCTGAA TTGCTACAAA CCCAATTACG GGCAATTTTA CGTGCTGCCA GCTTTGGCCG CTTGCGAATT ATGTTCCCAA TGATTGCCGA TGCTGGTGAG TTACGCGCTG CCAAAGCCGA AATCGAGCGG ATTCGCAACG AATTGCAAGT AGCTCCAATC GAAATCGGGA TTATGATCGA AGTGCCTTCA GCCGCTTTGA TGACCGATAT TTTGGCGGCT GAAGTTGATT TCTTCTCAAT TGGCACCAAC GACCTGACCC AATACACCTT AGCTATGGAT CGGACGCACC CGACGCTGGC GGCTCAAGCC GATGGTTTGC ATCCAGCGGT GTTGCGCCTG ATTGCTCGAA CGGTTGAGGC GGCCCATGCT GCTGGCAAAT GGGTCGGAGT TTGCGGCGAG TTGGGCGCTG ATCCGCAAGC TGTGCCAATT TTGGTGGGCC TTGGGGTTGA TGAACTCAGC GTCAGCGTGC CCGCCATTCC AACTGTTAAA GCCCAAATCC GGGCATTGAA CTTTGCTCAG TGCCAAACAT CTGCTCGGCG GGCGTTGGTC TGTGCCACTG CCGCCGAAGT ACGCCAAGGA GCGTTCGACT AA
|
Protein sequence | MQELDLVIHN SAGLHARPAR VLVDLAKQFK STISIRAGGK RVNAKSMIAL LTLGVVCGQA IQIEINGEDE AAAAEAITTA VHEGLGEGHG TPAANSVNDA LVAARNGHAN LNGHAKLETT VAVAEPPPAP APTRAEPLKA GAIIQGIAGA PGIAVGTILR YERARIEISH RFSGVDNELQ RLQAALTTAQ QQLVALREQV LLRADASEAA IFDVHRDILA DPALLEAVHG SIGAGQGAEV AWQQVINQQA NAIAQLNDAL LAERSSDIRD VGDRVLRLLV GAEASTLDAH WAKANQPMIV VAYDLTPSET AAFDPAKVLG FCTAVGGPNA HTAILARALG LPAVISAGPS VLELATGTEV ILDGTAGTLL ICPAPEAIVA AKTAQQRERE HQAWAMRSAN EPATTVDGQH IEVVANIGGL SEAQQATTLG ADGVGLLRTE FLFLERTQAP TEDEQFATYR EIAQAMGDAP VIVRTLDIGG DKPLPYLALP AEENPFLGER GIRLCLAHPE LLQTQLRAIL RAASFGRLRI MFPMIADAGE LRAAKAEIER IRNELQVAPI EIGIMIEVPS AALMTDILAA EVDFFSIGTN DLTQYTLAMD RTHPTLAAQA DGLHPAVLRL IARTVEAAHA AGKWVGVCGE LGADPQAVPI LVGLGVDELS VSVPAIPTVK AQIRALNFAQ CQTSARRALV CATAAEVRQG AFD
|
| |