Gene Haur_1528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1528 
Symbol 
ID5733415 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1780113 
End bp1782224 
Gene Length2112 bp 
Protein Length703 aa 
Translation table11 
GC content56% 
IMG OID641278668 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_001544300 
Protein GI159898053 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
[COG1925] Phosphotransferase system, HPr-related proteins 
TIGRFAM ID[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.611034 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGAGC TTGATTTGGT CATTCATAAC TCGGCAGGCC TGCATGCGCG GCCTGCCCGA 
GTCTTAGTTG ACCTTGCAAA ACAATTCAAA TCGACGATTT CGATTCGTGC TGGCGGCAAG
CGGGTCAATG CCAAAAGCAT GATCGCGCTG TTGACCCTCG GCGTAGTCTG TGGTCAGGCG
ATCCAAATCG AGATTAACGG CGAAGATGAA GCCGCCGCTG CCGAGGCGAT TACAACTGCG
GTGCACGAAG GTTTGGGCGA AGGTCATGGC ACACCAGCCG CCAATAGCGT TAATGATGCT
TTGGTTGCGG CTCGCAATGG CCATGCCAAT CTCAATGGCC ATGCCAAGCT TGAAACCACC
GTTGCTGTGG CCGAACCGCC GCCTGCGCCA GCCCCAACTC GCGCTGAGCC ACTCAAAGCT
GGCGCGATCA TCCAAGGGAT TGCTGGCGCA CCTGGCATTG CGGTTGGCAC AATTTTGCGC
TACGAACGTG CGCGAATCGA AATTAGCCAC CGATTTAGTG GGGTGGATAA CGAACTCCAG
CGTTTGCAAG CGGCCTTGAC CACCGCCCAA CAGCAATTGG TGGCGCTGCG CGAACAGGTA
TTGTTACGGG CTGATGCCAG CGAAGCCGCC ATTTTTGATG TTCACCGCGA TATTTTGGCC
GATCCTGCCT TGCTTGAGGC TGTGCATGGC TCAATTGGTG CTGGGCAAGG GGCCGAAGTC
GCTTGGCAAC AGGTGATCAA TCAACAGGCT AACGCGATTG CTCAACTCAA CGATGCCCTA
CTTGCCGAGC GTTCTAGCGA TATTCGTGAT GTAGGCGATC GGGTGCTGCG CCTTTTGGTG
GGAGCCGAAG CTTCGACGCT TGATGCCCAC TGGGCAAAGG CCAATCAGCC GATGATTGTC
GTGGCCTATG ATCTGACACC TTCAGAAACC GCTGCTTTTG ATCCAGCCAA AGTGCTGGGT
TTTTGCACAG CGGTAGGTGG CCCAAATGCC CACACCGCGA TTTTGGCTCG TGCCTTGGGT
TTACCAGCGG TGATTAGTGC TGGCCCAAGC GTCCTTGAAC TTGCCACCGG AACCGAAGTA
ATTCTCGATG GTACGGCTGG CACCTTGTTG ATCTGCCCAG CGCCTGAAGC CATTGTGGCG
GCCAAAACTG CCCAGCAGCG CGAGCGTGAG CATCAAGCAT GGGCTATGCG CAGCGCTAAC
GAGCCAGCCA CAACCGTTGA TGGTCAGCAT ATCGAAGTGG TTGCTAACAT TGGTGGCCTG
AGCGAAGCTC AGCAAGCCAC AACTTTAGGT GCTGATGGGG TGGGTTTGTT GCGCACCGAA
TTTTTGTTCT TGGAGCGCAC CCAAGCCCCA ACCGAAGATG AGCAGTTTGC AACCTACCGC
GAAATTGCCC AAGCCATGGG CGATGCCCCA GTGATTGTGC GCACCTTGGA TATTGGCGGC
GACAAGCCAC TGCCCTATCT GGCCTTGCCT GCTGAAGAAA ACCCATTTTT GGGCGAACGT
GGCATTCGCT TGTGCCTAGC ACACCCTGAA TTGCTACAAA CCCAATTACG GGCAATTTTA
CGTGCTGCCA GCTTTGGCCG CTTGCGAATT ATGTTCCCAA TGATTGCCGA TGCTGGTGAG
TTACGCGCTG CCAAAGCCGA AATCGAGCGG ATTCGCAACG AATTGCAAGT AGCTCCAATC
GAAATCGGGA TTATGATCGA AGTGCCTTCA GCCGCTTTGA TGACCGATAT TTTGGCGGCT
GAAGTTGATT TCTTCTCAAT TGGCACCAAC GACCTGACCC AATACACCTT AGCTATGGAT
CGGACGCACC CGACGCTGGC GGCTCAAGCC GATGGTTTGC ATCCAGCGGT GTTGCGCCTG
ATTGCTCGAA CGGTTGAGGC GGCCCATGCT GCTGGCAAAT GGGTCGGAGT TTGCGGCGAG
TTGGGCGCTG ATCCGCAAGC TGTGCCAATT TTGGTGGGCC TTGGGGTTGA TGAACTCAGC
GTCAGCGTGC CCGCCATTCC AACTGTTAAA GCCCAAATCC GGGCATTGAA CTTTGCTCAG
TGCCAAACAT CTGCTCGGCG GGCGTTGGTC TGTGCCACTG CCGCCGAAGT ACGCCAAGGA
GCGTTCGACT AA
 
Protein sequence
MQELDLVIHN SAGLHARPAR VLVDLAKQFK STISIRAGGK RVNAKSMIAL LTLGVVCGQA 
IQIEINGEDE AAAAEAITTA VHEGLGEGHG TPAANSVNDA LVAARNGHAN LNGHAKLETT
VAVAEPPPAP APTRAEPLKA GAIIQGIAGA PGIAVGTILR YERARIEISH RFSGVDNELQ
RLQAALTTAQ QQLVALREQV LLRADASEAA IFDVHRDILA DPALLEAVHG SIGAGQGAEV
AWQQVINQQA NAIAQLNDAL LAERSSDIRD VGDRVLRLLV GAEASTLDAH WAKANQPMIV
VAYDLTPSET AAFDPAKVLG FCTAVGGPNA HTAILARALG LPAVISAGPS VLELATGTEV
ILDGTAGTLL ICPAPEAIVA AKTAQQRERE HQAWAMRSAN EPATTVDGQH IEVVANIGGL
SEAQQATTLG ADGVGLLRTE FLFLERTQAP TEDEQFATYR EIAQAMGDAP VIVRTLDIGG
DKPLPYLALP AEENPFLGER GIRLCLAHPE LLQTQLRAIL RAASFGRLRI MFPMIADAGE
LRAAKAEIER IRNELQVAPI EIGIMIEVPS AALMTDILAA EVDFFSIGTN DLTQYTLAMD
RTHPTLAAQA DGLHPAVLRL IARTVEAAHA AGKWVGVCGE LGADPQAVPI LVGLGVDELS
VSVPAIPTVK AQIRALNFAQ CQTSARRALV CATAAEVRQG AFD