Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_0771 |
Symbol | |
ID | 4710665 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 855103 |
End bp | 858171 |
Gene Length | 3069 bp |
Protein Length | 1022 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 639855232 |
Product | hypothetical protein |
Protein accession | YP_001002351 |
Protein GI | 121997564 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0574] Phosphoenolpyruvate synthase/pyruvate phosphate dikinase [COG1213] Predicted sugar nucleotidyltransferases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTCCAC AGCTAACTGA CTATAGTACG CTTATAATCC TCGGCGCCGG CGCCCCCCAC CGTGGGGACC TTCCAGCCGC TCTGCGCGAG CCGCGCTCCG GCACTTCGGT CCTGCAGTGG CTGCTGGACG CGTCCGGCTG CTCGCTCGAG TGCACTACCT TCGTCGCTGG GTACCAAGCA GACGCAATTC GAGAGCGTTA TCCCGATTTG GCGGTGGTGG AAAACCGGGA TTGGGAGCAC ACGGGGAGTG GCGCTTCACT CCTGGCGGCG CCTTTTGGTA CGAACTCCCC CCTGCTGGTC TGCTACAGCG ACATCCTCTT CCGTCAGACA GTGCCCACGG CGCTGGCGCG CTGCGAGGCC GATGTGGCAG TGGCCTGGGA CAGCGCCTGG GACCACCGCT ACGCCGGGCG CGAGGCTGCG GACCTGGCCC GCTGCGAGAA GGTGATGGTC AACGGCGAGC GCATCGAGCG CCTCGGCGCC GACCTGCCGG TGGACTGGGC CGACGGCGAG TTCATTGGCC TGGTGCGCTT CTCCCCGCGC GCCCTGGAGT GGCTGGAGCG CCTGCGCGCG GACGGCCCCG AGAGCCTGCG CCAGCGCCAC CTCTCCGAAT ACATCGAATA CCTGCGTGCG GCCGGCCTTT CGGTCGCCGG CGTCGATGTC GCTGGAGACT GGGCCGAGTT CAACGAACCA CGGGACATCG CCCACTTCAT TCTCGGTACC AAGGCCGAGA CATTGGGCCG GCTGCGCGGC ATGGTCCGCC ACGCGGTCAT TCAGGACCAA GTCGTATGCA CCGTCGCCGA GTGGCAAGCC GATCGCGCGG CGGTGATGGA CCAGGTGCGC CAGCGCTTCA CCGGAGAGCG GCTGGTTGTC CGCTCCAGCG CCCGTAGCGA GGACTCCTTC CACCATTCGA ACGCTGGCGG CTACGACAGC CTGCTCAACG TCGATCCTCG GAATGGCCTT GAAGAGGCCG TCGAGCAAGT CATTGCTTCC TATGGGACGG CGGATGACGA CGACCAGGTG CTCATCCAGC CCATGGTCAC CGACGTGGCC ATGAGCGGCG TCGCCTTTAC CCGTACCCTG GAGCACGGGG CACCGTGGTA TGTGGTCAAC TACGAGACCG CCGGCGACAC CGAAGCCATC ACCAGCGGGG CTAGCGGTGA CCACCGCACG CTGCTGCTGC GCCGTGGCGC GGAACCCGAA ACGCTGCCGG AGCCACGGCT GGGGCCGCTG GTGGCCGCAC TGCACGAGAT CGAGGCCCTC CTCGGTTACG ACGCCCTGGA CGTGGAATTC GCGCTCGACC CCGCCGGCTC GGTGCACATC CTCCAGGTCC GACCCATCGC GGTAGATCGC AAGGGTAGCG ACCTGGACGA TAGTGCCTTC GACGCCGCCA TGGCTGCCGC CCACGCCCGA TGGGAGACGC TAACCCCGGC CCCACCCCAC CTGCCCGGGG ATCCGGCGCC GCTCTATGGC GTGATGCCCG ACTGGAATCC GGCGGAGATC ATCGGCACCG CTCCGGGCGC CCTGGCCGCC AGTCTCTACC GTCACCTGAT CATGGACGAG ACCTGGGCCA TCCAGCGCGC CGAGTACGGT TACCGCGACG TGCGCCCGGC TCCACTGCTT GTGGAGTTCG CTGGCCACCC GTATGTGGAT GTGCGTGCCA GCTTTGCCTC GTTCCTGCCG GCGGAACTCC CCGACGATTT GGCCGGCCGG CTGCTGAGCT TCTACCTGGA GTGGCTGCGC CAGCGCCCGG AGCTCCACGA CAAGGTGGAG TTCGAGGTGG TGCCCACCTG CCTGGCCCCC GGCTTCGAGG GCTGGGAGCG GCGCCTGCAG GAGGACGGCG GCTTCAGTAC CGACGAGGTC CGCGCCCTGC GCGAGGGGCT ACGCCGGATC ACCGCCGGCG CCTTCCACCG CTGCGAGGAC GATCTGGCGC AGATCGAGAC CCTGCGCCAA CGCTTCCAGG CGATCGAGGG CGATACCCGG CTGGCACCAC TGGAGCGCGC ACGGATCCTA CTGGACGACT GCCGGCGGCT TGGGACACTG CCGTTCTCCC ATCTGGCTCG CAGCGGTTTC GTCGCCGTAA CACTGCTGCG CGAAGCCGAG GCGTGCGGGA TGATCAGCGC AACCGCCCGG GAGAGCTTTC TCTCCACGGT CCGAACGGTG AGCCACCGGC TCACGGCGGA CGCACGGGCC ACCGCCACCG GGGAGATGAG CGAGCACGCC TTTATCGCTC GTTACGGCCA CCTGCGCCCG GGCACCTATG ACATCACCTC GCCCCGTTAC GACGCCGATC CAGAGCGCTT CCTCCGGCCG CTGGTGGAGC ACGCCCGGGA GGCTGCCATG GAGGAAGAAA ACCCTGGGCC CTGGCAAGCC GAGCGCGCAG CCTTCTTCTC GGCACTGGCC GAGCTCGGCC TGCCGGCCGA CCCGGAGCGG GTCGAGACCT TCCTGCGTCA GGCCATCGAG GGGCGGGAAT ACGCCAAGTT CATCTTCAGC CGCAACCTGT CCGCGGCGCT GGAGGCACTG GCTGAGGCGG GAGCACAGTA CGGTCTGGAG CGCGCCCAGG TCGCGCACCT GCCGCTGGAC GAGCTACTCG CCCTGCGCTC CGCGGCGCGT TCTGATGAGG CGATCGCCCG TCATCTGAGG ATGCGCGCCG ATGAGGAGGC CGAGGCCCGG CGGGTGGCAG GGGCCTGCGA GCTACCGCCG CTGATCACCG GCCAGGCGGA CCTGGATGCC TTCGTCATTG GTGCCGACCG ACCCAACTTC ATCGGCTCCG GCTGCATCAC GGCGGACTGC CTGGACCTCG GCGATCAACC GGCCGATGCG GACCTAGACG TGTCCGGGCG GATCGTCCTC ATCCCTCAGG CCGATCCCGG CTACGATTGG CTCTTCGGCC AGGGGATCAC CGGGCTGGTG ACCCTCTACG GCGGCGCCAA CTCGCACATG GCCATCCGCG CTGCGGAATT CGGTCTACCG GCCGCCATCG GCATTGGCGA GCAACGCTAC CGTGAATTGG CCCAAGCGCG AGTTGTCGAG CTTGCCCCGG CCAACGGCAT CCTGCGGGTG GTCCGATGA
|
Protein sequence | MVPQLTDYST LIILGAGAPH RGDLPAALRE PRSGTSVLQW LLDASGCSLE CTTFVAGYQA DAIRERYPDL AVVENRDWEH TGSGASLLAA PFGTNSPLLV CYSDILFRQT VPTALARCEA DVAVAWDSAW DHRYAGREAA DLARCEKVMV NGERIERLGA DLPVDWADGE FIGLVRFSPR ALEWLERLRA DGPESLRQRH LSEYIEYLRA AGLSVAGVDV AGDWAEFNEP RDIAHFILGT KAETLGRLRG MVRHAVIQDQ VVCTVAEWQA DRAAVMDQVR QRFTGERLVV RSSARSEDSF HHSNAGGYDS LLNVDPRNGL EEAVEQVIAS YGTADDDDQV LIQPMVTDVA MSGVAFTRTL EHGAPWYVVN YETAGDTEAI TSGASGDHRT LLLRRGAEPE TLPEPRLGPL VAALHEIEAL LGYDALDVEF ALDPAGSVHI LQVRPIAVDR KGSDLDDSAF DAAMAAAHAR WETLTPAPPH LPGDPAPLYG VMPDWNPAEI IGTAPGALAA SLYRHLIMDE TWAIQRAEYG YRDVRPAPLL VEFAGHPYVD VRASFASFLP AELPDDLAGR LLSFYLEWLR QRPELHDKVE FEVVPTCLAP GFEGWERRLQ EDGGFSTDEV RALREGLRRI TAGAFHRCED DLAQIETLRQ RFQAIEGDTR LAPLERARIL LDDCRRLGTL PFSHLARSGF VAVTLLREAE ACGMISATAR ESFLSTVRTV SHRLTADARA TATGEMSEHA FIARYGHLRP GTYDITSPRY DADPERFLRP LVEHAREAAM EEENPGPWQA ERAAFFSALA ELGLPADPER VETFLRQAIE GREYAKFIFS RNLSAALEAL AEAGAQYGLE RAQVAHLPLD ELLALRSAAR SDEAIARHLR MRADEEAEAR RVAGACELPP LITGQADLDA FVIGADRPNF IGSGCITADC LDLGDQPADA DLDVSGRIVL IPQADPGYDW LFGQGITGLV TLYGGANSHM AIRAAEFGLP AAIGIGEQRY RELAQARVVE LAPANGILRV VR
|
| |