Gene Information |
Locus tag | Hhal_0928 |
Symbol | |
ID | 4710251 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | - |
Start bp | 1002505 |
End bp | 1006188 |
Gene Length | 3684 bp |
Protein Length | 1227 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 639855397 |
Product | hypothetical protein |
Protein accession | YP_001002506 |
Protein GI | 121997719 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
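The coordinate and length fields above agree with each other if the coordinates are read as inclusive and 1-based and the final codon is taken to be a stop codon. The following is a minimal sketch of that arithmetic under those two assumptions; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record above.
length = gene_length(1002505, 1006188)
print(length, protein_length(length))
```

With the listed Start bp and End bp this gives 3684 bp and 1227 aa, matching the Gene Length and Protein Length fields.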
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.261917 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGACCCGGC CCACGGTAAC CGCCAATGGC CGCACCCTGG TGCACGCTGG CTCCGGGGGC GAGTTGCTGA CCACCGACGT CTGCCGCACC ACCGTCGGCA GCTCGACGGT GTCGATCCCG TACCTCAATG TGGCCGAGTC GCGGGACGCC GCGGAGACCG CCCGAACTGT CTTCGTCGAG GGGCACCCGG TCTGCACCGA GGCCTCGCAG TTCGCCCGCT CCCGGGGCGA CGAGGGGGGT GATCAGGGCG GGACCGTTTC CGGCACCATC ACCGGTCCGG CGACCTTCCT GCCCGGCAGC GGCTCGCCGG ACGTCTTCGT CGAGGGCGTG CCGGTGGCTC GAGCCCTCGA CGCCATGGTG GCCAACCACG AGAACACCCC GCCGGCCCCG CTGCTCCAGG AGCAGCTCCC GCCCACCACC CTGACCGGCG GCGATCCGCC GCCGGACGCC GAGCCACGAC CGCGGACGCG GATCGACATC GGTGGCGGGC CGCAGGCCCC GCTCCCCGGC GAGGTGGTGG TCGAGCCGGC GTCGGGCGGC TGGCACCGCA GCCGGGCCCT ACGCTGGAGC CACCACGACG GCGAGCGGTG GGGAGCCGCC ATCGAAGAGC TCCCCGAAGG GACCGAAGCG CTGCAACTGA GCCTGATCGA CGCCGACCGG CGTCACCACC TCTGCAAGGT CCCGCTCACC CGCGGGGCCG AACAGCCGGC GTGGCTGCCG GGGGAGGAGC CCGAGGTCTC CCCGCCGACG GTCCTGATCC CGACGCTGGT GCGCTTCGCC CGGGACGCCG AGCTCCGCCA CCGCGACGCC GGCTTCGGCT TCGGGGGGTG GCTCTACCTG TTCCGTCGGG AGGGGGATCG GCGCTACCTG TGGCGTGAGT GCTATGTCCA CCCGCGTGGC CGCCTCCACG AGGTCAACCT GGCCTACGAG CACGGCGACC TGCGCCGGCC CACCGGTGAT GGGCGGCCGA TGCTGCTCCT GCCGCTGCGC ATCGGCGGCG AGCCGGTGGT CCTGGAGGGG CTGTTTTCTG CCGTCCAGCT CCCCTGGGCG CGCATCGCCG CCGCCGGCGG GCTGGCCCCC GACGATCCCC GGGCCGGGGT ACACGGTGAA CCGGACGGCG CCGCCGAGCC ACCCCCCGAG GCGCGCTTCT TCCCCATTGA GCTAACCGAA GACCCCGACA CCGGGCCCGG GGCGGCCGGC CGCCTCGGGG GGAGTGGCGC CTTCCGCGAC GCGGTGGCGG AAGAAGGCGA ATCCGAGCCG GTCCGCGAGG ATCTCGCCCC CTACGCCGAG GACGGTCTGG CATTGCTCTA CCTCGACGAC CCCCTGGGCG AGGCGCGCGG GTTGGCCCAT GAGCAGGCGG CGCTGGCGGA TCGACTCGAG ACGCTGGTCG CCGGGCTGCA GACCGGCACC AATCCGGAGC GAAGCACCGT CGGCGAAGGG GAGGGCGATC CGCAGCAGCC CGACGAGCAA TGGCTGGCGG CGGCCGAATC CCTCCACGCC GTGGCGGCGG GCACCTTTCG TCTGGCCTTC GTCGAGGAGC GCCTGGGGCC CGGCGAGCAA CCGCCCGACC GCGAGCTGCT CGAGCGCCTG CTCGGTGTCG CCGAACGCAG CGAGATCCGC GCCGAGATTC GCCGGGTGCG CCGCAAGCGC TTGGCGCTGC TCAGTAGCCA ACGTTACGCC GATGCCTTGG CCGACTACGC CGGCAACACC CCGCGCCGGG TCTTCGAGGG ACTCGCCGCC CTGGGCGCGC ACTTCCGGGA GCTGGCCGTG CCGGCGGACA CCGCCGACGG TTACATCGGC 
GACATCCCCG CGGCCGAGGG CGAGGCGGAG GCCGCAGAAC TGGATCAGGC CCGCGCGCTG CTGCGCACCG TCCTGGAGGC CGAGGGTGAT CCCTCCGGGC TGGCCCGGGT CGCCGCCCGG TTGTTGGCGG CCGAGCCGCT GACCCTGGGC GCCGGGGCGC GGGGCGGCGG CTTGCCCCGG CCGCGCCGGG AGCTGGAGCT GCAGCTGGCG GCGGCGCCGC GGCGTCTCCA GGCAGTGGAG CGTTGCTGCG ACCTGGCCGA GCAGCGCCGT GCGGTGACCG CCGAGGACGC GGCGGAGGTC CTGTACGTGT TGCTGTCGAC CTTCGCCGAG TCGGTGAGCT ATCAGGGCGA CGCCCTCGAG CACGTCATGC GCATCACCTG CCGCTTCCAG GAGCTGCCCG GTCTGGCCCT GCTGGCGGGG TTGGAGGTGA CCACGGTCAG CTACCCGGCG CCGGACTTCG ACGGCGTCCC GCTGGCCGAA ATCCAGCGCC GCGCCGGAGC GCCTACGGGA GGCGAGGCGC GCACCCTGCG GATCGAGCAC CAGGGCTACG AGCGGCGCTT CCGCATCCTC GCCGCCGGCA GCGACGACTA CCACGCCGCC GTGGCCGATG CCGGCCGCGG CCCGGTGCAG GTCCTCGGCA GCCCCCTCAC CGCCGAGCTG CGCATCGAGG ACGAGATCCG TGTCCGCGAA CTGCGTTCCA CCCGCCTGCC GGAGGGTCGC CTGGCCGACC TGGCCGAGCG CCTCTCCACC AGCTCCGGCG GCCACGCACT GCGCAGCGTC TTCGGCGCCT TGGAGCTCAT CAACCTCAAC CAGGCGGTAC GCCAGACCCG TACCGAGGCC TGGTACCACA TCGCCAGCGC CGTCGCCTCC TCCCTGACCC TGGCGGAGCT GCTCGCCCAG ACCCGGCTCG CCTATCTGAA GCCACGGATT CCGGTGGGGG ATCCGAGGTA TGGGCGTATC GAGCGGTTGG GATTACGGGC GCAAGTGCTT GGTGGTGCAG CCGCTTTCGC AAGCGCGATC TATGCCGCGG GCCGCGGCTA CAGTCGATCC TCACGTGGTG ATGAGGTAGC CAGCCGGACG TGGTTCGTGG CGGGGTTAGC TTACCTGGGT GCGGGTGGCT TCTGGCTTGC GGGTAGCGGA GGTGTTGGTC TGTCGTTTCT CTTGGTAGCC GTCGGTGCGT CGGCTATAGC GGGCTATGTT TCAGATACTT CGTTTGCCGA TTTCTGTCGC AATGGCCCCT TTACCCCAAC AGCCCGGGAG CGGCTGGGTG GCTCGGACCG GGATTCGGGG TGGGCCTGGG TGGCCCGGGC GGTGGAGGTG GGGCCACCGG TGCAACGGCC CGCCGAGACC GGGAGCTGGG ACGACTGGCC GGCGGTGGCC GCCTGGTGGC AGCGCGTCCT GCACCGCCCG CCGCTGCGGG TGGCCGCCGA GTACAGCCAG CAGCTGGGCG GCCGGGGCGC CCTGCGCCAG GTCCGCCTGA CCCTGGCGGG GGTGGCGTGG CGGCGGGACC GCCTGGAGTG TCGGGTGCTG CTCTGCCCGG AGGACGGTGC CTGCCAGGAG GTGACCCCGC TGCCTCCAGC GTTCATCAAG GCGGCGCCGG GGGAGGGCTC CGACGCCCTC ACGGTAACCC TCGCCTGGGA CGCCCTGCCG ACGGCCCCGG GGTGGCGCGG CGAGCTCTGC TTCCTCATCC GGCAGCGGGC GGTAGCCGGC GCCGGTGACG ACGCCCTGCC GGAGCCCGAG GCCGACGGCA CGCCCCGCTA CTGGGTGGCC CGGGTGCCCG TGCCCGCCTG GCAGGGTCGG GCCAAGGCGG 
CGGTGGCGCC CCGTGCGCTG ACCGTGGACG AGGTGCTCAG GGAAGAGGCG GCCGGGAGCC GGTCGCAGGG GTGA
|
Protein sequence | MTRPTVTANG RTLVHAGSGG ELLTTDVCRT TVGSSTVSIP YLNVAESRDA AETARTVFVE GHPVCTEASQ FARSRGDEGG DQGGTVSGTI TGPATFLPGS GSPDVFVEGV PVARALDAMV ANHENTPPAP LLQEQLPPTT LTGGDPPPDA EPRPRTRIDI GGGPQAPLPG EVVVEPASGG WHRSRALRWS HHDGERWGAA IEELPEGTEA LQLSLIDADR RHHLCKVPLT RGAEQPAWLP GEEPEVSPPT VLIPTLVRFA RDAELRHRDA GFGFGGWLYL FRREGDRRYL WRECYVHPRG RLHEVNLAYE HGDLRRPTGD GRPMLLLPLR IGGEPVVLEG LFSAVQLPWA RIAAAGGLAP DDPRAGVHGE PDGAAEPPPE ARFFPIELTE DPDTGPGAAG RLGGSGAFRD AVAEEGESEP VREDLAPYAE DGLALLYLDD PLGEARGLAH EQAALADRLE TLVAGLQTGT NPERSTVGEG EGDPQQPDEQ WLAAAESLHA VAAGTFRLAF VEERLGPGEQ PPDRELLERL LGVAERSEIR AEIRRVRRKR LALLSSQRYA DALADYAGNT PRRVFEGLAA LGAHFRELAV PADTADGYIG DIPAAEGEAE AAELDQARAL LRTVLEAEGD PSGLARVAAR LLAAEPLTLG AGARGGGLPR PRRELELQLA AAPRRLQAVE RCCDLAEQRR AVTAEDAAEV LYVLLSTFAE SVSYQGDALE HVMRITCRFQ ELPGLALLAG LEVTTVSYPA PDFDGVPLAE IQRRAGAPTG GEARTLRIEH QGYERRFRIL AAGSDDYHAA VADAGRGPVQ VLGSPLTAEL RIEDEIRVRE LRSTRLPEGR LADLAERLST SSGGHALRSV FGALELINLN QAVRQTRTEA WYHIASAVAS SLTLAELLAQ TRLAYLKPRI PVGDPRYGRI ERLGLRAQVL GGAAAFASAI YAAGRGYSRS SRGDEVASRT WFVAGLAYLG AGGFWLAGSG GVGLSFLLVA VGASAIAGYV SDTSFADFCR NGPFTPTARE RLGGSDRDSG WAWVARAVEV GPPVQRPAET GSWDDWPAVA AWWQRVLHRP PLRVAAEYSQ QLGGRGALRQ VRLTLAGVAW RRDRLECRVL LCPEDGACQE VTPLPPAFIK AAPGEGSDAL TVTLAWDALP TAPGWRGELC FLIRQRAVAG AGDDALPEPE ADGTPRYWVA RVPVPAWQGR AKAAVAPRAL TVDEVLREEA AGSRSQG
|
| |
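The sequences above are printed in space-separated blocks of 10, so they need cleaning before any computation. The following is a minimal sketch, with hypothetical helper names, that removes the whitespace and computes the G+C fraction. It is shown on the first three blocks of the gene sequence only, so its result is not expected to equal the 74% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence; upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
head = clean_sequence("GTGACCCGGC CCACGGTAAC CGCCAATGGC")
print(len(head), gc_fraction(head))
```

Passing the full gene sequence through `clean_sequence` first would allow the same function to be compared against the GC content field.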