Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hhal_1902 |
Symbol | |
ID | 4710676 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhodospira halophila SL1 |
Kingdom | Bacteria |
Replicon accession | NC_008789 |
Strand | + |
Start bp | 2093831 |
End bp | 2095954 |
Gene Length | 2124 bp |
Protein Length | 707 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639856375 |
Product | putative PAS/PAC sensor protein |
Protein accession | YP_001003468 |
Protein GI | 121998681 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG2176] DNA polymerase III, alpha subunit (gram-positive type) |
TIGRFAM ID | [TIGR00573] exonuclease, DNA polymerase III, epsilon subunit family [TIGR01406] DNA polymerase III, epsilon subunit, Proteobacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCAGCG CCATGGCACG TCGCCGGTTG AAATTCTGGG CACCGGCGCT GGTCCTGCTG CTGCTGATCG CCGCTGCCCT GGCCACCGTC GGCTACCTGA CCCTGGGACA CGTTCCCGAG GGACCGGAGC GCGCCAATGC GCTGCTCGCC CTGGGAGGCG CCGGCGCGGT GCTCATCGGC GCCACCGTGG CCATCTGGAT CCTCCTGGAC GCCACCGTCA TCCGCCCCCT GGGCGCGCTG GCCCGCGGGG CGTCGATCAT GGCCCACTCG AACCCGGCCC ACGAACTGGA GATCCCGAGC ATGCACCTGC TCGGCGAGCT CCCGGAGAGC CTACAGACGC TGGGCAGCAA CCTCTATGAG ACCCGGCGCG AAGTGGCCAA GGCCCTGGGC TCCGGCGCCC AGGGGGTCGA GAACCAGAAG ACGCGGCTGG AGATCGTTCT GCGCGAGATC CAGCAGGGGG TCATCGTCTG CGACACCGAG GGCCGGGTGC TGCTCTACAA CCCGGCCGCC GGCGAGATCC TGCGCAGTGA TGCACTGGGT CTGGGGCGCT CCATCTACGA CGTGCTGGCC CGCTCGGCGG TGGACCACAC CCTGGAGATG CTCCAACACC GCCTGGCCAT CGCCGACGAC CACACCGTGG CGGAAAACCG CGCCGAATTC GTCTGCGCGA CGGTGGATGA CGGCGCCCTG CTGCACTGCC GGATGAGCCT GCTCCCCTCC TCCAGCCCCC TGCGCTCGGG GTTCGTGCTG ACCATCGAGG ACATCACCCG CAAGATTGAA GGCGTAGCCC GCCGCGACCA CGCCCTGCGC ACCGCCGTGG AGGCCCTGCG CCCGCCGCTG GCGAACCTGC GCGCCGCCGC CGAGAGCCTC GGCCAGGGCG ACGAGGCCAT GGCCCGCGAG CAGCGCCAGT CGTTCGAGAC GCTCATGGTC CACGAAAGCC GGGAGCTGAG CCGACGCTTC GAGCGCATGG CCCGGGAGAC CCACCGACTC GTCTCGGCCC CCTGGACCAT GGCGGACATC AGCAGCGCGG ACCTATTGGC CAGCGTCCTG CGCCGCCACC CGGAGGGATT GCCGAAGGTC GAGGTGGTCG GCATCCCGCT GTGGATGCAC GCCGAGAGCC ACTCCATCGG CCTGGTACTG GAGCACATGC TGCGCCACCT GCGCGGCGAA CTCGGCATCG AGCAGATCCG CGCCGAGCCC CTGATGGGGG ATCAGCGGGT GTATCTGGAT CTCTCCTGGC AGGGCCCACC GATCCCGCCG GATCAGCTCG AGACGTGGCT GGACGAGGAG CTGCCCGAGG CCACCGGGCA ACTGACCCCG CGCGGGGTGC TGGAGCGCCA CGACAGCGTC GCCTGGAGCC AGATACAACC TCGCTGCGAA GGACGCGCCC TGCTGCGCAT CCCGGTGCCG CTATCGCGGC GCCAGTGGGA GCAGCCCGGC GAGCGCCTGC CACCGCGGCC GGAGTTCTAC GACTTCTCGC TGGCCGACCA GGCCGCCGAT CAGGGCGAGC TGCTCGATCG CCCCCTGGCC CAGCTATCAT TCGTCATCTT CGATACCGAG ACCACCGGAC TGGCCCCCTC GGAAGGCGAC GAGATCATCT CCATCGCCGG GGTACGCATG GTCAATGGCC GCATCCTCGA GGGCGAGTGC TTCGAGCAGC TGGTCAACCC CGGGCGGCCG ATCCCCAAAG CGTCGATCAA GTTCCACGGC ATCCGCGACG AGATGGTCGC CGACAAGCCG GGGATCGCCA CGGTGCTGCC GCAGTTCAGC GCCTTCGTCG GCGATTCCGT GCTGGTCGCC CATAACGCCG CCTTCGACAT GAAGTTCATC CGCCTCAAGG AAGGTCAGTG CGGCCTGAAG TTCGAAAACC CGGTGCTCGA CACCCTGCTG CTGTCGGTCT TCCTGCACGA TCACACCCCT GAGCACACCC TGGAGGCCAT CGCCAACCGC CTGGGGGTGG AGATCAGCGG CCGCCACACG GCGCTGGGCG ACACCCTGGT CACCGGCGAG ATCTTCGCGC AGATGCTGCC GCTGCTCGAG GAGCGCGGCG TCACCACCCT GCGCGATGCG ATCAACGCCT CCGAACAGAT GGTCGAGGTC CGCAAGCAGC AGGCCCAGTT CTAA
|
Protein sequence | MASAMARRRL KFWAPALVLL LLIAAALATV GYLTLGHVPE GPERANALLA LGGAGAVLIG ATVAIWILLD ATVIRPLGAL ARGASIMAHS NPAHELEIPS MHLLGELPES LQTLGSNLYE TRREVAKALG SGAQGVENQK TRLEIVLREI QQGVIVCDTE GRVLLYNPAA GEILRSDALG LGRSIYDVLA RSAVDHTLEM LQHRLAIADD HTVAENRAEF VCATVDDGAL LHCRMSLLPS SSPLRSGFVL TIEDITRKIE GVARRDHALR TAVEALRPPL ANLRAAAESL GQGDEAMARE QRQSFETLMV HESRELSRRF ERMARETHRL VSAPWTMADI SSADLLASVL RRHPEGLPKV EVVGIPLWMH AESHSIGLVL EHMLRHLRGE LGIEQIRAEP LMGDQRVYLD LSWQGPPIPP DQLETWLDEE LPEATGQLTP RGVLERHDSV AWSQIQPRCE GRALLRIPVP LSRRQWEQPG ERLPPRPEFY DFSLADQAAD QGELLDRPLA QLSFVIFDTE TTGLAPSEGD EIISIAGVRM VNGRILEGEC FEQLVNPGRP IPKASIKFHG IRDEMVADKP GIATVLPQFS AFVGDSVLVA HNAAFDMKFI RLKEGQCGLK FENPVLDTLL LSVFLHDHTP EHTLEAIANR LGVEISGRHT ALGDTLVTGE IFAQMLPLLE ERGVTTLRDA INASEQMVEV RKQQAQF
|
| |