Gene Information |
Locus tag | Haur_2971 |
Symbol | |
ID | 5734843 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 3747990 |
End bp | 3751175 |
Gene Length | 3186 bp |
Protein Length | 1061 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641280115 |
Product | isoleucyl-tRNA synthetase |
Protein accession | YP_001545737 |
Protein GI | 159899490 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0060] Isoleucyl-tRNA synthetase |
TIGRFAM ID | [TIGR00392] isoleucyl-tRNA synthetase |
|
|
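The length fields above are mutually consistent, which a short sketch can confirm. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the terminal stop codon is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return gene_length_bp // 3 - 1


# Values copied from the table above.
length_bp = gene_length(3747990, 3751175)
print(length_bp)                  # 3186
print(protein_length(length_bp))  # 1061
```

Both results match the Gene Length and Protein Length rows.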
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.0600156 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGTTTG CAGCAGTTGA TCCCAAGGTG TCGTTTCCGA CGCTTGAGGA TGAGATCGCC GCTTGGTGGG AAGCCCATGG CATCGTCAAA AAGACGCTTG ACCATGGTGA TGCTTCGCGT CCGTTTGTCT TTTTTGAAGG GCCACCCACG GCCAATGGCC GCCCGGGGAT TCACCACGTC GAGGCTCGTT CGTCTAAAGA TATTATGGTG CGCTTCAACC GCATGCTTGG TAAAAAGGTC ATCGGTGCTC GCGGTGGCTG GGATACCCAT GGCTTGCCAG TCGAACTGGA AGTTGAAAAG AAATTAGGTT TCGCAGGCAA ACCCGACATC GAAAAATATG GCATCACCGA ATTTAATGCT GCCTGCCGTC AATCAGTGTG GGATTACATC CAAGAATGGG AAAAGCTGAC CCAGCGGATC GCCTTCTGGA TCGATCTCGA AGATCCCTAT ATCACCTATG ACAACAAATA TATCGAGTCG TTGTGGTGGA TCTTCAAGCA ATTGCACGAG CGCGAATTAC TGTATCGTGA TTATAAAGTG ACGATGCACT GCCCCCGCTG TGGCACATCG CTCTCGGATC ACGAGGTCGC CCAAGGCTAC CAAGATAATA CCGACGATCC ATCGGTGTGG GTGCGCTTCC GCCATACGCC GAGCGACCAT GCCTTAGATG CTCAAGTGGC CGATGCTGCT TTCTTGGCCT GGACAACTAC ACCTTGGACT CTGCCAGCCA ACGCTGGTTT GGCGGTCAAT CCTGAGGCAA CCTATGTGCT AGCCGAGCAC GAAGGCCAGC GCTATATCTT GGCTGAAGCT TTGGTTGGGG CGGTGTTGGG TGAAACTGCC GCCACGCTTG CTAGCTTTGT TGGTGCTGAT CTGCGTGGCT TGCGCTATAC GCCGCTGTTT CCTGGGGTTG GCGATAATGG CGCAGCGATC GATTTGAGCA GTGCTTATCG CGTCGTTGCC GATGAGTTCG TCTCGTTGGA AGACGGTACG GGGATCGTGC ACATCGCGCC AGCCTATGGC GACTTGGAAA TTGGTCGCAA GTATGGCTTG CCAACCTTGT TCTCGGTCGA GTTGGCGGGC AAAGTGCTGG GCAGCTTTGA AAGTTTTGGC TTTGCAGGCA TGTTCTTCAA AGAAGCTGAT CCAAAAATCA CGCGCTATTT GAAAGAACAA GGCTTGTTGT TCAAATCAGG GCGAGTGCTG CACACCTATC CATTCTGTTG GCGCTGTAAA ACACCGTTGC TGTTCTATGC CAAGCAATCG TGGTATATCC GCACGACGGC CTTGAAGCAA CAATTGATCG CCAACAACAA AAAGATCAAT TGGGTGCCTG AGCATATCCA AGCTGGGCGT TTTGGCAACT GGCTCGAAAA TAACATCGAC TGGGCGATCA GCCGTGAGCG CTATTGGGGC ACGCCGCTAC CAGTTTGGAC ATGCGATACC TGTCAGCATA TCGATGTGGT CGGTTCGTTG GCGGAGCTTG GCGAACGTTG GGGTCAAGAT ACTGCCAATC TGGATATGCA CCGTCCATTT GTTGATGCAC CTAGCTGGTC GTGCCCTGAG TGCGAAGCTG GCACGATGCA GCGCATTCCC GATGTGGCTG ATTGTTGGTT TGATTCAGGT GCGATGCCAG TGGCGCAATG GCATTATCCG TTTGAAAATC AAGAACTGTT TGAAGTTGCT GGCCAAGCCG ATTTCATCTC GGAAGCAATC GACCAAACTC GTGGCTGGTT CTACACCTTG CACGCAGTTT CAACCTTGCT CTTCGATCGC CCAGCCTACA AGAATGTGAT CTGTTTGGGT 
CACTTGTTGG ATGGCAAAGG CGAGAAGATG TCCAAATCCA AGGGCAACAT CGTTTCGCCA TGGGAAATGG TCGAGAAGTA TGGCGCTGAT GCGGTGCGTT GGTATATGTT TGCTGCTGGT CAGCCTTACA ATCCACGCCG TTTCTCAGCC GATTTGGTCA GCGAATCGTT GCGCCAGTTC TTGTTGACCT TGTGGAATAC CTATAGCTTT TTCACAACTT ACGCCAATGT CGATGGTTGG ACACCTGAAT TGGCGCAAGG CGATTTGGCG GCGATCGATC GCTGGGCCTT GGCACGGCTC AACGCATTGG TGCGCGATGT GCGCAACGAT CTGAGCAATT ACGATATGAA CACACCAGCC AAGCGGCTCG AACAATTCGT TGACGAGCTT TCAAACTGGT ATGTGCGACG TAATCGGCGG CGTTTCTGGG GCAGCGACAT GAACGGCGAT AAGCAAGCTG CCTATTCAAC CTTGTACACC TGTTTGGTCA CGATTTCCAA ACTGATGGCT CCATTCACGC CATTTGTGGC CGAATCGCTG TATCAAAATT TAGTGCGCAG CTACGACCAA ACTGCCGCCG AAAGCGTGCA TATGGCGCTG TATCCTGAAG CCAATTTGGC GCTGATCGAT GAAGAATTGA TTCGCAAAAC CGACTTATTG CTCAAGGCTG TGAGCTTGGG CCGCGCCGCC CGCAAGAACG CTGGCATACG CGTGCGTCAG CCACTGAGCG AAGTGTTGGT GCGCTTGCCA CGCGGCGAAC AACTTGATGA ACTAAGCGCT GAACTGAGTG ACGAACTCAA TATCAAGTCG GTGCGCTGGC TTGGGGTGGG CGATGGCTTG GTCAGTTATC GCTTCAAGCC CAATTTGCGC TCGGTTGGCA AAAAGTTTGG CAAACTGGTT CCAGCATTGC GCGAAGTGTT GGCGAATCTT AGCAGCGAGC AAGCCGCCGA TGCAGCGCAT AAAGTTGAAA CCGGGGCCAG CTTCGAGGTT GTAGTCGAAG GCGAAACATT AACCTTGGCC GCCGACGATG TATTGATGGA AGCTTCATCG CCCGAAGGCT ACGCTGTGGC CGAAGGCGAG GGCTTGTTAG TAGCCTTAGT TACGACGCTG ACTGATGAAT TGCTGCGCGA AGGCATCGCC CGCGAAATCG TGCGTAACCT CAATGATGCG CGTAAGGCCG CTGATCTAGC GATCACCGAT CGTATCAATG CAACCTTGGG AACCGAGGTT GATTTGGCGG CAGTCGTAGC TGAATATGCC GAGTACATCA AGGCCGAAAC CTTGTGCGAG GTGTTGAGCG TTGGCGATGC CAATGCTGAC CATCACACCA GCAGCATGGA ATTGGAACAA GGCAAACTAA GCCTCGGCAT TAGCAAAATC GGCTAA
|
Protein sequence | MAFAAVDPKV SFPTLEDEIA AWWEAHGIVK KTLDHGDASR PFVFFEGPPT ANGRPGIHHV EARSSKDIMV RFNRMLGKKV IGARGGWDTH GLPVELEVEK KLGFAGKPDI EKYGITEFNA ACRQSVWDYI QEWEKLTQRI AFWIDLEDPY ITYDNKYIES LWWIFKQLHE RELLYRDYKV TMHCPRCGTS LSDHEVAQGY QDNTDDPSVW VRFRHTPSDH ALDAQVADAA FLAWTTTPWT LPANAGLAVN PEATYVLAEH EGQRYILAEA LVGAVLGETA ATLASFVGAD LRGLRYTPLF PGVGDNGAAI DLSSAYRVVA DEFVSLEDGT GIVHIAPAYG DLEIGRKYGL PTLFSVELAG KVLGSFESFG FAGMFFKEAD PKITRYLKEQ GLLFKSGRVL HTYPFCWRCK TPLLFYAKQS WYIRTTALKQ QLIANNKKIN WVPEHIQAGR FGNWLENNID WAISRERYWG TPLPVWTCDT CQHIDVVGSL AELGERWGQD TANLDMHRPF VDAPSWSCPE CEAGTMQRIP DVADCWFDSG AMPVAQWHYP FENQELFEVA GQADFISEAI DQTRGWFYTL HAVSTLLFDR PAYKNVICLG HLLDGKGEKM SKSKGNIVSP WEMVEKYGAD AVRWYMFAAG QPYNPRRFSA DLVSESLRQF LLTLWNTYSF FTTYANVDGW TPELAQGDLA AIDRWALARL NALVRDVRND LSNYDMNTPA KRLEQFVDEL SNWYVRRNRR RFWGSDMNGD KQAAYSTLYT CLVTISKLMA PFTPFVAESL YQNLVRSYDQ TAAESVHMAL YPEANLALID EELIRKTDLL LKAVSLGRAA RKNAGIRVRQ PLSEVLVRLP RGEQLDELSA ELSDELNIKS VRWLGVGDGL VSYRFKPNLR SVGKKFGKLV PALREVLANL SSEQAADAAH KVETGASFEV VVEGETLTLA ADDVLMEASS PEGYAVAEGE GLLVALVTTL TDELLREGIA REIVRNLNDA RKAADLAITD RINATLGTEV DLAAVVAEYA EYIKAETLCE VLSVGDANAD HHTSSMELEQ GKLSLGISKI G
|
| |
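The relationship between the gene sequence and the protein sequence can be sketched with a minimal translator. It relies on two assumptions. First, the gene sequence is listed in coding orientation even though the strand is "-" (it begins with ATG and ends with TAA). Second, translation table 11 assigns the same amino acids to codons as the standard code, differing only in permitted start codons. The names `CODON_TABLE` and `translate` are illustrative and not part of the source database.

```python
# Codon table in NCBI order: bases T, C, A, G for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # The record prints the sequence in space-separated blocks of ten bases.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three ten-base blocks of the gene sequence above.
print(translate("ATGGCGTTTG CAGCAGTTGA TCCCAAGGTG"))  # MAFAAVDPKV
```

The output matches the first ten residues of the protein sequence above. Passing the full gene sequence should reproduce the complete 1061-residue protein under the same assumptions.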