Gene Haur_2971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2971 
Symbol 
ID5734843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3747990 
End bp3751175 
Gene Length3186 bp 
Protein Length1061 aa 
Translation table11 
GC content53% 
IMG OID641280115 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_001545737 
Protein GI159899490 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0600156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGTTTG CAGCAGTTGA TCCCAAGGTG TCGTTTCCGA CGCTTGAGGA TGAGATCGCC 
GCTTGGTGGG AAGCCCATGG CATCGTCAAA AAGACGCTTG ACCATGGTGA TGCTTCGCGT
CCGTTTGTCT TTTTTGAAGG GCCACCCACG GCCAATGGCC GCCCGGGGAT TCACCACGTC
GAGGCTCGTT CGTCTAAAGA TATTATGGTG CGCTTCAACC GCATGCTTGG TAAAAAGGTC
ATCGGTGCTC GCGGTGGCTG GGATACCCAT GGCTTGCCAG TCGAACTGGA AGTTGAAAAG
AAATTAGGTT TCGCAGGCAA ACCCGACATC GAAAAATATG GCATCACCGA ATTTAATGCT
GCCTGCCGTC AATCAGTGTG GGATTACATC CAAGAATGGG AAAAGCTGAC CCAGCGGATC
GCCTTCTGGA TCGATCTCGA AGATCCCTAT ATCACCTATG ACAACAAATA TATCGAGTCG
TTGTGGTGGA TCTTCAAGCA ATTGCACGAG CGCGAATTAC TGTATCGTGA TTATAAAGTG
ACGATGCACT GCCCCCGCTG TGGCACATCG CTCTCGGATC ACGAGGTCGC CCAAGGCTAC
CAAGATAATA CCGACGATCC ATCGGTGTGG GTGCGCTTCC GCCATACGCC GAGCGACCAT
GCCTTAGATG CTCAAGTGGC CGATGCTGCT TTCTTGGCCT GGACAACTAC ACCTTGGACT
CTGCCAGCCA ACGCTGGTTT GGCGGTCAAT CCTGAGGCAA CCTATGTGCT AGCCGAGCAC
GAAGGCCAGC GCTATATCTT GGCTGAAGCT TTGGTTGGGG CGGTGTTGGG TGAAACTGCC
GCCACGCTTG CTAGCTTTGT TGGTGCTGAT CTGCGTGGCT TGCGCTATAC GCCGCTGTTT
CCTGGGGTTG GCGATAATGG CGCAGCGATC GATTTGAGCA GTGCTTATCG CGTCGTTGCC
GATGAGTTCG TCTCGTTGGA AGACGGTACG GGGATCGTGC ACATCGCGCC AGCCTATGGC
GACTTGGAAA TTGGTCGCAA GTATGGCTTG CCAACCTTGT TCTCGGTCGA GTTGGCGGGC
AAAGTGCTGG GCAGCTTTGA AAGTTTTGGC TTTGCAGGCA TGTTCTTCAA AGAAGCTGAT
CCAAAAATCA CGCGCTATTT GAAAGAACAA GGCTTGTTGT TCAAATCAGG GCGAGTGCTG
CACACCTATC CATTCTGTTG GCGCTGTAAA ACACCGTTGC TGTTCTATGC CAAGCAATCG
TGGTATATCC GCACGACGGC CTTGAAGCAA CAATTGATCG CCAACAACAA AAAGATCAAT
TGGGTGCCTG AGCATATCCA AGCTGGGCGT TTTGGCAACT GGCTCGAAAA TAACATCGAC
TGGGCGATCA GCCGTGAGCG CTATTGGGGC ACGCCGCTAC CAGTTTGGAC ATGCGATACC
TGTCAGCATA TCGATGTGGT CGGTTCGTTG GCGGAGCTTG GCGAACGTTG GGGTCAAGAT
ACTGCCAATC TGGATATGCA CCGTCCATTT GTTGATGCAC CTAGCTGGTC GTGCCCTGAG
TGCGAAGCTG GCACGATGCA GCGCATTCCC GATGTGGCTG ATTGTTGGTT TGATTCAGGT
GCGATGCCAG TGGCGCAATG GCATTATCCG TTTGAAAATC AAGAACTGTT TGAAGTTGCT
GGCCAAGCCG ATTTCATCTC GGAAGCAATC GACCAAACTC GTGGCTGGTT CTACACCTTG
CACGCAGTTT CAACCTTGCT CTTCGATCGC CCAGCCTACA AGAATGTGAT CTGTTTGGGT
CACTTGTTGG ATGGCAAAGG CGAGAAGATG TCCAAATCCA AGGGCAACAT CGTTTCGCCA
TGGGAAATGG TCGAGAAGTA TGGCGCTGAT GCGGTGCGTT GGTATATGTT TGCTGCTGGT
CAGCCTTACA ATCCACGCCG TTTCTCAGCC GATTTGGTCA GCGAATCGTT GCGCCAGTTC
TTGTTGACCT TGTGGAATAC CTATAGCTTT TTCACAACTT ACGCCAATGT CGATGGTTGG
ACACCTGAAT TGGCGCAAGG CGATTTGGCG GCGATCGATC GCTGGGCCTT GGCACGGCTC
AACGCATTGG TGCGCGATGT GCGCAACGAT CTGAGCAATT ACGATATGAA CACACCAGCC
AAGCGGCTCG AACAATTCGT TGACGAGCTT TCAAACTGGT ATGTGCGACG TAATCGGCGG
CGTTTCTGGG GCAGCGACAT GAACGGCGAT AAGCAAGCTG CCTATTCAAC CTTGTACACC
TGTTTGGTCA CGATTTCCAA ACTGATGGCT CCATTCACGC CATTTGTGGC CGAATCGCTG
TATCAAAATT TAGTGCGCAG CTACGACCAA ACTGCCGCCG AAAGCGTGCA TATGGCGCTG
TATCCTGAAG CCAATTTGGC GCTGATCGAT GAAGAATTGA TTCGCAAAAC CGACTTATTG
CTCAAGGCTG TGAGCTTGGG CCGCGCCGCC CGCAAGAACG CTGGCATACG CGTGCGTCAG
CCACTGAGCG AAGTGTTGGT GCGCTTGCCA CGCGGCGAAC AACTTGATGA ACTAAGCGCT
GAACTGAGTG ACGAACTCAA TATCAAGTCG GTGCGCTGGC TTGGGGTGGG CGATGGCTTG
GTCAGTTATC GCTTCAAGCC CAATTTGCGC TCGGTTGGCA AAAAGTTTGG CAAACTGGTT
CCAGCATTGC GCGAAGTGTT GGCGAATCTT AGCAGCGAGC AAGCCGCCGA TGCAGCGCAT
AAAGTTGAAA CCGGGGCCAG CTTCGAGGTT GTAGTCGAAG GCGAAACATT AACCTTGGCC
GCCGACGATG TATTGATGGA AGCTTCATCG CCCGAAGGCT ACGCTGTGGC CGAAGGCGAG
GGCTTGTTAG TAGCCTTAGT TACGACGCTG ACTGATGAAT TGCTGCGCGA AGGCATCGCC
CGCGAAATCG TGCGTAACCT CAATGATGCG CGTAAGGCCG CTGATCTAGC GATCACCGAT
CGTATCAATG CAACCTTGGG AACCGAGGTT GATTTGGCGG CAGTCGTAGC TGAATATGCC
GAGTACATCA AGGCCGAAAC CTTGTGCGAG GTGTTGAGCG TTGGCGATGC CAATGCTGAC
CATCACACCA GCAGCATGGA ATTGGAACAA GGCAAACTAA GCCTCGGCAT TAGCAAAATC
GGCTAA
 
Protein sequence
MAFAAVDPKV SFPTLEDEIA AWWEAHGIVK KTLDHGDASR PFVFFEGPPT ANGRPGIHHV 
EARSSKDIMV RFNRMLGKKV IGARGGWDTH GLPVELEVEK KLGFAGKPDI EKYGITEFNA
ACRQSVWDYI QEWEKLTQRI AFWIDLEDPY ITYDNKYIES LWWIFKQLHE RELLYRDYKV
TMHCPRCGTS LSDHEVAQGY QDNTDDPSVW VRFRHTPSDH ALDAQVADAA FLAWTTTPWT
LPANAGLAVN PEATYVLAEH EGQRYILAEA LVGAVLGETA ATLASFVGAD LRGLRYTPLF
PGVGDNGAAI DLSSAYRVVA DEFVSLEDGT GIVHIAPAYG DLEIGRKYGL PTLFSVELAG
KVLGSFESFG FAGMFFKEAD PKITRYLKEQ GLLFKSGRVL HTYPFCWRCK TPLLFYAKQS
WYIRTTALKQ QLIANNKKIN WVPEHIQAGR FGNWLENNID WAISRERYWG TPLPVWTCDT
CQHIDVVGSL AELGERWGQD TANLDMHRPF VDAPSWSCPE CEAGTMQRIP DVADCWFDSG
AMPVAQWHYP FENQELFEVA GQADFISEAI DQTRGWFYTL HAVSTLLFDR PAYKNVICLG
HLLDGKGEKM SKSKGNIVSP WEMVEKYGAD AVRWYMFAAG QPYNPRRFSA DLVSESLRQF
LLTLWNTYSF FTTYANVDGW TPELAQGDLA AIDRWALARL NALVRDVRND LSNYDMNTPA
KRLEQFVDEL SNWYVRRNRR RFWGSDMNGD KQAAYSTLYT CLVTISKLMA PFTPFVAESL
YQNLVRSYDQ TAAESVHMAL YPEANLALID EELIRKTDLL LKAVSLGRAA RKNAGIRVRQ
PLSEVLVRLP RGEQLDELSA ELSDELNIKS VRWLGVGDGL VSYRFKPNLR SVGKKFGKLV
PALREVLANL SSEQAADAAH KVETGASFEV VVEGETLTLA ADDVLMEASS PEGYAVAEGE
GLLVALVTTL TDELLREGIA REIVRNLNDA RKAADLAITD RINATLGTEV DLAAVVAEYA
EYIKAETLCE VLSVGDANAD HHTSSMELEQ GKLSLGISKI G