Gene RPD_4038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRPD_4038 
SymbolileS 
ID4024555 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRhodopseudomonas palustris BisB5 
KingdomBacteria 
Replicon accessionNC_007958 
Strand
Start bp4487627 
End bp4490632 
Gene Length3006 bp 
Protein Length1001 aa 
Translation table11 
GC content65% 
IMG OID637964241 
Productisoleucyl-tRNA synthetase 
Protein accessionYP_571158 
Protein GI91978499 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0060] Isoleucyl-tRNA synthetase 
TIGRFAM ID[TIGR00392] isoleucyl-tRNA synthetase
[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.270009 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.410472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGACA AGTCTGAAAA ATCCGACGTC AACGACTATT CCAAGACGCT CTTCCTGCCG 
CAGACGGAGT TTCCGATGCG CGCCGGGCTG CCGCAGCGCG AGCCGGAATT GCTCAAGCGC
TGGGAGGAGA TCGACCTCTA CGGCAAGCTG CGCGAAGCTG CCCGGGGCCG CGCCAAATTC
GTGCTGCACG ACGGCCCGCC TTACGCCAAC GGCAACATCC ATATCGGCCA CGCGCTCAAC
AAGATCCTCA AGGACGTCGT GACCAAGAGC CAGCAGATGC TCGGCTACGA TTCCAACTAT
GTGCCGGGCT GGGATTGTCA CGGCCTGCCG ATCGAATGGA AGATCGAGGA GGAGAACTAC
CGCTCCAAAG GCAAGCCGAA GCCGAACTTC AAGGATTCGA CGGCGATGGT CGCGTTCCGC
AAGGAATGCC GGGCGTACGC GACGAAGTGG CTCAACGTGC AGCGCGAGGA ATTCAAGCGG
CTCGGCCTGA TCGGCGACTG GGACCATCCC TACGCCACGA TGGACTACTT TGCCGAAGCC
CAGATCGCCC GCGAACTGAT GAAATTCGCC GCCAACGGCA CGCTGTATCG CGGCAGTAAG
CCGGTGATGT GGAGCGTGGT CGAAAAGACC GCGCTGGCCG AGGCCGAGGT CGAGTACGAG
GACTACACTT CGGATACGGT GTGGGTGAAG TTTGCGGTGA AGTCCGGCGA TGCGGCGGTG
AGCAGCGCCA GCGTCGTGAT CTGGACCACC ACGCCGTGGA CGCTGCCGGG CAATCGCGCG
ATCAGCTTCT CGTCCAAGAT CGGCTACGGT CTCTACAAGG TCACCGACGC GCCGGCTGAT
AATTGGGCCA AGACTGGCGA TCTGCTGATC CTCGCCGATG CGCTCGCTGA AAGCGTATTC
AAGCAGGCGC GTGTCGCTGC GTATGACAAG GTTTCGGCCG TTGCACCTGA CGTCCTGAAG
GCGATCGAAT GCGCGCATCC GCTGCGCGGC CTCGCCGGCG GCTATGAATT CACCGTCCCG
CTGCTCGATG GCGATCACGT CACCGACGAC ACCGGCACCG GCTTCGTCCA CACCGCGCCC
GGCCATGGCC GCGAGGACTT CGACATCTGG ATGCACAATG CGCGCGCGCT CGAGGCGCGC
GGCATCTCCT CGGCGATCCC CTACACCGTC GACGAGAACG GCGCGTTCAC GGAGCATGCG
CCAGGCTTCG TCGGCAAGCG GGTGATCAAC GACAAGGGCG AGAAGGGCGA CGCCAACGAG
GCGGTGATTC AAGAATTGAT CGCGTGCGGC GCGCTGCTGG CGCGCGGCAA GCTCAAGCAT
CAATATCCGC ATTCGTGGCG CTCGAAGAAG CCGGTGATCT TCCGCAACAC GCCGCAATGG
TTCATCGCGA TGGACAAGGA CATCGCCGAT GGCGATGCCG CGAAGCCCGG CGACACGCTG
CGCGCCCGCG CGCTGCAGGC AATCTCGGTC ACCCAATGGG TGCCGGCGGC GGGGCAAAAC
CGCATCAACG GCATGATCTC CGGCCGGCCC GACTGGGTGA TCTCGCGGCA GCGCGCCTGG
GGCGTGCCGA TCGCGGTGTT CGTGCGCGAG AAGGGCGACG GCTCCGCCGA GATTCTCGTC
AATGACGAGG TCAACAAGCG CATCGCCGAC GCCTTCGTCG AAGAGGGCGC CGACGCCTGG
TACATGGAAG GCGCGCGCGA TCGGTTCCTC GGCTCGCTCG CCAATGAAGA CTGGCAGAAG
GTCGACGATA TTCTCGATGT CTGGTTCGAC TCGGGCTCGA GCCACGCTTT CGTGCTGGAA
GATCCGGTTC ACTTCCCCGG CCTCGCCGGC ATCCGCCGCA AGGTCGACGG CGGCGCCGAC
ACCGTGATGT ATCTCGAAGG CTCGGACCAG CATCGCGGCT GGTTCCACTC GTCGCTGCTG
GAGAGCTGCG GCACCCGCGG CCGCGCGCCC TACGACGTGG TGCTGACCCA CGGCTTCACG
CTCGACGAGC AGGGCCGCAA GATGTCGAAG TCGATCGGCA ACACGGTCGA GCCGCAGAAG
GTGATCGCGC AATCCGGCGC CGACATCCTG CGGCTGTGGG TGTGCGCCAC CGACTACGCC
GACGATCAGC GCATCGGCCC GGAAATCCTC AAGAACGTGG TCGAGACCTA TCGCAAGCTG
CGCAACTCGA TCCGCTGGAT GCTCGGCACG CTGCATCATT TCAAGCGCGA CGAGGCGGTG
GCGTTCGCCG ACATGCCCGA GCTGGAGCGG CTGATGCTGC ATCAGCTCGC CGAACAAAGT
GCTGTGGTGC GCGCCGCCTA TGCCGAGTTC GACTACAAGA CCGTGGTCGC CTCGCTCGCC
ACCTTCATGA ACACCGAATT GTCGGCGTTC TATTTCGACA TCCGCAAGGA CACGCTGTAT
TGCGACCCGC CGTCCTCGCT GGCGCGCAAG GCGGCGCTGA CCGCGATCGA CATCATCTGC
GACGCGGTCC TGAAATGGCT GGCGCCGGTG CTGTCCTTCA CCGCCGACGA GGCGTGGGCG
ATGGTTCGCC CTGACGCCGA GCCGAGCGTG CATCTGACGC TGTTCCCGCT CGAACTCGGC
GCCTATCGCG ACGATGCGCT GGCGAAGAAG TGGACGCTGA TCCGCGCGGT CCGCCGCGTC
GTCACCGGCG CGTTGGAAGT CGAGCGCGCA GCGAAGCGGA TCGGCTCATC GCTCGAAGCT
TCGCCGATGA TCTATCTGCC CGAACAATTC ATGGGCGACA TTTTCGACGT CGATTGGGCC
GAAATCTGCA TCACCTCGAA CGCAATGGTC GAGATCCTGC GCGGCAACGA CACGCCGCCG
GCGGATGCCT TCCGGCTGCC GGAGCTGGCC AACGTCGCGG TGGTGGTCGA ACGCGCGCAG
GGCGCCAAAT GCGCCCGCTC CTGGAAGATC CTGTCGAGCG TCGGCAGCGA TCCCGACTAT
CCCGACGTCT CGCCGCGCGA CGCCCAGGCG CTGCGCGAGT GGAAGGCGCT GGGGGCTCCG
GTCTGA
 
Protein sequence
MSDKSEKSDV NDYSKTLFLP QTEFPMRAGL PQREPELLKR WEEIDLYGKL REAARGRAKF 
VLHDGPPYAN GNIHIGHALN KILKDVVTKS QQMLGYDSNY VPGWDCHGLP IEWKIEEENY
RSKGKPKPNF KDSTAMVAFR KECRAYATKW LNVQREEFKR LGLIGDWDHP YATMDYFAEA
QIARELMKFA ANGTLYRGSK PVMWSVVEKT ALAEAEVEYE DYTSDTVWVK FAVKSGDAAV
SSASVVIWTT TPWTLPGNRA ISFSSKIGYG LYKVTDAPAD NWAKTGDLLI LADALAESVF
KQARVAAYDK VSAVAPDVLK AIECAHPLRG LAGGYEFTVP LLDGDHVTDD TGTGFVHTAP
GHGREDFDIW MHNARALEAR GISSAIPYTV DENGAFTEHA PGFVGKRVIN DKGEKGDANE
AVIQELIACG ALLARGKLKH QYPHSWRSKK PVIFRNTPQW FIAMDKDIAD GDAAKPGDTL
RARALQAISV TQWVPAAGQN RINGMISGRP DWVISRQRAW GVPIAVFVRE KGDGSAEILV
NDEVNKRIAD AFVEEGADAW YMEGARDRFL GSLANEDWQK VDDILDVWFD SGSSHAFVLE
DPVHFPGLAG IRRKVDGGAD TVMYLEGSDQ HRGWFHSSLL ESCGTRGRAP YDVVLTHGFT
LDEQGRKMSK SIGNTVEPQK VIAQSGADIL RLWVCATDYA DDQRIGPEIL KNVVETYRKL
RNSIRWMLGT LHHFKRDEAV AFADMPELER LMLHQLAEQS AVVRAAYAEF DYKTVVASLA
TFMNTELSAF YFDIRKDTLY CDPPSSLARK AALTAIDIIC DAVLKWLAPV LSFTADEAWA
MVRPDAEPSV HLTLFPLELG AYRDDALAKK WTLIRAVRRV VTGALEVERA AKRIGSSLEA
SPMIYLPEQF MGDIFDVDWA EICITSNAMV EILRGNDTPP ADAFRLPELA NVAVVVERAQ
GAKCARSWKI LSSVGSDPDY PDVSPRDAQA LREWKALGAP V