Gene GM21_2685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_2685 
SymbolvalS 
ID8138027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp3123632 
End bp3126298 
Gene Length2667 bp 
Protein Length888 aa 
Translation table11 
GC content63% 
IMG OID644870289 
Productvalyl-tRNA synthetase 
Protein accessionYP_003022479 
Protein GI253701290 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID[TIGR00422] valyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.00000000391784 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCAAACA AAGAGCTGGA AAAGGTTTAC GAGCCGAAAA GCGTCGAAGA GCGGTGGTAC 
CAGCAGTGGG AGCAGAAGGG GTACTTCCAC GCGACTCTCC CCTCCGACAA GCCCGGCTAC
AGTATCGTGA TCCCCCCCCC GAACATCACC GGCGTCTTGC ACATGGGGCA CGCGCTCAAC
AATACGCTGC AGGATATCCT GTGCCGCTGG AAACGTATGG CCGGCTACAA CGTCCTCTGG
ATGCCGGGGA CCGACCATGC CGGCATTGCT ACCCAGAACG TGGTTGAGCG CCAGCTGGCG
GCAGAGGGGA AGGACCGTTT CGAGCTGGGA CGCGAGGCGT TCATCGAGCG GGTCTGGCAG
TGGAAGGGCG AATCAGGCGG ACAGATCATC GGCCAGTTGA AGCGCCTTGG GGCCTCCTGC
GACTGGGAGC GCGAGCGCTT CACCATGGAC GCTGGCCTCT CGAAGGCGGT GCGCGAGGTG
TTCGTGCGCC TGTACCAGGA GAAGCTCATC TACCGCGACA ACCGCCTGAT CAACTGGTGC
CCGCGCTGCC ATACCGCGCT CTCCGACATC GAGGTCGAGC ACGAAGAGAA GGCGGGGCAC
CTGTGGCACC TGCGCTACCC GGTGGTCGGG ACCGGCGATT ATCTGGTGGT CGCCACCACC
CGCCCGGAAA CCATGTTGGG CGACACCGCG GTGGCGGTAC ACCCCGAGGA CGAGCGCTAC
GCGCACCTGA TCGGCAAGAT GGTGCTGCTC CCCCTGGTCA ACCGCGAGAT CCCGATCATC
GCCGACGACT ACGTCGACCG CGAGTTTGGC ACCGGCGTGG TCAAGATCAC CCCGGCCCAC
GACTTCAACG ATTTCGAGAT GGGGGTCAGG CACAACCTGG ACCGCATCAA CGTCTTCGAC
GAGTCCGGCG TCGTTAACGC AGCCGGCAAA CAGTACGAGG GGATGGAGCG CTTCGCCGCC
AGGAAGCAGG TGGTGGCCGA CCTCGAAGCA GCGGGGCTTT TGGAAAAGAT CCAGGACCAC
GCGCTTTCCG TCGGCGGTTG CTACCGCTGC AAGACGGTCG TCGAGCCGTA CATGTCGCTG
CAGTGGTACG TGAAGGTCGC ACCGCTGGCC GAGCGAGCTT TGGGCGCCGT CAAGGACGGG
CGCACGAAGA TCGTCCCGCA GCAATGGGAG AACACCTACT ACGACTGGAT GGAGAACATC
CGCGACTGGT GCATCTCGCG CCAGATCTGG TGGGGGCACC GCATCCCCGC CTGGTACTGC
GACCACTGCG GCCACATCAC GGTGGCGAAG GACGATCCGA CTTGCTGCGA CGAGTGCGGC
TCCGACGAGA TCCGCCAGGA AACCGACGTG CTCGACACCT GGTTCTCCTC GGCGCTTTGG
CCCTTCTCCA CCATGGGGTG GCCGGAGAAG ACCCCGGAGC TTGCCTCCTT CTACCCGACC
TCGTGCCTGG TCACCGGTTT CGACATCCTC TTCTTCTGGG TGGCCCGCAT GATGATGATG
GGGCTCCACT TCATGGACGA GGTCCCCTTC ACCGACGTCT ACATCCATGC CCTGGTGCGC
GACGCGCAGG GGCAGAAGAT GTCCAAGTCC AAGGGGAACG TGATCGATCC CTTGACCGTG
ATCGACGCCT ACGGCACCGA TGCCTTCCGC TTCACCCTGG CCGCCTTCGC GGCGCAGGGT
CGCGACATAA AGCTCGCGGA GGAGAGGATC GCCGGCTACC GCAACTTCGC CAACAAGATC
TGGAACGCCT CCCGCTTCGC CATGATGAAC CTGGAAGGGT TCGACCCGAA CGCGGTGGAT
CCGGCTTCGC TCAAGCTTTC CAACGCCGAC CGCTGGATCC TCTACCGGCT GAACCAGACC
ACCGTCTCGG TCGACGCAGC ACTCGCATCC TTCCGTTTCA ACGAGGCGGC CAACGACCTG
TACCGCTTCA CCTGGAGCGA GTTCTGCGAC TGGTACATCG AGCTCGCCAA GGACGACCTC
TACAAGGGGG ACGCCGACAG GCAGGCTTCG GCGAAATACG TGCTCTGGCT GGTGCTGGAG
AACCTTTTGC GCCTTTTGCA CCCGTTCATG CCGTTCATCA CCGAGGAGAT CTGGCAGGCG
CTTCCGAAAA TGGACGGCTC CGCCGAGTCG ATCATGATCT CCAGCTTCCC GGCTGCGTGC
GCGGAGTGGG AGGGTTACGC CGCTGCCGCC GCCGAGATGG ATCTGGTCAT GGAGGTGATC
AAGGGGATCA GGAACATCCG CGGCGAGATG GAGGTGCCCC CGAGCAAGCA GATCGCCGCC
ATCCTCGACT GCAAGTCCGA GGCGAGCCTC GCCCTCTTGA AGCGCAACGA AGCGTACGTC
ATGAGCCTTG CGCGTCTCTC CGACCTCGGC ATCGGCCAGG GGATCGAGCG TCCCGCAGAA
GCTTCACTGC AGGTCGCGGG CGACGTCGAG ATCATCGTGC CGCTCAGGGG GCTCGTGAAC
GTCGAGGAAG AGGAGAAGCG TCTAGGTAAA GAGATCGCCA AGATCGAGAA GGACATCGAG
TTCCTCTCCA AGAAGCTTGA GAACCCCAGC TTCGTGGAGC GCGCCCCCGC CGACGTGGTC
GAGAAGGAGC GCGAGAAGAT AGGCGAGTTC GCCAACAAGA AGAAGCTCCT AGAGGAGAGC
CTGGAGAAGA TCCAGAGGCT CAGGTAA
 
Protein sequence
MANKELEKVY EPKSVEERWY QQWEQKGYFH ATLPSDKPGY SIVIPPPNIT GVLHMGHALN 
NTLQDILCRW KRMAGYNVLW MPGTDHAGIA TQNVVERQLA AEGKDRFELG REAFIERVWQ
WKGESGGQII GQLKRLGASC DWERERFTMD AGLSKAVREV FVRLYQEKLI YRDNRLINWC
PRCHTALSDI EVEHEEKAGH LWHLRYPVVG TGDYLVVATT RPETMLGDTA VAVHPEDERY
AHLIGKMVLL PLVNREIPII ADDYVDREFG TGVVKITPAH DFNDFEMGVR HNLDRINVFD
ESGVVNAAGK QYEGMERFAA RKQVVADLEA AGLLEKIQDH ALSVGGCYRC KTVVEPYMSL
QWYVKVAPLA ERALGAVKDG RTKIVPQQWE NTYYDWMENI RDWCISRQIW WGHRIPAWYC
DHCGHITVAK DDPTCCDECG SDEIRQETDV LDTWFSSALW PFSTMGWPEK TPELASFYPT
SCLVTGFDIL FFWVARMMMM GLHFMDEVPF TDVYIHALVR DAQGQKMSKS KGNVIDPLTV
IDAYGTDAFR FTLAAFAAQG RDIKLAEERI AGYRNFANKI WNASRFAMMN LEGFDPNAVD
PASLKLSNAD RWILYRLNQT TVSVDAALAS FRFNEAANDL YRFTWSEFCD WYIELAKDDL
YKGDADRQAS AKYVLWLVLE NLLRLLHPFM PFITEEIWQA LPKMDGSAES IMISSFPAAC
AEWEGYAAAA AEMDLVMEVI KGIRNIRGEM EVPPSKQIAA ILDCKSEASL ALLKRNEAYV
MSLARLSDLG IGQGIERPAE ASLQVAGDVE IIVPLRGLVN VEEEEKRLGK EIAKIEKDIE
FLSKKLENPS FVERAPADVV EKEREKIGEF ANKKKLLEES LEKIQRLR