Gene Mfla_0091 details


Gene Information

Locus tag: Mfla_0091
Symbol: valS
ID: 3999597
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methylobacillus flagellatus KT
Kingdom: Bacteria
Replicon accession: NC_007947
Strand:
Start bp: 98809
End bp: 101640
Gene Length: 2832 bp
Protein Length: 943 aa
Translation table: 11
GC content: 60%
IMG OID: 637936981
Product: valyl-tRNA synthetase
Protein accession: YP_544203
Protein GI: 91774447
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID: [TIGR00422] valyl-tRNA synthetase
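
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon excluded from the protein length. Neither assumption is stated in the record, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 98809
end_bp = 101640
gene_length_bp = 2832
protein_length_aa = 943

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted as an amino acid.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("gene length is a whole number of codons:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks print True: 101640 - 98809 + 1 = 2832, and 2832 / 3 - 1 = 943.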


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTACCG ATATCTCTGC AAGCCAAACC CTGGACAAAT CCTTTGAACC CAAGAACATA 
GAAAGCCGCT GGTACCAATT CTGGGAAGCG CGCGGCTACT ATGCTGCCGG GCTGGATTCC
AGCAAACAAG ATAATTTCTG CATCCTGCTG CCACCCCCTA ACGTGACGGG CACGCTGCAC
ATGGGGCACG GTTTCAACCA AACCATCATG GATGCATTGA CGCGTTATCA CCGCATGCGC
GGCGCCAATA CCCTGTGGCA GCCGGGCACC GATCATGCCG GTATCGCCAC GCAGATCGTC
GTCGAGCGCC AGCTCGATGC GCAGGGTATC AGCCGTCACG ACCTCGGCCG CGAGAAATTT
CTCGAGAAGG TCTGGGAATG GAAGGAATAT TCCGGCGGCA GCATCACCAA GCAGATGCGC
CGCCTGGGCA CCTCGCCGGA CTGGAGCCGC GAGCGCTTCA CCATGGATGA GGGCCTGTCG
CGCACTGTCA CCGAGACTTT CGTCCGCCTC TACAACGAGG GGTTGATCTA TCGCGGCAAG
CGCCTGGTGA ACTGGGATCC CAAGCTGCAT ACCGCTGTCT CCGACCTCGA AGTGATTTCC
GAGGAGGAAG ATGGCCATCT CTGGCATATC CGCTATACAT TGGCGGACGG CGATGGCGAG
CTGACCGTCG CCACGACGCG TCCTGAAACC ATGCTCGGTG ACGTTGCCGT GATGGTCCAT
CCCGAGGATG AACGCTATGC GCACCTGATC GGCAAGCACG TCAAGCTGCC GTTGTGCGAC
CGCGAGATCC CGGTGATTGC GGACGATTAC GTGGACCGCG AGTTTGGTAC CGGCGTGGTC
AAGGTGACAC CTGCCCATGA CTTCAACGAC TATGCCGTCG GGCAACGCCA CAAGTTGCCG
CTGATCAGCA TCCTCACGCT TGATGGGCAT ATCAACGATG CCGCCCCGGT GCAATACCAG
GGACTGGAGC GCTTTGCCGC GCGTAAGCAG ATCGTGGCCG ACCTCGAAGC GCAGGGTTAT
TTGGTCAAGG TCGATAAACA CAAGCTCAAG GTGCCGCGCG GCGACCGTAC CGGCGTCGTG
ATCGAGCCCA TGCTCACCGA CCAGTGGTTC GTCGCCATGA GCAAGCCGGG CGCGGATGGC
AAAAGCATTA CCCAAAAAGC GCTGGAAGTG GTCGCCAACG GCGAGATCCG CTTCGTGCCG
GAAAACTGGG TCAATACCTA CAACCAGTGG CTCAACAACA TCCAGGATTG GTGCATCTCG
CGCCAGCTCT GGTGGGGGCA CCAGATTCCC GCCTGGTACA GCGACGACGG CAAGGTCTAT
GTCGCGCATG ATGAGGCAGA GGCAAAACAG CTTGCTGCCA ACGATGGCTA TCAAGGCCAC
TTGAAGCGCG ACGAGGATGT GCTTGATACC TGGTATTCAT CCGCCTTGTG GCCGTTTTCC
ACATTGGACT GGACCGGGGA CGAGGAAAAG GACAAGGCCA ACCTGGCCTT GCAGCAGTTC
CTGCCGTCCT CGGTGCTGGT CACCGGCTTC GACATCATTT TCTTCTGGGT GGCGCGCATG
GTGATGATGA CCAAGCACAT CACCGGCAAG ATTCCGTTCA AGGATGTTTA CGTACATGGG
CTGATCCGCG ATGCGGAAGG GCAGAAAATG TCCAAGTCCA AGGGCAACGT GCTTGACCCG
ATCGACCTGA TCGACGGCAT CGGCATTGAG GAGCTGGTGA AAAAGCGCAC TACCGGCCTC
ATGAATCCCA AGCAGGCCGA GCAGATCGAG AAGCGCACGC GCAAGGAGTT CCCGGAAGGC
ATTCCCGCAT TCGGTACCGA TGCCCTGCGC TTCACTTTTG CCTCGCTGGC CTCGCCGGGC
CGCGACATCA AGTTCGACCT GCAGCGTTGC GAAGGCTACC GCAACTTCTG CAACAAGCTC
TGGAATGCCG CACGTTTCGT GCTCATGAAT ACCCAGGGCA AGGATTGTGG CCTGGAAGAC
TGCAAGACCC AGCCTGAGGG CTACCTGGAT TTCTCGCAGG CTGACCGCTG GATCGTCAGC
CTGCTGCAGC GCACCGAAGC CGATATCGAA CGCGGCTTTG CCGAATACCG GTTCGACAAT
GTGGCGCAGG CGATCTATAA GTTCGTCTGG GATGAATATT GCGACTGGTA CCTCGAACTG
GCCAAGGTGC AACTGCAGAA CGGAGGCGAA GCCCAGCAGC GTGCCACGCG GCGCACCCTG
CTGCGCGTAC TGGAAACCAT ACTACGCCTT GCGCACCCCT TGATGCCGTT CATTACCGAG
GAAATCTGGC AAATCGTCGG CCCGCTGTCT GGCCGTACCG GGCCCAGTAT CATGCTGGAG
CAATACCCCG TCAGTCAGCC GGCCAAGCTG GACGAGCAGG CGGAAGCCTG GGTCGCCTTG
CTCAAGGAGT CGGTGGACGC CTGCCGCAGC CTGCGCGGCG AAATGAATGT ATCACCTGCT
GCGCGGGTGC CGCTGATCGC CGCCGGGGAC GATGAAAAGC TGGCGGCCTA TGCTCCCTAC
CTCAAGGCGC TGGCCAAGCT GAGTGACGTC GAGATCATGG CTGAGCTGCC CGAGGCGGAA
GCCCCCGTGG CATTGGCGGG TGATTTCAAA CTCATGCTCA AGATCGAGAT CGACGTGACT
GCTGAGAGGG AGCGCCTGGG CAAGGAAATC AGCCGCCTGG AGGGTGAGGT ATCCAAGGCG
CAGGCCAAGC TGGGGAATGA AAGCTTCGTG GCGCGCGCGC CAGCTGCCGT GGTCGAACAG
GAGAAAGCCA GGTTGGCCGC CTTCAGTGAT ACGCTTTCCA AGCTGCAGGC GCAGCTTGCC
AAACTGAAGT AG
 
Protein sequence
MTTDISASQT LDKSFEPKNI ESRWYQFWEA RGYYAAGLDS SKQDNFCILL PPPNVTGTLH 
MGHGFNQTIM DALTRYHRMR GANTLWQPGT DHAGIATQIV VERQLDAQGI SRHDLGREKF
LEKVWEWKEY SGGSITKQMR RLGTSPDWSR ERFTMDEGLS RTVTETFVRL YNEGLIYRGK
RLVNWDPKLH TAVSDLEVIS EEEDGHLWHI RYTLADGDGE LTVATTRPET MLGDVAVMVH
PEDERYAHLI GKHVKLPLCD REIPVIADDY VDREFGTGVV KVTPAHDFND YAVGQRHKLP
LISILTLDGH INDAAPVQYQ GLERFAARKQ IVADLEAQGY LVKVDKHKLK VPRGDRTGVV
IEPMLTDQWF VAMSKPGADG KSITQKALEV VANGEIRFVP ENWVNTYNQW LNNIQDWCIS
RQLWWGHQIP AWYSDDGKVY VAHDEAEAKQ LAANDGYQGH LKRDEDVLDT WYSSALWPFS
TLDWTGDEEK DKANLALQQF LPSSVLVTGF DIIFFWVARM VMMTKHITGK IPFKDVYVHG
LIRDAEGQKM SKSKGNVLDP IDLIDGIGIE ELVKKRTTGL MNPKQAEQIE KRTRKEFPEG
IPAFGTDALR FTFASLASPG RDIKFDLQRC EGYRNFCNKL WNAARFVLMN TQGKDCGLED
CKTQPEGYLD FSQADRWIVS LLQRTEADIE RGFAEYRFDN VAQAIYKFVW DEYCDWYLEL
AKVQLQNGGE AQQRATRRTL LRVLETILRL AHPLMPFITE EIWQIVGPLS GRTGPSIMLE
QYPVSQPAKL DEQAEAWVAL LKESVDACRS LRGEMNVSPA ARVPLIAAGD DEKLAAYAPY
LKALAKLSDV EIMAELPEAE APVALAGDFK LMLKIEIDVT AERERLGKEI SRLEGEVSKA
QAKLGNESFV ARAPAAVVEQ EKARLAAFSD TLSKLQAQLA KLK
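
The gene and protein sequences above can be checked against each other by translating the DNA. The following is a minimal sketch, not the method used to produce this record. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical. Only the first and last lines of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon; '*' marks a stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGACTACCG ATATCTCTGC AAGCCAAACC CTGGACAAAT CCTTTGAACC CAAGAACATA"
last_line = "AAACTGAAGT AG"

print(translate(first_line))  # MTTDISASQTLDKSFEPKNI
print(translate(last_line))   # KLK*
print(round(gc_fraction(first_line), 2))
```

The first 20 residues and the final three residues match the start and end of the protein sequence listed above, and the last codon (TAG) is a stop codon. Passing the full gene sequence to `gc_fraction` would give the value to compare with the GC content field.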