Gene Franean1_5200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5200 
SymbolvalS 
ID5673534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6240968 
End bp6243598 
Gene Length2631 bp 
Protein Length876 aa 
Translation table11 
GC content72% 
IMG OID641244054 
Productvalyl-tRNA synthetase 
Protein accessionYP_001509464 
Protein GI158316956 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0525] Valyl-tRNA synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.725067 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGGCAA CGGACAGTAC GGAACGGCCG ACCCGCGCGG TGCCCGCCAA GCCGAGCTTG 
GACGGGATCG AGACCCGCTG GTCCGAGCAG TGGCACGAGC GCGGGACCTA TGAGTTCGAC
CGGTCGGCGC CGCGGGAGCG GGTCTACTCG GTCGACACCC CGCCACCGAC CGTCAGTGGC
TCGCTGCACG TAGGCCACGT GTTCTCGTAC ACCCACACAG ACCTGATCGC CCGCTTCCAG
CGGATGCGCG GGCGCGAGGT CTTCTACCCC ATGGGGTTCG ACGACAACGG GCTGCCGACC
GAGCGCCGGG TGCAGAACTA CTACGGCGTG CGGTGCGACC CGTCGCTGCC CTACGACCCG
GATTTCACCC CGCCGGAGAA GCCGGGCAAG GACCAGCTCC CGATCTCCCG GCGCAACTTC
GTCGAGCTCT GCGAGACGCT GACGATCGAG GACGAGAAGG GCTTCGAGGA GCTCTGGCGC
CGTCTCGGGC TCTCCGTCGA CTGGTCGCAC ACCTACGCGA CCATCGACAC CCGGTCCCGC
GCGGTGGCGC AGCGGGCCTT CCTGCGTAAC CTCGCCCGCG GCCAGGCCTA CCTGGCCGAG
GCCCCGACGC TGTGGGACGT CACGTTCCGC ACCGCGGTCG CCCAGGCCGA GCTGGAGGAC
AGGGAGCGCC CCGGCGCGTA CCACACGCTC GCGTTCGGGC GGCCCGGCGG CGACCCGGTG
GTGATCGAGA CGACCCGCCC GGAGCTGCTG CCGGCCTGCG TGGCACTCGT GGCGCATCCC
GACGACGCGC GCTACCAGCC GCTGTTCGGG CAGACGGTGC GCACGCCGCT GTTCGACGTC
GAGGTGCCGG TGGTGGCGCA CCGGCTGGCT GATCCGGAGA AGGGCTCCGG CATCGCCATG
ATCTGCACCT TCGGTGACCT CACCGACGTC ATCTGGTGGC GTGAGCTGCG GCTGCCCGCC
CGCGCGGTGA TCGGCCGGGA CGGCCGCCTG ATCGCCGAGG CCCCCGAGGC GATCTCGTCG
CCGGCCGGCC GGGAGCACTA CGCCCAGCTC GGCGGGAAGA CGATCGCCGC CGCGCGGACG
GCGATCGTGG CGGCCCTGCA GGAGTCCGGC GCGCTGCTCG GTGAGCCGCG GGCGATCAAT
CACCCGGTCA AGTTCTACGA GAAGGGCGAC CGGCCGCTCG AGATCGTCAC CACCCGCCAG
TGGTACATCC GCAACGGCGG CCGGGACGAG CAGTTGCGTG AGCAGCTGCG CCAGCGTGGC
AGCGAGCTGC GCTGGCACCC CGAGTACATG CGGGTGCGCT ACGAGAACTG GGTGGACGGC
CTCAACGGCG ACTGGCTGAT CAGCCGGCAG CGTTTCTTCG GTGTGCCCTT CCCGGTCTGG
TATCCGCTCG ACGCGGACGG CCAGCCCCGC CACGGCGCCC CGATCGTCGC GGCCGAGGCC
ACCCTGCCCG TGGACCCGTC CAGTGACGTC CCCCCGGGCT ACACGGCCGA GCAGCGGGGC
GTGCCCGGCG GCTTCATGGC CGACCCCGAC GTCATGGACA CCTGGGCCAC CTCCTCGCTG
ACGCCGCTGA TCGCCAGTGG CTGGGAGAGC GACCCGGCGC TGTTCGCGCA GGTGTTCCCG
ATGAACCTGC GCCCCCAGGC GCACGAGATC ATCCGGACCT GGTTGTTCTC GACCGTGCTG
CGCAGCCACG ACGAGTTCGG CACGCTGCCC TGGACGGACG CGGCGATCTC GGGTTGGATC
CTCGACCCGG ACCGCAAGAA GATGTCGAAG TCCAAGGGCA ACGTGGTCAC GCCCATGGGC
CTGCTCGAGC AGCACGGCTC GGACGCCGTC CGCTACTGGG CGGCCTCGGG CCGTCCGGGC
ACCGACACGG CCTTCGACGT CGGCCAGATG AAGACCGGCC GCCGGCTCGC GATCAAGATT
CTCAACGCGA GCCGGTTCGT GCTGGGCCTC GCCGAGGCGG GCGAGGGCGA GGCCTTCGAG
GGCGACGCGG CCCTCGCGGC CGACGCCGCA CCCGGCCCGG CGGCGATCAC CGAACCGCTG
GACAGAGCGC TGCTCGCCGA GCTCGCCCGG GTCGTCGACA CGGCCACCGG CGCGTTCGAG
GGCTACGACT ACACCCGGGC GCTCGAGGTC ACCGAGACCT TCTTCTGGCG GTTCTGCGAC
GACTACGTCG AGCTGGTCAA GGGCCGGGCG TACGGCGGGC ACGGGGAGGC CGGCGCCGCC
TCCGCCCGGG CGACGCTGTC GATCGCGCTG TCGACGCTGC TCACCCTGTT CGCGCCGTTC
CTGCCGTTCG TCACCGAGGA GGTCTGGTCC TGGTGGCGGG CGGGCTCGGT GCACCGAGCC
CGCTGGCCGG AGGCCGCCGA GGTCCGCGAG GCCGCCGGGG ACGGCCGTCC CGAGCTGCTG
GACGCGGTCG GCGCCGCGCT CTCCGCGGTG CGCCGGGCCA AGTCCGAGGC GAAGGTGTCG
ATGAAGGCCG AGGTCGCCAC CGCCCGGGTC AGCGGTGACG CCGACGTCGT CGCGCTGGTC
GGGCAGGCTG CCGTGGACCT GCGCAGCGCC GGCGGGATCT GGGAGCTGAG CCTGGTCGGC
ACCGGCGGCG AGCTGGCCGT CGACGTGGTG CTCGTCCCGC CGGCCGGCTG A
 
Protein sequence
MVATDSTERP TRAVPAKPSL DGIETRWSEQ WHERGTYEFD RSAPRERVYS VDTPPPTVSG 
SLHVGHVFSY THTDLIARFQ RMRGREVFYP MGFDDNGLPT ERRVQNYYGV RCDPSLPYDP
DFTPPEKPGK DQLPISRRNF VELCETLTIE DEKGFEELWR RLGLSVDWSH TYATIDTRSR
AVAQRAFLRN LARGQAYLAE APTLWDVTFR TAVAQAELED RERPGAYHTL AFGRPGGDPV
VIETTRPELL PACVALVAHP DDARYQPLFG QTVRTPLFDV EVPVVAHRLA DPEKGSGIAM
ICTFGDLTDV IWWRELRLPA RAVIGRDGRL IAEAPEAISS PAGREHYAQL GGKTIAAART
AIVAALQESG ALLGEPRAIN HPVKFYEKGD RPLEIVTTRQ WYIRNGGRDE QLREQLRQRG
SELRWHPEYM RVRYENWVDG LNGDWLISRQ RFFGVPFPVW YPLDADGQPR HGAPIVAAEA
TLPVDPSSDV PPGYTAEQRG VPGGFMADPD VMDTWATSSL TPLIASGWES DPALFAQVFP
MNLRPQAHEI IRTWLFSTVL RSHDEFGTLP WTDAAISGWI LDPDRKKMSK SKGNVVTPMG
LLEQHGSDAV RYWAASGRPG TDTAFDVGQM KTGRRLAIKI LNASRFVLGL AEAGEGEAFE
GDAALAADAA PGPAAITEPL DRALLAELAR VVDTATGAFE GYDYTRALEV TETFFWRFCD
DYVELVKGRA YGGHGEAGAA SARATLSIAL STLLTLFAPF LPFVTEEVWS WWRAGSVHRA
RWPEAAEVRE AAGDGRPELL DAVGAALSAV RRAKSEAKVS MKAEVATARV SGDADVVALV
GQAAVDLRSA GGIWELSLVG TGGELAVDVV LVPPAG