Gene Francci3_1314 details


Gene Information

Locus tag: Francci3_1314
Symbol: valS
ID: 3906586
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1576324
End bp: 1579035
Gene Length: 2712 bp
Protein Length: 903 aa
Translation table: 11
GC content: 71%
IMG OID: 637878647
Product: valyl-tRNA synthetase
Protein accession: YP_480420
Protein GI: 86740020
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0525] Valyl-tRNA synthetase
TIGRFAM ID:
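
The coordinate and length fields above are mutually consistent, which a short check can illustrate. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1576324
end_bp = 1579035

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 2712 903
```

Both results agree with the listed Gene Length (2712 bp) and Protein Length (903 aa).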


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.558712
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCACGA TCGAGGCGCC CGTAGGATCG TTACTTGTGG TTGCGACAGA CGGTACGGAA 
CGGTCCACCC GCGCGGTCCC GGCGAAGCCG AGCCTGGACG GCATCGAGGC GCGCTGGTCG
ACGGTATGGC AGGACGAGGG CACCTACGCG TTCGACCGGA CGGTCGACCG CGGGCAGGTC
TACTCCATCG ACACCCCGCC GCCGACCGTC AGCGGTTCCC TGCACGTGGG GCACGTCTTC
TCGTACACCC ACACCGACCT GATCGCCCGC TACCAGCGGA TGCGGGGTCG CGAGGTCTTC
TACCCGATGG GCTTCGACGA CAACGGGCTG CCGACCGAGC GCCGGGTCCA GAACTACTTC
GCTGTCCGCT GCGAGCCGTC GCTGCCCTAC GACCCGGGTT TCACTCCGCC GGACAAGCCC
GGCAAGGAGC AGGTCCCGAT CTCGCGACGC AACTTCGTCG AGCTCTGCGA GCGGCTGACC
CTGGAGGACG AGAAGGGCTT CGAGCAGCTC TGGCGCAGGC TGGGGCTCTC CGTCGACTGG
TCGTACACCT ACGCGACGAT CGACGCCCGG TCGCGGACCG TTGCGCAGCG GGCCTTTCTG
CGCAACCTCG CTCGGGGGGA GGCCTATCTT GCCGAGGCCC CGACGCTGTG GGACGTCACC
TTCCGTACCG CGGTGGCCCA GGCCGAGCTG GAGGACCGGG ACCGCCCCGG CGCGTACCAC
AGCCTGGCCT TCCCCCGGTC GGGCGTCTCC GCCCCGAGCA GCGGGCTGGA CCCGGCGGGC
TCCGGGGATC TGGCGGGCTC CGGGGATCTG GCGGACCCGA TTGTCATCGA GACGACCCGT
CCCGAGCTGC TGCCCGCCTG CGTCGCCCTG GTCGCCCATC CGGACGACCC GCGTTACCAG
TCGTTGTTCG GCGCCACGGT GCGGACCCCG CTGTTCGACG TTGAGGTCCC GGTGGTCGCG
CACCGGTTGG CCGACCCCGA AAAGGGATCC GGCATCGCGA TGATCTGCAC CTTCGGCGAC
CTCACCGACG TGACGTGGTG GCGCGAGCTG CGGCTGCCGG CGCGTGCGGT GATCGGCCGT
GACGGCCGGT TCATCCCGGA CCCACCGGGC GCCGTCACCT CCGCGGCCGG CCGGGAGCGG
TACGCCGAGC TCGCCGGAAA GACCGTGCAC AGCGCCCGGG CGCGGATCGT CGAACTGCTC
GCCGGGTCCG GCGCGCTGCT CGGCGATCCG CGGCCGATCA CCCATCCGGT CAAGTTCTAC
GAGAAGGGCG ACCGGCCGCT GGAGATCGTG ACGACCCGCC AGTGGTACCT CCGCAACGGC
GGGCGGGACG AGCCGCTGCG GGCCGCGCTG CGTGGACGCG GCCAGGAGCT GCGCTGGTAC
CCGGACTACA TGAAGGTCCG CTACGAGAAC TGGGTCGACG GCCTCAACGG CGACTGGCTG
ATCAGCCGGC AGCGGTTCTT CGGCGTGCCG TTCCCGGTGT GGTACCCGCT GGACGCCGCC
GGTGCGCCGC GGTATGACGC GCCGATCGTG CCCGACGAGG CCTCGCTGCC GGTCGACCCG
TCCAGCGACG TCCCGCCCGG GTACGCCGCC GAGCAGCGGG GCGTGCCCGG CGGGTTTGCC
GCCGACCCCG ACGTGATGGA CACCTGGGCG ACGTCTTCGC TCACCCCACT GATCGCCAGC
GGCTGGGAGC ACGATACCGA TCTGTTCTCC CGGGTCTTCC CGATGGACCT GCGTCCGCAG
GCGCACGAGA TCATCCGGAC CTGGTTGTTC TCGACGGTGG TGCGCACCCA CGCCGAGTTC
GACGTGCTGC CCTGGTCGAA CGCGGCGATC TCCGGTTGGA TCCTCGACCC GGACCGTAAG
AAGATGTCGA AGTCCAAGGG CAACGTGGTG ACCCCGATGG GCCTGCTCGA GGAGCATGGG
TCGGACGCGG TCCGTTATTG GGCGGCGTCC GGGCGGCCGG GTACCGACAC CGCCTTCGAT
GTCGGCCAGA TGAAGAACGG CCGCCGGCTC GCCATCAAGA TCCTCAACGC CAGCCGGTTC
GCGCTCGGTC TTGCCGTGAC CCCCGACGCC GGGGTCTCCG ACGCCGAGGT CTCCGACGCC
GAGGCGGCCC CCGCCCCGGC CAGTGAACCG CTGGACCGTG CCCTGCTGGC CGCCTTGGCC
GACGTGGTCG ACGCCGCCAC CGCGGCCTTC GACGGCTACG ACTACGCCCG GGCCCTCGAG
GTCACCGAGT CGTTCTTCTG GCGGTTCTGT GACGACTACG TCGAGCTTGT GAAGGGCCGG
GCGTACGGCT CCTCCGGGGC GGCGGGCGCG GCCTCCGCGC ACACCACGCT CTCGGTGGCC
CTGTCAGCGC TGCTCCGGCT GTTCGCCCCG GTCCTGCCGT TCGTCACCGA GGAGGTCTGG
TCCTGGTGGC GTCCCGGTTC GGTGCACCGG GCGAGCTGGC CGGACGCCGC CGAGATCCGC
AAGCTCGCCG AGGGCGCCCG CGTGGAGCTC CTCGGTGCCG TCGGGGCGGC GCTGTCCGGT
GTCCGCCGGG CCAAGTCCGA GGCGAAGGTG TCGCAGAAGG CCGAGGTCGC GAGCGTCCGG
ATCAGCGGCG AGGCCGACCT CGTCGCCCTC GTCGAGTCGG TGGCCGCGGA CCTGCGCAGC
GCCGGCAGCA TCCGGGAGCT CACGCTGAGC CCGACCGGCG GCGAGATCGC CGTCGCCGTC
GAACTGGCCT GA
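
The GC content field can in principle be recomputed from the gene sequence. The sketch below is one hedged way to do so: `gc_content` is a hypothetical helper (not part of the source database) that strips the block spacing used above and returns the fraction of G and C bases. Only the first row of the sequence is used here for brevity, so its value is not expected to equal the whole-gene figure of 71%; pasting the full sequence would be needed for that comparison.

```python
def gc_content(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string.

    Whitespace (the 10-base block spacing and line breaks) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGACCACGA TCGAGGCGCC CGTAGGATCG TTACTTGTGG TTGCGACAGA CGGTACGGAA"
print(f"{gc_content(first_row):.0%}")
```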
 
Protein sequence
MTTIEAPVGS LLVVATDGTE RSTRAVPAKP SLDGIEARWS TVWQDEGTYA FDRTVDRGQV 
YSIDTPPPTV SGSLHVGHVF SYTHTDLIAR YQRMRGREVF YPMGFDDNGL PTERRVQNYF
AVRCEPSLPY DPGFTPPDKP GKEQVPISRR NFVELCERLT LEDEKGFEQL WRRLGLSVDW
SYTYATIDAR SRTVAQRAFL RNLARGEAYL AEAPTLWDVT FRTAVAQAEL EDRDRPGAYH
SLAFPRSGVS APSSGLDPAG SGDLAGSGDL ADPIVIETTR PELLPACVAL VAHPDDPRYQ
SLFGATVRTP LFDVEVPVVA HRLADPEKGS GIAMICTFGD LTDVTWWREL RLPARAVIGR
DGRFIPDPPG AVTSAAGRER YAELAGKTVH SARARIVELL AGSGALLGDP RPITHPVKFY
EKGDRPLEIV TTRQWYLRNG GRDEPLRAAL RGRGQELRWY PDYMKVRYEN WVDGLNGDWL
ISRQRFFGVP FPVWYPLDAA GAPRYDAPIV PDEASLPVDP SSDVPPGYAA EQRGVPGGFA
ADPDVMDTWA TSSLTPLIAS GWEHDTDLFS RVFPMDLRPQ AHEIIRTWLF STVVRTHAEF
DVLPWSNAAI SGWILDPDRK KMSKSKGNVV TPMGLLEEHG SDAVRYWAAS GRPGTDTAFD
VGQMKNGRRL AIKILNASRF ALGLAVTPDA GVSDAEVSDA EAAPAPASEP LDRALLAALA
DVVDAATAAF DGYDYARALE VTESFFWRFC DDYVELVKGR AYGSSGAAGA ASAHTTLSVA
LSALLRLFAP VLPFVTEEVW SWWRPGSVHR ASWPDAAEIR KLAEGARVEL LGAVGAALSG
VRRAKSEAKV SQKAEVASVR ISGEADLVAL VESVAADLRS AGSIRELTLS PTGGEIAVAV
ELA
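
The protein sequence corresponds to the gene sequence translated under translation table 11. As a minimal sketch, the code below builds the codon table from the NCBI-style amino acid string (table 11 assigns the same amino acids to codons as the standard table; it differs in the permitted start codons, which this sketch does not model) and translates the first row of the gene sequence. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G).
# Assumption: table 11 shares these assignments with the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence above (60 bases, 20 codons).
first_row = "ATGACCACGA TCGAGGCGCC CGTAGGATCG TTACTTGTGG TTGCGACAGA CGGTACGGAA"
print(translate(first_row))  # MTTIEAPVGSLLVVATDGTE
```

The output matches the first 20 residues of the protein sequence listed above (MTTIEAPVGS LLVVATDGTE).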