Gene Franean1_3121 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3121 
Symbol 
ID5671499 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3675017 
End bp3676975 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content68% 
IMG OID641242018 
Productthiamine pyrophosphate protein central region 
Protein accessionYP_001507438 
Protein GI158314930 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCGGCC AGCCCCGCGT CCACGGCCCC GACGAACGGC TGATCGACGT GCTGGCCGCG 
GTCCCCACCC CTGCTCCTCG GGAGGCACCG ATGAGACTCA CCATGGCACA GGCGCTCGTG
CGCTTCCTTG CCAACCAGTA CTCCGAGCGA GACGGCGTAG AGCAGCGGCT CGTCCCCGGC
ATGTGGGGCA TCTTCGGCCA CGGCAACGTC GCCGGCCTCG GGCAGGCGTT GTTACAGGCC
GTCCGCACCG GCGAAGCGGA TCTGCCGTAC TACCTGGCCC GCAACGAGCA GGGCCAGGTC
CACGCGGCCG CGGCGTTCGC CAAGATGCGC AACCGGCTGC AGGCCTTCGC GTGCACCGCG
TCTACCGGCC CCGGTTCGAC GAACATGATC ACCGGCGCGG CGCTGGCGAC CACCAACCGA
CTGCCCGTCC TGCTGCTGCC CAGCGACATG TTCGCGACCC GCTACCCGGA TCCGGTGCTG
CAACAGCTCG AGGACACCCG CGGCGGCGAC GTGACCGTAA ACGACGCTTT CCGCCCGGTG
TCGAAGTACT TCGACCGCAT CACCCGCCCC GAGCAAATGA TCCCAGCCGC TCTCGCTGCG
ATGCGGGTGC TCACCGATCC GGTCGAGACC GGCGCAGTGA CGCTCGCGCT GCCGCAGGAC
GTACAGGCCG AGGCCTACGA CTGGCCGGAG GACTTCTTCC GCCGGCGCGT GTGGCATGTC
AGACGTCCGG CCCCTGAGCC GGAGGCGCTC GCCAGGGCCG TCGAGCTGCT GCGAACTGCC
AGGTCACCGC TGATCGTCGC CGGTGGCGGT GTCGTGTACT CCGAGGGCGA GCGGGAACTG
CGGGCCTTCG CCGAGATGAC CGGCATCCCG GTGGCGGACA CACACGCAGG GAAGGGAGCG
GTGCCGTGGG ACCACCCGTG CGCGGTGGGT GGCATCGGCT CGACCGGCAC CTCCGCGGCC
AATGCACTCG CCGCCGGGGC AGATGTCGTG CTGGGTATCG GCACCCGCTA CAGCGACTTT
ACCACCGCAT CGCACACCGT CTTCAAGAAC CCTGACGTCA CGTTCGTGAA CCTCAACGTT
GCCCCGTTGG ACGCAGCGAA GCACTCCGCG GAAATGCTGG TAGCGGACGC CAAACGCGGC
ATCGTGGCAC TGCACAGGAC CCTGCGCGGC TGGCAGGTCG GCGACGCCTA CCGGTCCCGC
ACGCGGACCC TGGCCGACGA CTGGAACCGC AGGGTCGATG CCTGCGTCAC CCCCGGCCAC
GGCCCATATC CCGCCCAGAC GGAGATCCTT GGCGCACTCA ACAAGGCCCT CTGCGACCGG
GACGTGGTGA TCAACGCAGC CGGGTCGATG CCCGGTGACC TGCAGCTGCT GTGGCGGGCG
AGGGACCCCA AGGCCTACAA CGTTGAATAC GCCTATTCCT GCATGGGATA CGAGGTCGCC
GCCGGAGTCG GGACGAAGAT GGCCGCACCC GACCGTGACG TCGTCGTCCT GGTCGGCGAT
GGCTCCTACC TGATGATGGC GCAGGAAATC GTCACGATGG TCGCCGAGGG CCTCAAGGTG
ATCATCGTAC TGGTGCAGAA CCACGGATTC GCCTCGATCG GCTCACTGTC GGAATCCCTG
GGATCACAGC GATTCGGCAC ATCGTACCGC TACCGCGACA AGTACTCCGG CCTGCTCGAC
GGCGCTTTGC TGCCGATCGA CCTCGCGGCC AACGCGGCCA GCCTCGGCGC GACTGTGATC
CGGGCCGCCA CCGTCGTGGA GTTCACCACC GCGATCGCTG CGGCCAAAGC CAACACCACG
ACTACCGTCG TTCACGTCGA AACTGACCTT TTTGGCCCCA ACCCGCCCAG CTCAGCCTGG
TGGGACGTAC CGGTCTGCGA GGTCTCCGAG CTGGAATCAA CGCAGAAGGC CTACGAGACG
TACTCTGCCG CCAAGAACAC GCAGCGGCAC TACCTGTAG
 
Protein sequence
MGGQPRVHGP DERLIDVLAA VPTPAPREAP MRLTMAQALV RFLANQYSER DGVEQRLVPG 
MWGIFGHGNV AGLGQALLQA VRTGEADLPY YLARNEQGQV HAAAAFAKMR NRLQAFACTA
STGPGSTNMI TGAALATTNR LPVLLLPSDM FATRYPDPVL QQLEDTRGGD VTVNDAFRPV
SKYFDRITRP EQMIPAALAA MRVLTDPVET GAVTLALPQD VQAEAYDWPE DFFRRRVWHV
RRPAPEPEAL ARAVELLRTA RSPLIVAGGG VVYSEGEREL RAFAEMTGIP VADTHAGKGA
VPWDHPCAVG GIGSTGTSAA NALAAGADVV LGIGTRYSDF TTASHTVFKN PDVTFVNLNV
APLDAAKHSA EMLVADAKRG IVALHRTLRG WQVGDAYRSR TRTLADDWNR RVDACVTPGH
GPYPAQTEIL GALNKALCDR DVVINAAGSM PGDLQLLWRA RDPKAYNVEY AYSCMGYEVA
AGVGTKMAAP DRDVVVLVGD GSYLMMAQEI VTMVAEGLKV IIVLVQNHGF ASIGSLSESL
GSQRFGTSYR YRDKYSGLLD GALLPIDLAA NAASLGATVI RAATVVEFTT AIAAAKANTT
TTVVHVETDL FGPNPPSSAW WDVPVCEVSE LESTQKAYET YSAAKNTQRH YL