Gene Franean1_4721 details

Gene Information

Locus tag: Franean1_4721
Symbol:
ID: 5673063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5635419
End bp: 5636474
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 65%
IMG OID: 641243578
Product: transketolase central region
Protein accession: YP_001508994
Protein GI: 158316486
COG category: [C] Energy production and conversion
COG ID: [COG0022] Pyruvate/2-oxoglutarate dehydrogenase complex, dehydrogenase (E1) component, eukaryotic type, beta subunit
TIGRFAM ID:
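
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that the coordinates are 1-based and inclusive, and that the gene span includes the terminal stop codon; the function name cds_lengths is hypothetical.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for a CDS.

    Assumptions (not stated in the record): coordinates are 1-based and
    inclusive, and the span includes the terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


# Values taken from the Start bp and End bp fields of this record.
print(cds_lengths(5635419, 5636474))  # (1056, 351)
```

Under these assumptions the result agrees with the reported Gene Length (1056 bp) and Protein Length (351 aa).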


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTGTTC GCCTTCTCGA TCATTGGACC TGCATAGAGG TGGACGTGGA CGAGCAACGG 
ATGACGATGC GCGAGGCGTT GAACCTCGCT TTGGACCAGG CGTTGGCGCG TGATGAGCGT
GTCTTCCTGC TGGGGGAGGA TATTGCCGAT CCCGGCTCCA GTGGGCCGAC CAAGGGGCTG
TCGACCAAAT ACGGCGCGGA TCGGGTGCTT GACACGCCGA TTTCCGAAGC GGCGATCGTG
GGTGCCGCGA TCGGTGCCGC GATGGAGGGG TTCCGCCCGG TCGCCGAAAT CATGATCATG
GATTTTATTG GTATCGCCGC CGATCAGATC GTCAATCATG CGGCGAAGCT GCGGTTCATG
ACCGGGGGGC GGACGACCGC GCCGATCACG GTCCGGACGC AGGTCTATGG CGGGTTGGGG
ACCGGGGCGA CGCACTCGCA GTCGTTGGAG GCGTGGTTCA TGCATGTTCC GGGGCTGAAG
GTGATCGTGC CTTCGACGCC GCGGGACGCC AAGGGTCTTC TCGCGTCGGC GATTTTTGAT
GATGACCCGT GCGTCTTCCT GGAGACGATT CGTCTGCAGG GCCAGCGCGG GTTGGTGCCG
GTGGATCCGG GATTCTCGAT TCCGCTGGGT CAGGCGGACG TGAAACGGCC GGGTACTGAT
GTCACGTTGA TCGGCTATGG CCGGGGGGTG GTCGAGTCGC TCGGCGCGGC GGCCGTGCTC
GAGGCGGAGG GGGTCAGCGC CGAGGTGCTT GATCTGCGCA CCCTGGTTCC GCTCGATGTG
CCGGCGATGG TTGATTCGGT GCGTCGAACG AGGCGTGCTG TGGTGGTGCA TGACGCGGTG
CGGTTCGCCG GGCCCGGGGC GGAGATCGCC GCGATTCTGC AGCGCGAGCT CTTCGGTGTT
CTTGAGGCGC CGGTGGAGCG GGTGGGTGCC CGGTTCGTGC CGAATCCGGC CCCGCCGGCA
CTGGAATCGC AGATATACCC GAGCACCGAA CGTATTGTCG CGGCGGTCCA GCAGACGCTG
CGGTCGGAAA CGACCAGGGA CGGTGCCTGT GGCTGA
 
Protein sequence
MTVRLLDHWT CIEVDVDEQR MTMREALNLA LDQALARDER VFLLGEDIAD PGSSGPTKGL 
STKYGADRVL DTPISEAAIV GAAIGAAMEG FRPVAEIMIM DFIGIAADQI VNHAAKLRFM
TGGRTTAPIT VRTQVYGGLG TGATHSQSLE AWFMHVPGLK VIVPSTPRDA KGLLASAIFD
DDPCVFLETI RLQGQRGLVP VDPGFSIPLG QADVKRPGTD VTLIGYGRGV VESLGAAAVL
EAEGVSAEVL DLRTLVPLDV PAMVDSVRRT RRAVVVHDAV RFAGPGAEIA AILQRELFGV
LEAPVERVGA RFVPNPAPPA LESQIYPSTE RIVAAVQQTL RSETTRDGAC G
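
The protein sequence corresponds to a translation of the gene sequence with translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code. The following is a minimal sketch of that translation and of a GC-content calculation, not the database's own pipeline. The names clean, translate, and gc_content are hypothetical, and the code assumes an unambiguous A/C/G/T sequence read in frame from the first base.

```python
BASES = "TCAG"
# Standard codon assignments (shared by translation table 11), listed in
# TCAG order for the first, second, and third codon positions.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate in frame from the first base, stopping at the first stop codon.

    Raises KeyError on codons containing anything other than A, C, G, or T.
    """
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_content(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACTGTTC GCCTTCTCGA TCATTGGACC"))  # MTVRLLDHWT
```

The printed peptide matches the first ten residues of the protein sequence. Passing the full gene sequence to translate and gc_content allows the same comparison against the complete protein sequence and the GC content field.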