Gene Franean1_7069 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7069 
Symbol 
ID5675379 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8627887 
End bp8629962 
Gene Length2076 bp 
Protein Length691 aa 
Translation table11 
GC content73% 
IMG OID641245914 
Productphosphoenolpyruvate-protein phosphotransferase 
Protein accessionYP_001511305 
Protein GI158318797 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1080] Phosphoenolpyruvate-protein kinase (PTS system EI component in bacteria)
[COG1925] Phosphotransferase system, HPr-related proteins 
TIGRFAM ID[TIGR01003] Phosphotransferase System HPr (HPr) Family
[TIGR01417] phosphoenolpyruvate-protein phosphotransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones27 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTGTA GGACGGCGAC CGTGAATTCC CGCGGCGGGT TACACGCCCG CCCCGCAGCG 
CTCTTCGTCC GGGCCGCCGC GCAGCAGGCG GTCCCGGTTC GGATCCGCAA GGGCGACGGC
CCGGCCGTGA ACGCCGCCAG CATGCTGTCG GTGCTGGCCC TGGGCGCGAT GTACGGAACG
GTCGTGACGC TGGAAGCGGA CGGTGAGCGG GCCGAGGAGG CGCTCGATGC TCTCGCCGCG
ATCCTTGCCC ACGACCAGGA CGCCGCCGAC AGCGGGCAAG GAGACGGGCG GGTCGGGAGC
GCCAGCGCGT CCGACAAGGC GTTAGACGAG ACGGGCGGGA CCGGCAGGAG CCGGGCGACG
GCTGACGTCT TCGTCGGACT CGGTGTCAGC CCCGGGCTGG TCACCGGTCC CGCCTTCCGG
ATGGCGCGTC AACCCCGGCT GCCCGACCCG CGCCTCGTCC TCGATCCCGA CGAGGAGGCC
GCTGCCGCCG CGCGGGTACT CCGGAACGTG GCCTCCGACC TGCGGGCCCA CGCAGCGGCG
ACCCGGCTCT CCGCTGCCTC GGACATTGTG ACGGCACAGG CGATGATGGC GGAAGACCCG
GTGCTCCTCG ACGAGGTCGG CATGCGGGTC CGGTCGGGTC TGGACGCCGT GCACGCGATC
GATGCCGCAC TTGCCGACCA GCGCCGTCAG CTGGAAGGAG CTGGCGGCTA CCACGCCGAG
CGGGCCGCCG ACATCGACGA CATCCGGCAT CGTGCCGTCG CGGCTCTGCT CGGCCTGCCC
GCGCCCGGCC TCCCCGCGCC CGGGTTCCCG TTCGTCCTGG TGGCCGAGGA CCTCGCACCC
GCCGACACCG CCACCCTCGA CACCGATCTT GTCCTGGCCC TGGTGACCGA GCGTGGCGGT
CCCACCAGCC ACACCGCCAT CCTCGCCCGT GCCCTCGGGC TCCCCGCCGT CGTGTCCTGC
CCAGGCGCGA TGGCCCTCGA CGACGGGACC CCTGTCCGGG TCGACGGCAC CACCGGCGAG
GTCCACGTCG GGGTCGGGGT CGATGTGGAG GCCGACGGCG CGGGCCACGG CGAGGCCATC
GCGCAGGACA CGGCCGATGT CCGCCAGCCC GGCTTGTGGC GGACGACGTC GCGGGCCGCG
AAGGCCGGCC CGGCCCGGGG TGGTCCGGGA CGCACCGCGG ACGGCCGCCC GGTCCAGCTG
CTCCTCAACA TCGGCTCCGC CAAGGACCTG CGCGGCGACG TCGCCGCCAC GGCGGAGGGC
GTGGGCCTGT TCCGCACCGA GTTCCTCTTC CTCAACCGGC GGGTCGCCCC CACTCCTGAC
GAGCAACGGG ACGCCTACCA GGCCGTCTTC CGTGCAGCCG GCAGCAGGAA GGTCGTCGTC
CGCACGCTGG ACGCCGGCGC CGACAAGCCG CTGCCCTTCC TGAGCCTGCC TGACGAGCCC
AACCCGGCGC TCGGCGTGCG GGGCTACCGG ACGGTCTGGC TGCGCCCCGA GGTACTCGAC
ACCCAACTGG GCGCGATCGC GGACGCCGCG GCTGCGTGCG ATGCGGACGT CTGGGTGATG
GCGCCCATGG TTTCGACGCC CCCGGAGGCG GAGGCGTTCG CGGCGGCCGC TCGCGGGCAC
GGGCTGGCAA CGACTGGCGT CATGGTCGAG GTGCCTGCGG CGGCCCTGCG AGCTGGCCGG
ATGCTGGACA CCGTCGACTT CCTCAGCGTC GGAACCAACG ACCTGGGCCA GTACACGCTC
GCCGCCGACA GACAGAGCGG TCACCTCGCG GACCTGCTCA GCCCCTGGCA ACCTGCGTTG
CTCCGCCTCG TGGCGGACTG CGCGGCCGCC GGGGAAGCAT CCGGCAAGCC AGTCGGGGTG
TGCGGCGAGG CCGCGGCCGA CCCCCTGCTC GCCGCCGTCC TGGTCGGGCT CGGAGTCACC
AGCCTGTCCA TGTCAGGGCG GTCCATTGCG GCTGTCCGCG ACTCCCTTGC CGCACACACG
ATCAAGGAGT GCCGGGCGCT GGCAGAGATT GTGATCGACG CGGACGATGC GGAGCGCGCC
CGCGAGCTGG CGACGAAGAA CGCGCGGCAG ACGTAA
 
Protein sequence
MACRTATVNS RGGLHARPAA LFVRAAAQQA VPVRIRKGDG PAVNAASMLS VLALGAMYGT 
VVTLEADGER AEEALDALAA ILAHDQDAAD SGQGDGRVGS ASASDKALDE TGGTGRSRAT
ADVFVGLGVS PGLVTGPAFR MARQPRLPDP RLVLDPDEEA AAAARVLRNV ASDLRAHAAA
TRLSAASDIV TAQAMMAEDP VLLDEVGMRV RSGLDAVHAI DAALADQRRQ LEGAGGYHAE
RAADIDDIRH RAVAALLGLP APGLPAPGFP FVLVAEDLAP ADTATLDTDL VLALVTERGG
PTSHTAILAR ALGLPAVVSC PGAMALDDGT PVRVDGTTGE VHVGVGVDVE ADGAGHGEAI
AQDTADVRQP GLWRTTSRAA KAGPARGGPG RTADGRPVQL LLNIGSAKDL RGDVAATAEG
VGLFRTEFLF LNRRVAPTPD EQRDAYQAVF RAAGSRKVVV RTLDAGADKP LPFLSLPDEP
NPALGVRGYR TVWLRPEVLD TQLGAIADAA AACDADVWVM APMVSTPPEA EAFAAAARGH
GLATTGVMVE VPAAALRAGR MLDTVDFLSV GTNDLGQYTL AADRQSGHLA DLLSPWQPAL
LRLVADCAAA GEASGKPVGV CGEAAADPLL AAVLVGLGVT SLSMSGRSIA AVRDSLAAHT
IKECRALAEI VIDADDAERA RELATKNARQ T