Gene Franean1_5188 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5188 
Symbol 
ID5673522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6228943 
End bp6230148 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content75% 
IMG OID641244042 
ProductROK family protein 
Protein accessionYP_001509452 
Protein GI158316944 
COG category[G] Carbohydrate transport and metabolism
[K] Transcription 
COG ID[COG1940] Transcriptional regulator/sugar kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0518111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTTG ACGGCCACGT GGCCGCGCCG CAAGGCTTTG TGTTCATGCC CCGAACGCCG 
CCCGGCTCCC AGTCGTCGCT GCGCGCGGCC AACCGGGACC GGGTCCTGGG CGTGCTGCGG
CGGGAGGGCA GCCTCGCCCA GGCGGAGATC GCGCGGCTGA CCGGGCTGTC CCCGGCGACC
GTCTCCAACA TCGTCGGCGA GCTGCGCGAG AGCGGGGACG TCGACGTGCG CCCCGCGGTG
AGCGGCGGGC GGCGGGCGGT CCGGGTGTCG CTGTCGCGGC GGTCCGGAGT CGTGATCGGG
CTCGACTTCG GCCACCGCCA CCTGCGGGTC GCCATCGGCG ACCTCGCGCA CGAGGTGCTG
GCCGAGGACG TCGTCGACAT CGACGTCGAT CACCAGGCCC AGGAGGGCAT CGCGACGGCC
GGCCGGCTGG TCGACGACCT GCTCGGACGG CTCGCCGTCG ACCGGGCGGA CGTGGTCGGC
GTGGGCATGG GCCTGCCCGG CCCGATCGAC GCGGTCACCG GCGCCGTCGG ATCCTCGGCG
ATCCTGCCCG GATGGGTCGG CGTGCCCGCC GCCGCGCAGA TGTCGGAGCG GCTCGGGCTG
CCCGTCCGGG TCGACAACGA CGCCAACCTC GGTGCCCTCG CCGAACTGCA CTGGGGCGCC
GGCCAGGGCG TCCGCGACCT CGCCTACCTC AAGGCCTCCA CCGGCGTCGG CTCGGGCCTG
GTCATCGACG GGCGCGTCCA CCGCGGCGGA GCGGGAACCG CGGGCGAGAT CGGGCACACC
ACGCTGGACG AGAACGGCTC GGTCTGCCGC TGCGGAAACC GCGGCTGCCT GGAGACGATC
GTCGGAACGT CGGTGCTGCT CGAGTCGCTG CGCACGAGCC ACGGGCCGGA TCTCACCGTG
CGCGGGATGA TCGACCGCGC GGTCGCCGGT GACGCGGGAT GCGCCCGCGT GGTCTCGGAC
GCCGGCCGGG CCATCGGGAA CGCGGCCGCG AACCTGTGCA ACCTGCTGAA CCCCCAGGTG
ATCGTCGTGG GCGGCGACCT CGCGGCGGCG GGGGAGACTC TGTTGGAACC GATGCGCCAG
GTGGTGCACC GCTTCGCCGT TCCGGCGGCC GTCCCGACAA TCGTGGCCGG CGTGCTCGGC
GAGCGTGCGG AGGTCCTCGG CGCCCTCGCG CTGGTGCTGC GCGAGGGCGA GCCGATCCCG
CGCTGA
 
Protein sequence
MTVDGHVAAP QGFVFMPRTP PGSQSSLRAA NRDRVLGVLR REGSLAQAEI ARLTGLSPAT 
VSNIVGELRE SGDVDVRPAV SGGRRAVRVS LSRRSGVVIG LDFGHRHLRV AIGDLAHEVL
AEDVVDIDVD HQAQEGIATA GRLVDDLLGR LAVDRADVVG VGMGLPGPID AVTGAVGSSA
ILPGWVGVPA AAQMSERLGL PVRVDNDANL GALAELHWGA GQGVRDLAYL KASTGVGSGL
VIDGRVHRGG AGTAGEIGHT TLDENGSVCR CGNRGCLETI VGTSVLLESL RTSHGPDLTV
RGMIDRAVAG DAGCARVVSD AGRAIGNAAA NLCNLLNPQV IVVGGDLAAA GETLLEPMRQ
VVHRFAVPAA VPTIVAGVLG ERAEVLGALA LVLREGEPIP R