Gene Franean1_5567 details

Gene Information

Locus tag: Franean1_5567
Symbol:
ID: 5673896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6747275
End bp: 6748438
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 76%
IMG OID: 641244422
Product: hypothetical protein
Protein accession: YP_001509826
Protein GI: 158317318
COG category: [C] Energy production and conversion; [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID:
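
The length fields above are related by simple arithmetic, which the following minimal Python sketch illustrates. It assumes the start and end coordinates are 1-based and inclusive (consistent with the 1164 bp figure) and that the protein length excludes the stop codon. The function names are hypothetical and are not part of the source database; the GC helper uses the usual definition, (G + C) / total bases.

```python
# Coordinates copied from the record above.
START_BP = 6747275
END_BP = 6748438


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * (bases.count("G") + bases.count("C")) / len(bases)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1164
    print(protein_length(length))  # 387
```

Passing the full gene sequence from the Sequence section to `gc_percent` gives a value that can be compared against the GC content field.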


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.661467
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACGTCC TCTTTGCGTC GTTGCCGGCA TACGGGCACC TCTACCCGCT GATCCCCCTC 
GCGGTGGCCT GCCAGGACGC GGGGCACCGG GTCCGCCTGG CCACCGGCGA GCCCTTCCTC
GGGGCCCTCC CTGTGCCGAC CGTCCAGGGC ACGCCGGCCG GGTGGACGCT GCAGTACGTG
GAGGGCGAGA CAGCCCGTCG CCACCCGGAC GCGACCGGCG TCGAGTTCCC CGTCGCCATG
TTCGCCGATG TGGCGGCCGA AGGGGTGATG GACGCGCTTG AACCGCTGTT CGCCGCGGAT
CCGCCGGAAG TGGTGGTCGC CGACAGCGCC AACCTCGGGG CCGTGATCGC CGCGCACCTC
GCCGGTGTCC CAGCCGTGAT CTTCGGGGTC GGCCAGTGGA GCCCCTTCGG TGAGATGACC
TTCCCCGCCG CCCTGGCGGC GCACCGCTCC CGCTGGACCG CGGCGGGGCT CGTCGCCCCC
GGGGAGCCGG GTGAGGTGAT CGCCGCCTAC CTCGAGCCCT TCCCACCGGG CCTGCGGCAG
GAGCCCGGCC CCGGCGGCGT GCCGGTGCTG CCGATCCGCA GCACGGCCTG GGCCGGCGCG
CAGGCGCCCG TGCCCGGCTG GCTGACCGCT CCCGCCGAGC GGCCCCGGGT GTACGTCACG
CTCGGCACCG TCTCGTTCGG CGCCGTCGAG GTGATCCGGG CGGTCGTCGA CGACCTCGCC
GCGCTGGACG TCGACGTGCT CGTCGCGGCC GGCCCGGAGG GCGACCCGGC GGCCCTGGGC
GCGCTGCCCG AGCGGGTGCG GGTCGAGCGG TTCGTGGCCC AGAGCCGCGT GCTCGGTCTG
GTGGACGTCG CCGTCCACCA CGGAGGCTCG GGCACGGTGC TCGGCGCGCT GGCGAACGGC
GTCCCCCAGG TGCTGCTGCC GCAGGGTGCG GACCACTTCC ACAACGCGCA GCTGCTCGCC
GAGCGCGGCG CCGCCCGGGT GTTCCACAAC GAGGCACGGC AGCCGGGTGA CGTCGCCGCG
GCCGTCCGCG ACCTGCTCGG TGACGCCCCC GAGCGCCGTG CCACCGCCAC GCTCGCCGCG
CAGATCGCCG CGAGCCCGAC TCCCGCCGAC GTTGTGGCGG CAATCGCCGC GATCGCCGAG
GCCGCCGCGA AAACCCGGCG ATGA
 
Protein sequence
MDVLFASLPA YGHLYPLIPL AVACQDAGHR VRLATGEPFL GALPVPTVQG TPAGWTLQYV 
EGETARRHPD ATGVEFPVAM FADVAAEGVM DALEPLFAAD PPEVVVADSA NLGAVIAAHL
AGVPAVIFGV GQWSPFGEMT FPAALAAHRS RWTAAGLVAP GEPGEVIAAY LEPFPPGLRQ
EPGPGGVPVL PIRSTAWAGA QAPVPGWLTA PAERPRVYVT LGTVSFGAVE VIRAVVDDLA
ALDVDVLVAA GPEGDPAALG ALPERVRVER FVAQSRVLGL VDVAVHHGGS GTVLGALANG
VPQVLLPQGA DHFHNAQLLA ERGAARVFHN EARQPGDVAA AVRDLLGDAP ERRATATLAA
QIAASPTPAD VVAAIAAIAE AAAKTRR
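
The gene begins with GTG while the protein begins with M, which follows from translation table 11 allowing alternative start codons that are read as methionine. The sketch below shows how the protein sequence could be reproduced from the gene sequence. It is an illustration under stated assumptions, not the database's own pipeline: the codon table is the standard one in NCBI's TCAG ordering, the start codon set is the one listed for table 11, and `translate_cds` is a hypothetical helper name.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in NCBI's TCAG ordering (standard code,
# which table 11 shares for non-start positions).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons assumed for translation table 11; each is read as M
# when it occupies the first position of the CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and dropping a trailing stop codon."""
    dna = "".join(seq.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = []
    for index, codon in enumerate(codons):
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # alternative start codon read as methionine
        else:
            residues.append(CODON_TABLE[codon])
    protein = "".join(residues)
    # Internal stops, if any, stay visible as "*"; only the final one is removed.
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    first_line = ("GTGGACGTCC TCTTTGCGTC GTTGCCGGCA "
                  "TACGGGCACC TCTACCCGCT GATCCCCCTC")
    print(translate_cds(first_line))  # MDVLFASLPAYGHLYPLIPL
```

Only the first codon gets special treatment, so an internal GTG is still translated as V, matching the valines in the protein sequence above.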