Gene Franean1_5092 details

Gene Information

Locus tag: Franean1_5092
Symbol:
ID: 5673427
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6094379
End bp: 6095338
Gene Length: 960 bp
Protein Length: 319 aa
Translation table: 11
GC content: 71%
IMG OID: 641243943
Product: PLP-binding domain-containing protein
Protein accession: YP_001509357
Protein GI: 158316849
COG category: [R] General function prediction only
COG ID: [COG0325] Predicted enzyme with a TIM-barrel fold
TIGRFAM ID: [TIGR00044] pyridoxal phosphate enzyme, YggS family


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.340872
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00296969
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCGGCC TCGAGCGCGA GGTGGACCCG CGCCGGCTGG CCGAGTTGCG CGAGCGGCTG 
ACAGTCGTCC GTGACCGGAT CACCGTGGCG GCCCGGGACG CCGGCCGCGA CCCGGCGGAG
CTGACCCTGA TCGCCGTCAG CAAGACCCGA CCCGCCGAGG ACGTCCTCGC GCTGGTCGCG
CTGGGTGTCC GCCACTTCGC CGAGAACCGC GAGCAGGAGG CCGGGCCGAA GACCGAGGCC
GTCCGGCTGG CGCTGGCCGC CGCCGCGGCG GGCGGCTCGG ATGAGCACAC GATGTGGTCC
GATGATGACC GTTTCCGGGA ATCGAGTCCG TTGGCCGACA CGTCGGGGGC ATGGCGTCCA
TCACGTGTGG ATCAGGGTCC GACCCTCGGG CCGGCCGCCG GGCCCGATGT GCCGGTCTGG
CATTTCGTCG GCCAGCTCCA GCGCAACAAG GCCCGGGCCG TGGCGGGCTG GGTGCACTGC
GTGCAGTCGG TGGACCGAGC GCGACTGGTC GAAGCCCTTT CCCGGGCAGC AACGGCACGT
GGTCGTCAGA TTTCAGTGTG CATACAGGTC AATCTGGACA TTGCGGAGCC AGTCGGTGCA
CCGGGTGCGC CCGCGGCGCG AAATGCCCTG TCTACCAGGG AAAACGAGCC CACCGCCGGG
GCTGCGGACG GCGCTCGCGG CGGGGCCCTG CCCGCGGACG TACCGGCGCT CGCCGAGCTC
GTCGCGGGCT CCGAAGGGCT TGTTCTGACG GGTGTGATGG CTGTCGCGCC ACGCGGAGCA
CCCGCCAGGC CGGCCTTCGC TCGACTCCGT GAGGTAGCCG ACCGACTTCT TTCCGAACAT
CCCGCGGCGA CGCTAGTCAG TGCAGGGATG TCCGGCGATC TCGAGGACGC GGTCGCGGAG
GGCGCGACAC ACCTTCGGAT CGGCACCGCT TTGTTCGGTG AACGGCACGG TGTCCCTTAG
 
Protein sequence
MSGLEREVDP RRLAELRERL TVVRDRITVA ARDAGRDPAE LTLIAVSKTR PAEDVLALVA 
LGVRHFAENR EQEAGPKTEA VRLALAAAAA GGSDEHTMWS DDDRFRESSP LADTSGAWRP
SRVDQGPTLG PAAGPDVPVW HFVGQLQRNK ARAVAGWVHC VQSVDRARLV EALSRAATAR
GRQISVCIQV NLDIAEPVGA PGAPAARNAL STRENEPTAG AADGARGGAL PADVPALAEL
VAGSEGLVLT GVMAVAPRGA PARPAFARLR EVADRLLSEH PAATLVSAGM SGDLEDAVAE
GATHLRIGTA LFGERHGVP
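
The Gene Length, Protein Length and GC content fields can be cross-checked against the sequences listed above. The following is a minimal sketch, not the method used to build this record: the helper names (clean, gc_percent, protein_length) are hypothetical, and it assumes the CDS ends in a single stop codon that is not counted in the protein length.

```python
def clean(seq: str) -> str:
    """Remove the spaces and line breaks that group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    bases = clean(seq)
    if not bases:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in bases) / len(bases)


def protein_length(gene_length_bp: int) -> int:
    """Expected protein length for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# First ten bases of the gene sequence above, used only as a small demo input.
first_block = "ATGAGCGGCC"
print(gc_percent(first_block))

# 960 bp = 320 codons = 319 residues plus one stop codon.
print(protein_length(960))
```

Passing the full gene sequence to gc_percent should give a value close to the GC content reported in the record, and protein_length(960) agrees with the listed Protein Length of 319 aa.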