Gene Franean1_0431 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0431 
Symbol 
ID5668854 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp508969 
End bp510324 
Gene Length1356 bp 
Protein Length451 aa 
Translation table11 
GC content72% 
IMG OID641239363 
Productglycoside hydrolase family protein 
Protein accessionYP_001504802 
Protein GI158312294 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3507] Beta-xylosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTGAGC CGATCATTGC GGGCTTCTAT CCCGACCCGA CGGTCTGCCG AGTCGGTGAC 
GACTATTACC TGGCCCATTC CAGCTTCGAG TACTTTCCCG GCGCGCCGAT CTGGCACAGC
CGCGACCTGA TCCGCTGGAC GCAGATCGGC CACATCCTGA CCCGCCGCAG CCAGTTCGCC
CGCGGCGACT GCCGCCCGTC CTCCGGGCTG TACGCCGGGA CGCTGCGGCA TCACGACGGG
CGTTTCGTCT ACGTCACCAC GAACGTCAGC GACTTCGACG GCGGCCAGCT GATCGTCACC
GCCGACGACC CGGCCGGCCC GTGGAGCGAG CCGGTTCGGG TGCCCGAGGC GATCGGCATC
GACCCGGACC TCGCCTGGGA CGACGACGGC ACGTGCTACC TGAGCTGGAA GGCGATGAGC
CTCGCCGACG GCGAGATCGG CATCCGCCAG GCCCCGCTCG ACATCTCGTC GGGCAAGTTC
CTCGCCCCGG CGTACCCGAT CTGGCAGGGC AGCGGACTCG CCGCGGCCGA AGGGCCGCAT
CTGTACGCCA TCGACGGCCT CTGGTACCTG CTGCTCGCCG AGGGCGGCAC CGAGCGCGGG
CACGCCGTCA CCGTCGCGCG CGGCCCGCAC CCGTCCGGGC CCTTCGAGGG CTGTGCGGCC
AACCCGATCC TGAGCCACCG CAGCACCATC CACCCGGTGC AGAACACCGG CCACGCCGAC
CTGGTCCGCG CGCCGGGCGG CGGCTGGGCC GCCGTCTATC TGGGTACCCG GCCGCGCGGC
TGGACGCCGG GCTTCCACGT CCTGGGCCGG GAGACGTTCC TGGCCGGCAT CGACTGGGTG
GACGGCTGGC CGGTTTTCGA CGAGGACCGC TACCAGGTCC CACCGCTCGA CACCGGCTTT
ACGGAGGTGT TCGGTGACTC ACCGCCGCAT CTGCGCTGGG TCGCCGACGG CAGCGGCCTG
CGGTGCACCC GGGTGCGCGA TCTGCGCTGG TCCGCCGAAG CGACGCTCGA CGGCCCGGGC
CGCTTCGAGG TGCGCATCGA CGACCGGCAC GCGTACGGCC TGACCCGCCA CCACGACCGG
GTCGAGGCGA CGGCCCGCGT CGGTGACCTC GGCGCCGTCC TCGCCACGCT CCCCGTCGCG
GATGGTCACA CGGTCCTGCG GATCGAGGCG GTGCCACCGC CGTCACGCGA CGCCGGGCCC
GACGAGATTG TCCTCTCCGC CGGCTCGCAC GAACTCGCCC GCCTCGACGG GCGCTACCTC
TCCACCGAAG TCGCGTCCGG GTTCACCGGC CGCATGCTCG CCACCGCCGG ACCGCACGTC
CGGTCGGTGA CCTACCGCCC CCAGGAGCAG CCATGA
 
Protein sequence
MPEPIIAGFY PDPTVCRVGD DYYLAHSSFE YFPGAPIWHS RDLIRWTQIG HILTRRSQFA 
RGDCRPSSGL YAGTLRHHDG RFVYVTTNVS DFDGGQLIVT ADDPAGPWSE PVRVPEAIGI
DPDLAWDDDG TCYLSWKAMS LADGEIGIRQ APLDISSGKF LAPAYPIWQG SGLAAAEGPH
LYAIDGLWYL LLAEGGTERG HAVTVARGPH PSGPFEGCAA NPILSHRSTI HPVQNTGHAD
LVRAPGGGWA AVYLGTRPRG WTPGFHVLGR ETFLAGIDWV DGWPVFDEDR YQVPPLDTGF
TEVFGDSPPH LRWVADGSGL RCTRVRDLRW SAEATLDGPG RFEVRIDDRH AYGLTRHHDR
VEATARVGDL GAVLATLPVA DGHTVLRIEA VPPPSRDAGP DEIVLSAGSH ELARLDGRYL
STEVASGFTG RMLATAGPHV RSVTYRPQEQ P