Gene Franean1_3195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3195 
Symbol 
ID5671571 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3772552 
End bp3774324 
Gene Length1773 bp 
Protein Length590 aa 
Translation table11 
GC content68% 
IMG OID641242089 
Productbeta-D-glucuronidase 
Protein accessionYP_001507509 
Protein GI158315001 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3250] Beta-galactosidase/beta-glucuronidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.167785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTCGTC CCCAGGACGG TTCGACACGG GAGCGGCGAT CTCTGGGCGG GCTGTGGGCC 
TTCCGTCTCG ACCCCGCGGG TACCGGGCGG AACGCGGGCT GGTGGCGTGC GCGCCTGCCC
GGGGCCCGGG AGATGCCGGT GCCGGCCAGC TACAACGACA TCGTGCCCGA TCTCGCCGTG
CGTGATCATG TTGGTGACGC CTGGTACCAG ACGGTCGTCC GCATCCCGGC CGGCTGGTCC
GCGCGGCGGG TGGTGCTGCG TTTCGACGCC GCGACCCACC GTGCGGTCGT CTGGGTGAAC
GACACGCTCG TCGCCGAGCA CGAGGGTGGC TACACGCCGT TCGAAGCCGA CATCACCGGG
GTCGTGGCGC CCGGGGACGA GGCCCGGGTC ACCGTGGTCG TCAACAACGA GCTCACGTTG
ACCTCGATAC CGCCCGGCAT CGTGGAGGAC ACCGCGCAGG GACGCCGCCA GAAGTACTTC
CACGACTTCT TCAACTACGC GGGTCTGCAC CGCTCGGTCT GGCTGTACAC CACACCGCAC
ACGCGGATCA GCGACATCGC GGTGGAGACC GCTCTCGACG GCGCCGCGGG CACGGTGCGC
TACGCGGTGG AGATCGAGGG TGCCCAGGGT GTTGCCGCCA TCCGGGTCGT GCTCCGCGAC
GCGCAGGGGC GGGAGGTCGC CGCCGCCGAC GGTGCCGTGG GAATGTTGAA CGTGCCAGCC
GTGCATCCAT GGGCACCGGG TGACGGTTAC CTGTACGAGC TCGATGCCCG TCTGGTCGGT
GCGGGCGGGG AGGTGGCCGA CAGTTATGTC CTGCCGGTCG GTGTGCGTAC CGTCGAGGTA
CGCGGCACCC AGTTTCTGAT CAACGGGGAG CCTTTCTACT TCCGGGGCTT CGGTAAGCAC
GAGGACGCGC CGGTGCGCGG CAAGGCCCAT GACGACGCGC TCATGGTCCA TGATTTCGAG
CTGATGGAGT GGATCGGCGC CAACTCCTTC CGCACCTCCC ACTATCCGTA TGCGGAGGAG
GTACTGGAGT ACGCCGACCG GAGTGGCATC GTCGTGATCG ACGAGACCGC GGCCGTCGGT
CTGAACCTGA AGGCCTCGCT CGCTTTCGGC AGCAGGCCGA CGGTGAGTAC CTTCGGTGAG
GATGGCATCA GCTCCGTCAC CCAGCGCGCG CATCTCCAGG CCGTGCGGGA ACTGATAATC
CGGGACCGGA ACCATCCGAG TGTGGTGTTG TGGAGCCTCG CCAACGAGCC GGATTCCAGT
ACCGCCGCCG CCCGGGAGTA CTTCGCGCCG CTGTTCGCCG AGGCCCGCAA GCTCGACCCG
ACCTGCCCGG TGGGCTTCGT CAACTCGTTC GACCAGTGCC AGGTCACCGA GCTCGCCGAC
GTCGTCATGA TCAACCGTTA CTACGGCTGG TACATCAACA ACGGGGAGCT CAAGGCCGCC
GAGGTCGCAC TGGAGGCCGA GCTGAACAGG TGGGCCGCCC ACGGCAAGCC GGTGATCGTC
ACCGAGTACG GAGCGGACAC CATGGCCGGG CTGCACGCCG TGGTCGACAC CCCCTGGTCC
GAGGAGTACC AGGTGCGGTT CCTCGAGATG CACCACCGGG TGTTCGACCG GGTGGATGCC
GTGATCGGCG AGCACATCTG GAACTTCGCC GACTTCGCCA CCAGCCCGCA CATCATCCGG
GTCGACGGCA ACAAGAAGGG CGTGTTCACC CGGGACCGGC ACCCCAAGAG CGCCGCGTTC
TCCGTCCGCC GGCGCTGGCG CCCGCAGGCC TGA
 
Protein sequence
MLRPQDGSTR ERRSLGGLWA FRLDPAGTGR NAGWWRARLP GAREMPVPAS YNDIVPDLAV 
RDHVGDAWYQ TVVRIPAGWS ARRVVLRFDA ATHRAVVWVN DTLVAEHEGG YTPFEADITG
VVAPGDEARV TVVVNNELTL TSIPPGIVED TAQGRRQKYF HDFFNYAGLH RSVWLYTTPH
TRISDIAVET ALDGAAGTVR YAVEIEGAQG VAAIRVVLRD AQGREVAAAD GAVGMLNVPA
VHPWAPGDGY LYELDARLVG AGGEVADSYV LPVGVRTVEV RGTQFLINGE PFYFRGFGKH
EDAPVRGKAH DDALMVHDFE LMEWIGANSF RTSHYPYAEE VLEYADRSGI VVIDETAAVG
LNLKASLAFG SRPTVSTFGE DGISSVTQRA HLQAVRELII RDRNHPSVVL WSLANEPDSS
TAAAREYFAP LFAEARKLDP TCPVGFVNSF DQCQVTELAD VVMINRYYGW YINNGELKAA
EVALEAELNR WAAHGKPVIV TEYGADTMAG LHAVVDTPWS EEYQVRFLEM HHRVFDRVDA
VIGEHIWNFA DFATSPHIIR VDGNKKGVFT RDRHPKSAAF SVRRRWRPQA