Gene Franean1_0130 details

Gene Information

Locus tag: Franean1_0130
Symbol:
ID: 5668555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 155425
End bp: 157335
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 71%
IMG OID: 641239058
Product: glycoside hydrolase 15-related
Protein accession: YP_001504503
Protein GI: 158311995
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
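
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene span includes the terminal stop codon (the gene sequence in this record ends in TAG). The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a span that includes
    the stop codon, which does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the stop codon
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(155425, 157335)
print(gene_len, protein_len)  # 1911 636
```

Both values agree with the Gene Length and Protein Length fields of this record.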


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.322371
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.176249
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGCACA CACCGACGCG GCGTGGCGGG GGCAGCCCGT TCCCGCCGAT CGCCGAGTAC 
GGGTTCCTCT CCGACTGCGA GACGAACTGC CTCGTGGCGC CCAGCGGCAA CGTCGAGTGG
ATGTGCGTGC CACGCCCGGA CGCGCCGAGC GTGTTCGGTG CCGTCCTCGA TCGCTCGGCC
GGCGGGTTCC GCTTCGGACC GGACCGGACC TTCATTCCGG CCGGCCGGCG GTACCTGCCC
GGCACGAACG TCCTTGAGAC GACCTGGCAG ACGCCCACGG GCTGGCTGAT CGTCACCGAC
TGCCTGGTCG TGGGCCGCTG GCACCGGACG CACCGGCGCT CCAAGACGCA CCGGCGCACC
CCGGGCGACT GGGACGCCGA CCACGTGCTG CTCCGGCTGG CCCGCTGCGA GCACGGCGAC
GTCGACCTCA GCCTCGTCTG CGAGCCCAAC TTCGACTACG GCCGCAGCCC GGCGTCCTGG
CGGTACGAGG ATGAGGACTA CTCGACCGGG ATCATCACCC ACGACTTCCC GGCCGGCGAC
CCGGCAGCGG GCGTCGCGCT GCGGCTGCGC ACCGACCTGC GGCTCGGGTT CGACGGCCGC
CGCGCGCTGG CCCGGACGAC GTTGCGGGAG GGCGACACCG CGTTCGTGGC GATGACCTGG
CGGGCCGAGG ACCCGCTGCT GCCCGGTAAC TATCCGCAGG CCTGCGCCGC CGTCGACAGC
ACCACCGAGT TCTGGCGGCA GTGGCTTTCG CGTGGGCGGT TCCCCGACCA TCCGTGGCGC
CGGCACCTGC AGCGCAGCGC ACTGGCGCTC AAGGGCCTCA CCTACGCCCC GACCGGGGCG
CTGCTCGCCG CCGCGACCAC CTCGCTGCCG GAGACGCCGC AGGGCGAGCG CAACTGGGAC
TACCGCTACA GCTGGATCCG GGACTCCACG TTCGCCCTGT GGGGCCTGTA CACCCTCGGG
CTCGACTACG AGGCCAACGA CTTCTTCTCC TTCATCGCGG ACGTCGCCGA GCACGCCGAC
GACATCCAGG TGATGTACCG GGTCGGTGGC GAGCCGAAGA TCGACGAGGA GATTCTCGGG
CATCTGTCCG GCTATGACGG CGCCGTCCCG GTGCGGGTCG GTAACGAGGC GGCGAAGCAG
CGCCAGCATG ACGTTTGGGG GGCCGTTCTC GACTCGGTCT ACCTGCACAC CCGGTCGCGG
GACTATCTCT CGGAGCGGCT GTGGCCGGTG CTGGTCCGGC TGGTGGAGGC GGCGGCCGCG
CACTGGCGGG AGACGGACCG CGGCATGTGG GAGGTCCGGG GCGAGCCGCG GCATTTCACC
TCGTCGAAGA TGTTCTGCTG GGTCGCGCTG GACCGGGGGC GCCGCCTCGC GCAGATGCGC
GGTGACCTGC GAACCGCGGG CCGCTGGGAC GACATCGCCG ACGAGATCCA CGCCGACGTG
CTCGCGAACG GCGTCGACCA CCGCGGTGTC TTCACCCAGT ACTACGGCTC GACGTCGCTG
GACGCCTCGG TGCTGCTGAT GCCGCTGCTG GGTTTCCTGC CGTCGACGGA CGACCGGGTG
AAGGCGACCG TGCTCGCCAT CGCCGACGAG CTGACGGTGG ACGGCCTGGT GCTGCGCTAC
CGGACGGACG AGACCGATGA CGGGGTCGAG GGCGAGGAGG GCGCCTTCCT CATCTGCTCG
TTCTGGCTCG TCTCCGCTCT GGTGGAGATC GGTGAGCTCA CCCGGGCCCG GCAGCTGTGC
GAGCGGCTGC TGAGCCTGGC CAGCCCGCTG GACCTCTACG CCGAGGAGAT CGATCCGGCC
GACGGCCGGC ACCTGGGCAA CTTCCCGCAG GCCTTCACCC ACCTGGCGCT GATCAACGCG
GTCATGTACG TGATCCGGGC CGAGTCCGGG GAGTCCTTCA CCCGCTCCTA G
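
A GC content figure such as the one listed in the gene information can be recomputed from a sequence printed in the 10-base block layout used here. This is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A, C, G and T characters. The function name `gc_content` is hypothetical, and the short input is just the first two blocks of the gene sequence.

```python
def gc_content(seq):
    """Return the GC percentage of a DNA string, ignoring spaces and newlines."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_content("GTGGCGCACA CACCGACGCG"))  # 75.0
```

Passing the full gene sequence to the same function gives a value that can be compared against the listed GC content.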
 
Protein sequence
MAHTPTRRGG GSPFPPIAEY GFLSDCETNC LVAPSGNVEW MCVPRPDAPS VFGAVLDRSA 
GGFRFGPDRT FIPAGRRYLP GTNVLETTWQ TPTGWLIVTD CLVVGRWHRT HRRSKTHRRT
PGDWDADHVL LRLARCEHGD VDLSLVCEPN FDYGRSPASW RYEDEDYSTG IITHDFPAGD
PAAGVALRLR TDLRLGFDGR RALARTTLRE GDTAFVAMTW RAEDPLLPGN YPQACAAVDS
TTEFWRQWLS RGRFPDHPWR RHLQRSALAL KGLTYAPTGA LLAAATTSLP ETPQGERNWD
YRYSWIRDST FALWGLYTLG LDYEANDFFS FIADVAEHAD DIQVMYRVGG EPKIDEEILG
HLSGYDGAVP VRVGNEAAKQ RQHDVWGAVL DSVYLHTRSR DYLSERLWPV LVRLVEAAAA
HWRETDRGMW EVRGEPRHFT SSKMFCWVAL DRGRRLAQMR GDLRTAGRWD DIADEIHADV
LANGVDHRGV FTQYYGSTSL DASVLLMPLL GFLPSTDDRV KATVLAIADE LTVDGLVLRY
RTDETDDGVE GEEGAFLICS FWLVSALVEI GELTRARQLC ERLLSLASPL DLYAEEIDPA
DGRHLGNFPQ AFTHLALINA VMYVIRAESG ESFTRS
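
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation routine. This is a sketch under stated assumptions, not the method the database used: it builds the codon table shared by the standard code and translation table 11, and it treats the first codon as an initiator that is read as M when it is one of the table 11 start codons (the gene here begins with GTG while the protein begins with M). The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Start codons of translation table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq):
    """Translate a CDS given as text, ignoring spaces and newlines.

    The first codon is read as M if it is a table 11 start codon;
    translation stops at the first stop codon.
    """
    dna = "".join(seq.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate_cds("GTGGCGCACA CACCGACGCG GCGTGGCGGG"))  # MAHTPTRRGG
```

The output matches the first block of the protein sequence above.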