Gene Franean1_1657 details


Gene Information

Locus tag: Franean1_1657
Symbol:
ID: 5670059
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1980773
End bp: 1982260
Gene Length: 1488 bp
Protein Length: 495 aa
Translation table: 11
GC content: 76%
IMG OID: 641240575
Product: glycoside hydrolase family protein
Protein accession: YP_001506001
Protein GI: 158313493
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
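
The coordinates and lengths in this table agree with one another, which a short sketch can confirm. It assumes that the start and end positions are 1-based and inclusive (the record does not state the convention) and that the final codon is a stop codon not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1980773
end_bp = 1982260
gene_length_bp = 1488
protein_length_aa = 495

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span_bp // 3
coding_codons = codons - 1

print(span_bp, codons, coding_codons)
```

Under these assumptions the span equals the listed gene length, and the number of codons excluding the stop equals the listed protein length.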


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0958307
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.376991
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCCGCGG CTCGCCCCAT GAGGAATCCC GCCGCCCGGC CGCGCACCGT CGAATACGGC 
TTCGGCGGCG ACGCCGCCGC TGGCGCGGAC ACCGATCACC AGACATCGAT TCACCCGGTG
ACCACTCATC CGGAGACCAA TCACCCGGAG ACCAATCACC CGGCGAGCGC TCACCGGACC
ACGACTCGCC CGGCCACGAG TCACTCAACC GCGAGGCGCC CGGCCGCGGC ACGCTCGGCC
GGCGCGCGCT CTAGCGGCGC GCACGGATCG CGGCGGCCGC CGCTCGATGG CCGGCTCGCC
CGCACGGCGT TCGCGGTCAC CGCGGCGCTG ACGCTACTCG GTGGGGTCGG TCTCGGCCGG
GGCCTCGCCG GCCCCGACGG CTCCGACTCC GCGGTGGACG CGCTCGCGGC GACCGTGGCC
GGCCGCGTGC CCCCGCCCGC CGCCGTCCAC ACGACCGGGC CCACCGCCAC CGTCACCGCA
CCCCCACGGC CGGCTGCCTC CGGGCTCACC GCGGTACAGG TGCCCACGGC CGCGCTCACA
AGCGCTCCGC CGCCCGCCGC CGCGCAGGCC CTGCCGGTGG CGGCGCCGGC GGCCGCCGCC
GCGGCCGCGG GGAACCCGTT CGCCGGCGCC CGGTTCTACA TCGATCCCGC GGACCAGGTC
GCCGCCGCGA TCAACGCGCT GCGCGGCGGG AATCCCTCCG CCATCGCTGC GCTGGAGAAG
ATTCTGCGTG GCGCGCACGC GGACTGGTTC GGATATGCCG ATCCGGCCAC GACCCGGCGC
AACGTGGCCG GCCGGGCCAG CACCATCAGG GGTAACGGCG CGCTGCCCGT CTTCGTGGCC
TACGCGATTC CGAACCGCGA CTGCGGAAGC TATTCCGCCG GCGGCGCCGG CGGCGCGCAG
GGTTATCGCG ACTGGATCGC CGCCTTTGCC GCCGGGCTCG CCGGCGGGCC GGCCGCGGTC
GTGCTCGAGC CCGACGCCAT CGCCCAGATC GATTGCCTCT CCCCCGCCGA CCAGCAGACG
CGCTACGGGA TGCTGTCGAA CGCGGTCGAC GTCCTGAACG CCGCCGGGGC GACCGTCTAC
CTCGACGCCG GCAACGCCGG CTGGCACAGC GCCGCCACCA TCGCCGCCCG GCTGAAGTCG
GCGGGCGTCG ACCGGGCGCG CGGATTCGCA CTGAACGTGT CGAACTTCGG TACCACCGCC
AGCGAAGTCG CCTTCGGTGA CGCGGTCAAC GCCGCGCTGG GCGGCGGAGC CCACTTCGTC
GTGGACACCA GTCGCAACGG GCTGGGCCCG GCGCCGGACA ACGCCTGGTG CAACCCGCCC
GGCCGCGCGC TCGGAACGCC GCCCACCGCC GCGACGGGCG ACAGCGACGT CGACGCATTC
TTCTGGGTGA AGATCCCCGG GGAGTCGGAC GGCACCTGCA ACGGCGGCCC CGCCGCCGGC
CAGTTCTGGC CGGACTACGC CGTCGGCCTG GGCAGCCGGG CCGGCTGA
 
Protein sequence
MPAARPMRNP AARPRTVEYG FGGDAAAGAD TDHQTSIHPV TTHPETNHPE TNHPASAHRT 
TTRPATSHST ARRPAAARSA GARSSGAHGS RRPPLDGRLA RTAFAVTAAL TLLGGVGLGR
GLAGPDGSDS AVDALAATVA GRVPPPAAVH TTGPTATVTA PPRPAASGLT AVQVPTAALT
SAPPPAAAQA LPVAAPAAAA AAAGNPFAGA RFYIDPADQV AAAINALRGG NPSAIAALEK
ILRGAHADWF GYADPATTRR NVAGRASTIR GNGALPVFVA YAIPNRDCGS YSAGGAGGAQ
GYRDWIAAFA AGLAGGPAAV VLEPDAIAQI DCLSPADQQT RYGMLSNAVD VLNAAGATVY
LDAGNAGWHS AATIAARLKS AGVDRARGFA LNVSNFGTTA SEVAFGDAVN AALGGGAHFV
VDTSRNGLGP APDNAWCNPP GRALGTPPTA ATGDSDVDAF FWVKIPGESD GTCNGGPAAG
QFWPDYAVGL GSRAG
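
The relationship between the two sequences above can be sketched in code. The record lists translation table 11, which uses the same codon-to-amino-acid assignments as the standard genetic code but accepts additional start codons such as GTG. The function below is a minimal illustration, not the database's own method: `translate_cds` is a hypothetical helper, and it assumes the input is a complete coding sequence whose first codon is a valid initiator (and is therefore read as methionine).

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
# Table 11 shares these assignments with the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")

    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)

    # Assumption: the first codon is an initiator (e.g. GTG), read as methionine.
    if protein:
        protein[0] = "M"
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate_cds("GTGCCCGCGG CTCGCCCCAT GAGGAATCCC"))
```

Applied to the first 30 bases of the gene sequence, this yields MPAARPMRNP, which matches the start of the listed protein sequence.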