Gene Franean1_7197 details


Gene Information

Locus tag: Franean1_7197
Symbol:
ID: 5675498
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8789630
End bp: 8790811
Gene Length: 1182 bp
Protein Length: 393 aa
Translation table: 11
GC content: 74%
IMG OID: 641246034
Product: cellulase
Protein accession: YP_001511422
Protein GI: 158318914
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
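
The reported lengths can be cross-checked against the coordinates. The sketch below is illustrative only: the function name cds_lengths is hypothetical, and it assumes that the coordinates are 1-based and inclusive and that the gene span includes the stop codon. Both assumptions are consistent with the values listed in this record.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a span that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the terminal stop codon encodes one residue.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


gene_len, protein_len = cds_lengths(8789630, 8790811)
print(gene_len, protein_len)  # 1182 393
```

The result matches the Gene Length (1182 bp) and Protein Length (393 aa) fields.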


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGTCC GGGCGGAAGA CGGCAACGGG CCGAAGCCGC GGTTGCGCGA GCCTCAGGTC 
CGGCCAGGTC GGCCAGGTCG CACCCGCCTC CGGCGCCGGG CCTGTGCCGC CGGGGCGACG
GTGGTCGCGG TGCTCGCCCT CGCCGCCTGC GGCGGGGGCT CGAGCCCCAC CCCGGTGACA
CCCAGCGTGA CCGCGGCCGC ACCGAACGTT CCCACCGGTC CGCTGACAGC GGTCGGACCG
CAATGCCCGG CCGGCGACCC GGCCGGCGCC CAGCGGCCGA CCGAGACCGT CGTGCCCGCC
GATCCCGCTC CCGCCGCGCC CTTCCTCGTC GACCACGGCA GCCAGGCGGC CGAGGAGGCG
CAGCGCAACC CCGCGCGCGC CCAGATCCTC GCGCCACTGG TCAACACCCC CACCGCCTAT
CCGGTCGGTG ACTGGCTGAG AGATGTGCCC GGCGAGGTCC ACAAGCGCGC CTCCGCGAGC
CGCGACACCG GCACGACCGC GACATTCATG ATCTATGCGA TTCCGCACCG TGACGCGGAG
GCCGTCTACT CCGCCGGTGG CCTGCCCAAC GCCGACGCCT ACCGGACGTT CACCCGCCAG
GTCGCCGGCG CGATCGGCGA CGCCCGCGCA GTGATCATCC TTGAGCCGGA CTCGCTCGGT
CAGATGGACA GCCTGCCCGC CGACCAGCAG GCCGAGCGCT ACGCGCTGCT CAACGACGCG
GTCGGCGTGT ACGGCGCGCT GCCCAACACC AGCGTCTACC TGGACGGCGC GAACTGCGGC
TGGATGCCGG CGGGAGCGGC GCCGGTGATC GCCGAACGGC TCCTGCGGGC CGGGGTGAAG
GGCGCCCGCG GCTTCGCGGT CAACGTGTCC AACTACTACC GGACCGAGGA CGAGACCGCC
CGAGGCGAGA TCATCTCGGC CCTGACCGGC GGCACCCACT TCGTGGTCGA CACCTCGCGC
AACGGCCGGG GCCCCGCCGA GGGCATCCAG AACCAGTGGT GCAACCCGCC GGACCGTGGC
CTGGGCGTCG CCCCGACGAT CGAGACGGGC TCACCGCACG CCGACGCGTT CCTCTGGATC
AAGACCCCTG GTGCCAGCGA CGGCGAGTGC GGACGCGGCA ACCCCGCGGC CGGCGCCTGG
TGGCAACAGC AGGCGGAGGA GCTGGTCCGC AATGCGGCCT GA
 
Protein sequence
MTVRAEDGNG PKPRLREPQV RPGRPGRTRL RRRACAAGAT VVAVLALAAC GGGSSPTPVT 
PSVTAAAPNV PTGPLTAVGP QCPAGDPAGA QRPTETVVPA DPAPAAPFLV DHGSQAAEEA
QRNPARAQIL APLVNTPTAY PVGDWLRDVP GEVHKRASAS RDTGTTATFM IYAIPHRDAE
AVYSAGGLPN ADAYRTFTRQ VAGAIGDARA VIILEPDSLG QMDSLPADQQ AERYALLNDA
VGVYGALPNT SVYLDGANCG WMPAGAAPVI AERLLRAGVK GARGFAVNVS NYYRTEDETA
RGEIISALTG GTHFVVDTSR NGRGPAEGIQ NQWCNPPDRG LGVAPTIETG SPHADAFLWI
KTPGASDGEC GRGNPAAGAW WQQQAEELVR NAA
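
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is a minimal illustration, not the database's own method. The helper names (clean, gc_content, translate_cds) are hypothetical. The codon table is the standard bacterial one (translation table 11), written in the usual TCAG ordering, and the set of alternative start codons is stated from memory of that table and should be treated as an assumption. A start codon such as GTG in the first position is translated as M.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq):
    """Translate a CDS, reading the first codon as M if it is a start codon."""
    seq = clean(seq)
    protein = []
    for index in range(0, len(seq) - 2, 3):
        codon = seq[index:index + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends the protein
            break
        if index == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACCGTCC GGGCGGAAGA CGGCAACGGG"))  # MTVRAEDGNG
print(gc_content("GTGACCGTCC"))  # 0.7
```

The translated fragment matches the first ten residues of the protein sequence. Passing the full gene sequence to the same functions would allow the whole protein and the GC content field to be checked in the same way.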