Gene Franean1_6988 details

Gene Information

Locus tag: Franean1_6988
Symbol:
ID: 5675299
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8509551
End bp: 8511170
Gene Length: 1620 bp
Protein Length: 539 aa
Translation table: 11
GC content: 74%
IMG OID: 641245834
Product: lysozyme
Protein accession: YP_001511225
Protein GI: 158318717
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG5297] Cellobiohydrolase A (1,4-beta-cellobiosidase A)
TIGRFAM ID:
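
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that the CDS ends with a single stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 8509551
end_bp = 8511170
gene_length_bp = 1620
protein_length_aa = 539

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp == gene_length_bp)                 # coordinates match the gene length
print(gene_length_bp % 3 == 0)                   # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # protein length matches
```

Under these assumptions all three checks print `True`, so the listed coordinates, gene length, and protein length are mutually consistent.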


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.14763
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGCCTT TGCGAGGTCG GCACCGACGC CGTCGTCGTC AGATCATTTC GGGTACCGCG 
GCCCTGCTCA CCGTTGGCGG TGCGGTCGGC CTGGCCGTGG TGGTGCCCGG CACCGCGGAC
GCCGCCGGCC TCGACGCCGC GTACAGCCGC ACCAACGACT GGGGCACCGG CTACTCCGCC
CAGTATCAGG TCACCAACTC CGCGGACTCG CCGGACGGCT TCACCCTCGA GTTCGACCTG
CCGGACGGTG CGACCCTCAC CTCGCTGTGG AACGCCGCCT ACCAGGTGGA CGGCCGTCAC
GTGACCGTCA CCCCGCCGGC CTGGCAGACC ACACTCGCGC CACGGGAATC GGTCGACGTC
GGCTTCGTCA TCGCCGCTCC GGGCGGCGCC ACCGACCCCC TCGGATGTCG GATCAACGGC
GAGGACTGCA CGCCCGGCTC CGGCAACGGC GACCCGGGCC CGGAGCCTGA ACCCTCGGCC
ACCGGGCCGG CCGCACCGTC GTCCCCGCCG CCGAACGCGT CGCCGACCGA CCCGGCCACG
CCACCCGACA CGGCGCCGCG GCCCACCGCG GGCCCGGGCA CCGGCCAGCC GAGCGGCGCT
CCGACGAGCA CCCCGAGCAC GCCGACCACG CCGCCCGCTT CGACCACCGC GCCGCCGGCA
CCGCCGGCAC CACCGGCACC GCCGTCGAAC GGCTCCAGCG GATCGGGCGG GTTCGCGCCG
TACGTCGACA CCTCGCTGTA CCCGCCGTTC GACCTGGTCG CGGCGGCGCG GACCGCCGGC
CTGCGTGACG TCACGCTGGC CTTCGTCGTG GCCGGTGGAG GCGGCTGTAC GCCGAAGTGG
GGCGGGGTCA GCGACCTCAC CATGGACGGC GTGCCCGGCC AGATCGGCCG GTTCCGTGAG
CTGGGCGGCG ACGTCCGGGT GTCGTTCGGC GGGGCGTCCG GAACCGAGCT CGCCAGTGCC
TGCGGCAGCG CGGGCGACCT GGCGGCCGCG TACCGCAAGG TGGTCGACGT CTACGGGGTG
ACCCGGCTCG ACTTCGACGT CGAGGGCGGC ACGTTGCCGG ACGTCGCCGC GAACACCCGG
CGTGCCCAGG CGATCGCCCG GCTTCAGCGG GAGGCCGCGG CCGGAGGCCG GCCACTGGAG
GTCTCGTTCA CGCTGCCGGT GCTGCCGTCC GGCCTGACCC AGGCCGGCGT GGACCTGCTG
GCCAACGCCC GGGAGAACGG CGTGACGGTG AACGCCGTCA ACATCATGGC GATGGACTAC
GGCGACGGCG CCGCGCCGAA CCCGGCGGGC CGGATGGGCC AGTACGCCAT CGACGCCGCC
ACCGCGACCC AGGCCCAGGT CAAGGGCGTG TTCGAGCTGT CCGACGCGCA GGCGTGGGGG
CGGGTGGCCG TCACCCCGAT GATCGGTGTG AACGACGTCG CCAGCGAGGT GTTCACCCTG
GCCGACGCGC GGCGGCTGGT GCGGTTCGCG TCCGAGGTCG ACCTCGCCTG GCTGTCGATG
TGGTCGCTGA CCCGCGACCA GCCCTGCCCC GGTGGGCCGG TGCCGTACGC GCAGCCGACC
TGCGGCGGCA TCGAGGCGCA GCCGTTCGAT TTCACCCGCG CCTTCAACGC CGCCCAGTGA
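
The gene sequence is laid out in space-separated blocks of ten bases, so it needs to be joined before any computation. The following is a minimal sketch of how the blocks could be cleaned and a GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a sample, so the printed percentage describes that sample rather than the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence, used here as a small sample.
sample = "ATGTCGCCTT TGCGAGGTCG GCACCGACGC CGTCGTCGTC AGATCATTTC GGGTACCGCG"
seq = clean_sequence(sample)

print(len(seq))                             # number of bases in the sample
print(f"{gc_fraction(seq) * 100:.1f}% GC")  # GC percentage of the sample only
```

Pasting the full sequence block into `sample` would allow the result to be compared with the GC content and gene length listed in the Gene Information table.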
 
Protein sequence
MSPLRGRHRR RRRQIISGTA ALLTVGGAVG LAVVVPGTAD AAGLDAAYSR TNDWGTGYSA 
QYQVTNSADS PDGFTLEFDL PDGATLTSLW NAAYQVDGRH VTVTPPAWQT TLAPRESVDV
GFVIAAPGGA TDPLGCRING EDCTPGSGNG DPGPEPEPSA TGPAAPSSPP PNASPTDPAT
PPDTAPRPTA GPGTGQPSGA PTSTPSTPTT PPASTTAPPA PPAPPAPPSN GSSGSGGFAP
YVDTSLYPPF DLVAAARTAG LRDVTLAFVV AGGGGCTPKW GGVSDLTMDG VPGQIGRFRE
LGGDVRVSFG GASGTELASA CGSAGDLAAA YRKVVDVYGV TRLDFDVEGG TLPDVAANTR
RAQAIARLQR EAAAGGRPLE VSFTLPVLPS GLTQAGVDLL ANARENGVTV NAVNIMAMDY
GDGAAPNPAG RMGQYAIDAA TATQAQVKGV FELSDAQAWG RVAVTPMIGV NDVASEVFTL
ADARRLVRFA SEVDLAWLSM WSLTRDQPCP GGPVPYAQPT CGGIEAQPFD FTRAFNAAQ
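
The protein sequence can be related back to the gene sequence by translating it codon by codon. The sketch below assumes the amino acid assignments of NCBI translation table 11, which are the same as those of the standard code. Table 11 also allows alternative start codons, but that is not handled here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical, and unknown codons are reported as `X` by assumption.

```python
from itertools import product

# Codon table in TCAG order; amino acid assignments of translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence.
print(translate("ATGTCGCCTT TGCGAGGTCG GCACCGACGC"))  # prints MSPLRGRHRR
```

The output matches the first ten residues of the protein sequence listed above.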