Gene Franean1_6989 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6989 
Symbol 
ID5675300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8511212 
End bp8512402 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content72% 
IMG OID641245835 
Productglucan endo-1,3-beta-D-glucosidase 
Protein accessionYP_001511226 
Protein GI158318718 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.632525 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACC GCCGCACGTT CCTGCTGGCG GCGTCCGCCG CCGTCGCGGG TACCGCCGGA 
GGCGCCGTGT GGGCCGCCGC CGGCAGCCGG GAGCACGCCG CGCTCGCTGC TGGCGCGGGC
CTGCCGCTGA CCGTGGTCAA CCACACCTAC CGGTACGCCA ACAACCAGAT CTGGCTCTAC
GTCGTCGGCA CCGACCTGAT CACCGGCCGG CAGGTGTACG CCCGCCGCGA CGGCGCCCTC
GCCCAGGTCT CACTCGCCGA CAACGGCCCG GACGGCTTCG CCGACCTGTC CATCCCCCTG
GTGTCGGACG GTGACACGCC GTTCGTCGTC CCGAACGGGA TGTCCGGGCG GATCTACGTG
TCGACCGGTT CGAAGCTGCG CTTCAAGGTC GTGGTCGATG GCGCGGGCAA CGCGGCGCTC
CAGCACCCGG CCGGCTGGGT GCGCGCCGAC CCCAGCTTCG GCGTGGTGCA CGACTTCGTC
GAGTTCACCC ACAACGACGC CGGCATGTTC TGCAACACCA CGGCGGTCGA CATGTTCAGC
GTGCCGATGG CCATCGGGCT GCGCGGCAGC GCCGACCAGA CGACTGGACG GCTGGCGTCG
GGCGGCCGGG CCGCCGTCTT CGACGCGATG CGGGCACACC CGGTGTTCGC CCCGCTGGTC
GTCGACGACG CTGACCGGCA GGGCACTCGG GTGATCGCTC CGGGCCACGG CCTGGAGGCC
GGCATCTTTC CCGCCACCTA TTTCGACGGC TACATCGACG CGGTGTGGAA CCAGTACACG
TCGCGCCAGC TCACGGTGAA CGTCGGGACG AGCACCCGGG TCGGCACGGT GAACGGCGGC
CTGCTGCGCT TCGACGGCGG GGTCGCGCCG TTCGTCCGGC CGAGCACCCG CGACGTCCTG
TTCTGCGACG GCGCGCTCGC GGCGCCGAAC GACGGCGTCA CCGGGCCGGT GGCCGCCGTG
CTGGGAGCCG GCTTCAACCG TTCGACGCTG CTCACCCAGC CGACCCAGCC GACGACCGAC
CCCGCGGGCT TCTACCGCGA CCCGACGACC AACCACTACG CCCGGGTCCT GCACGAGCAC
AGCGCGGACG GCCGAGCGTA CGGATTCGCC TTCGACGACG TCGCCGGCTT CGCCTCCTAC
ATCCAGGACA CCGCGCCGAC GTCCGCCACC CTGTGGCTCA CGCCCTTCTG A
 
Protein sequence
MPNRRTFLLA ASAAVAGTAG GAVWAAAGSR EHAALAAGAG LPLTVVNHTY RYANNQIWLY 
VVGTDLITGR QVYARRDGAL AQVSLADNGP DGFADLSIPL VSDGDTPFVV PNGMSGRIYV
STGSKLRFKV VVDGAGNAAL QHPAGWVRAD PSFGVVHDFV EFTHNDAGMF CNTTAVDMFS
VPMAIGLRGS ADQTTGRLAS GGRAAVFDAM RAHPVFAPLV VDDADRQGTR VIAPGHGLEA
GIFPATYFDG YIDAVWNQYT SRQLTVNVGT STRVGTVNGG LLRFDGGVAP FVRPSTRDVL
FCDGALAAPN DGVTGPVAAV LGAGFNRSTL LTQPTQPTTD PAGFYRDPTT NHYARVLHEH
SADGRAYGFA FDDVAGFASY IQDTAPTSAT LWLTPF