Gene Franean1_3155 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3155 
Symbol 
ID5671532 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3714848 
End bp3717070 
Gene Length2223 bp 
Protein Length740 aa 
Translation table11 
GC content67% 
IMG OID641242050 
Productglycoside hydrolase family 3 protein 
Protein accessionYP_001507470 
Protein GI158314962 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACCGC GCAACGATCC CGTCGACCTC ACCTTACAGC ACAAGGCATC CCTTCTATCG 
GGACACGACT TCTGGTCCAC AAAATCCCTG GACGACGCGG GGATCCCCTC GATAGTTCTC
GCTGACGGTC CGCACGGCGT GCGCCTTCAG CGCGATAGAG CAGACCACGC GGGCATCTAC
GACAGCTTGC CCTCCACCTG CCTGCCTCCG GCGGTCGCGG TCGGGTCCAG CTGGGATGTC
GAGGTGGCGA GCCGAGTGGG TGCCGCAGTC GGGCGGGAGG CCCGCGCTCT GGGGGTCGCG
GTCGTGCTGG GCCCGGGAGT GAACATCAAG CGGTCGCCTC TGTGCGGCCG CAACTTCGAG
TACTACTCCG AGGATCCGCT GCTAACCGGC GTGCTGGCGA CCGCGCACAC ACAGGCGTTG
CAGGCCCAGG GAGTGGGCGC GTCGGTGAAG CACTTCGCGG CCAACAACCA GGAGACCGAC
CGGATGGTCA TCAGCGCCGA CGTCGACGAA CGCACCCTGC GGGAGATCTA CCTTCCCGCC
TTCGAGCGGA TCGTCACCGA AGCGCAGCCC GCGACCGTTA TGGCCGCCTA CAACCGCGTC
AACGGGGTCG CCGCGTCGCA GAACCGGTGG CTTCTGACTG ACATCCTGCG CACCGAGTGG
GGCTTCACAG GGATCGTTGT CTCGGACTGG GGCGGCGTCT ACGACCGGAT CGCCGCCCTC
GCGGCGGGAC TGGACCTCGA GATGCCAGGA TCCTCTGGTG TCAACGACGC GAAGGTCGCG
CAGGCCGTCC GCGACGGGCT CCTGGCCGAG ACCGTCGTCG ACGCGAGCGT ACAGCGCCTG
ATCACGCTTG CCGGTCTCGG TAAGGCCACC GAGGGCGATC TGCCCGCCGA GAGAGATCAC
CACGCCCTCG CCAGGGAGGT GGCCGCCGAC TGTGCCGTCC TACTCAGAAA CGAACACGCG
ACGCTTCCGC TCGACCCAGC CGCTCGCATC GCGGTCATCG GCGAGCTCGC CGTCTCCCCC
AGGTTCCAGG GCGGGGGCAG CTCACGGGTG AACCCGACCC GGATCGACGT CCCGTTGGAG
GCCATCACCG CCCTCGGGCA GAGCGTCGTG TACGCCCGCG GTTTCACCAC CGACGGCTCC
AAAGACGCGC GGGAGCTTCG AGACGACGCC GTCCGCATCG CGCTCGACGC CGACGTCTCC
ATCATTTTCG CCGGAGGAGA TGAAGACTCG GAAGGCATCG ACCGCGAACA CATCAACCTG
CCCGCCGACC AGATCGACCT GATACGAGCA GTCGCCGCCG TGGCCCCGCG GACTGTCGTC
GTCTTGTCCA ACGGAGGCGT GGTCTCGCTC GAGGGTTGGC ACGACGACGT GGACGCGATC
CTGGAGGCCT GGTTTCTCGG ACAGGCAGCC GGCGGCGCGA TCGCCGACCT TCTGTTCGGC
GTGGTCAACC CCTCCGGTCA CCTCGCCGAG ACCATTCCAC GGCAGCTGCA GGACACTCCT
TCCTACCTCA CCTTCCCCGG AGAGCAGGGC CACGTCCGGT ACGGCGAGGG CGTCATGGTC
GGCTATCGCT ACTACGAAAG CGTCCAACGC GCCGCCCGCT ACCCTTTCGG GCACGGTCTG
AGCTACACGA AGTTCGCCAC CAGCGAGCTG CAGGTGACCG TGGACGGTGA CGATTCCGCG
ACCGTCCGCG TCACCGTCAC GAACATCGGC GACCGAGCTG GTAAGCACGT CGTGCAAGTC
TATGTTGCCA CCGACGCGGG ACGGGTTCGC CGCCCGGCAC GGGAGCTGCG CGCGTTCACC
AAGATCGCAC TACAGCCCGG CGAGAGCCGA ACCGTCGAAC TGGCCCTCGA TCGGCGCTCC
TTCGCCTACT ACGACATCAA ACAGGCCCGA TGGACCGTCG CGCCCGGTAG CTACTCGATC
CAGATCGGCG AGAGCGCGAC GCGGATCATC GCCGAGCAGA ACATCACCCT CCCAGGCGAC
ACCGTGACCC AGGAGCTTTC CCTGGACTCA GCCGTCTCCG ACTGGCTCGA CCACCCCATC
GTCGGGCCCA TGTTCCGGCG CACCCTTGAC AAGGCCTCGC CTGACCGACG AAGCCTCCTC
ACCGACCAGG CCTCGAGGGC CATCGAAATG GTCGCGTCGA TGCCCCTGCG CCAAATCATG
CAGTTTCCCA CCGTGGACCT GCCCGTCGAC GCACTCACCC ACATGATGGA GCTGACCCGA
TGA
 
Protein sequence
MTPRNDPVDL TLQHKASLLS GHDFWSTKSL DDAGIPSIVL ADGPHGVRLQ RDRADHAGIY 
DSLPSTCLPP AVAVGSSWDV EVASRVGAAV GREARALGVA VVLGPGVNIK RSPLCGRNFE
YYSEDPLLTG VLATAHTQAL QAQGVGASVK HFAANNQETD RMVISADVDE RTLREIYLPA
FERIVTEAQP ATVMAAYNRV NGVAASQNRW LLTDILRTEW GFTGIVVSDW GGVYDRIAAL
AAGLDLEMPG SSGVNDAKVA QAVRDGLLAE TVVDASVQRL ITLAGLGKAT EGDLPAERDH
HALAREVAAD CAVLLRNEHA TLPLDPAARI AVIGELAVSP RFQGGGSSRV NPTRIDVPLE
AITALGQSVV YARGFTTDGS KDARELRDDA VRIALDADVS IIFAGGDEDS EGIDREHINL
PADQIDLIRA VAAVAPRTVV VLSNGGVVSL EGWHDDVDAI LEAWFLGQAA GGAIADLLFG
VVNPSGHLAE TIPRQLQDTP SYLTFPGEQG HVRYGEGVMV GYRYYESVQR AARYPFGHGL
SYTKFATSEL QVTVDGDDSA TVRVTVTNIG DRAGKHVVQV YVATDAGRVR RPARELRAFT
KIALQPGESR TVELALDRRS FAYYDIKQAR WTVAPGSYSI QIGESATRII AEQNITLPGD
TVTQELSLDS AVSDWLDHPI VGPMFRRTLD KASPDRRSLL TDQASRAIEM VASMPLRQIM
QFPTVDLPVD ALTHMMELTR