Gene Francci3_1889 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1889 
Symbol 
ID3906838 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2221126 
End bp2223012 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content71% 
IMG OID637879227 
Productglycoside hydrolase 15-related 
Protein accessionYP_480994 
Protein GI86740594 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.510271 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCATCCC TGATCGAGGA CTACGCGCTG ATCGGCGACA CCCACTCCGC GGCGCTCGTG 
TCGCGCACCG GGTCGATCGA CTGGCTGTGC CTGCCCCGAT TCGACTCGCC GTCGTGCTTC
GCCGCCCTGC TCGGCGACTC CGAGGCGGGC CACTGGAAGA TCGCTCCCGT CGAGCCGGTC
CTGGGCGTCA GCCGGCGCTA CCGGGGCGAC ACGCTGGTCC TGGAGACGGA CATGACGACC
GCGTCCGGCA CCGTGCGCAT CGTCGACGCG ATGTTTCCCC GCGCAGGCAC GCACACCGTG
CTCCGCCTGG TCGAATGCCT CGAAGGCGCG GTCCACCTGC GATCGGAGAC CCGGTTTCGC
TTCGACTACG GCTCCATCGT GCCGTGGGTC CGCCGGGTGG ACGAGCACAC CATGTCGGCG
GTGGCCGGGC CGGATGCGGT CACCCTGCGG ACGACCGCCC CGATGGAGGG GCACGACATG
GCCACCTACG CCGACTTCCA CGTCGCCGCC GGCCAGTCGG TACCGTTCTC ATTGACCTGG
ACCCCCTCCC ATCAGACTCC CCCGCCGTCC CACGACGTCC GGCGCATGAT CACCCTCACC
GAGGCGTGGT GGTCGGACTG GATGGCGGGC TGCACCTACG ACGGCCAGTG GCAGCCGGCC
GTCCGCCGGT CGCTGATCAC CCTCAAGGCG CTCACCTACG CCCCGACCGG AGGGATCGTC
GCCGCCGTCA CGACCTCGTT GCCGGAGCAC ATCGGCGGCG TGCGCAACTG GGACTACCGG
TACTGCTGGC TTCGGGACGC GACGATCACG CTGCTCGCCC TGCTCGACGC CGGGTTCACC
AGCGAGGCGA CCGCATGGCG GGAGTGGCTG CTGCGCGCGG TCGCCGGTGA CCCCTCCCGG
GTACAGATCA TGTACGGCGT TGCCGGGGAA CGCCGGCTGC CCGAATACGA GATCCCGTGG
CTACCGGGGT ACGAGAACTC CGCCCCGGTG CGGGTCGGCA ACGCCGCCGT CGACCAGTTC
CAGCTCGACG TCTACGGCGA GGTCCTCGAC GCCCTGCATG TCGCGCGGGT CGCGGTCGCC
AACCGTCGCC AGACCGTGCC TGGGCTGGCG CTCCCCGGCG GGCAGACCAT CACCGAGAGC
CATGCGGACG ACTCCTGGCC GCTGCAGACC AAGCTGATGG ACTTCCTCGA GACCGGCTGG
CGGAAGACCG ACGAGGGCAT CTGGGAGGTG CGCGGCCCCC GCCGCCACTT CGTCCACTCG
AAGGTGATGG CCTGGGTCGC GGCCGACCGG GCGGTACGCG GGATCGTTGA GTCCCGGCTA
CCGGGCCCTG TCGACCGCTG GTCGGCGCTG CGGGACGAGA TCCACGCGGA GGTCTGTACC
CGTGGGTTCG ACTCCGAACG CAACACCTTC ACCCAGTTCT ACGGCTCCAA GGAACTCGAC
GCGGCGCTGC TGTACATGCC GCTGGTGGGG TTCCTGCCCG CCACCGACCC CCGCGCCGTG
GGAACAGTCG CCGCCATCGA GCGGGAGCTG ATGGAGGACG GGTTCGTCCT GCGGTATCCG
ACGGCCGAGG ACGGCGCGGT CGACGGACTG CCCGCGGGCG AGGGCGCCTT CCTGGCCTGC
ACCTTCTGGT TGGCCGACAA CTACGCCCTG TCCGGGCGGG TCCACGAGGC TCAGGAACTG
TTCGAACGCC TGCTGTCGTT GCGTAACGAC GTCGGGCTCC TCGCGGAGGA GTACGACCCC
AGGCTGGGCC GGATGACGGG CAACTTCCCG CAGGCGTTCA GCCACGTCCC CCTGGTCAAC
ACCGCGCGGA CGCTCACCGA CGCGCTGCGC GGCACACCGC GCTCGCGCAC CGACCGGGCC
CACCCGCCCG GCCACTTCTT CGGCTGA
 
Protein sequence
MPSLIEDYAL IGDTHSAALV SRTGSIDWLC LPRFDSPSCF AALLGDSEAG HWKIAPVEPV 
LGVSRRYRGD TLVLETDMTT ASGTVRIVDA MFPRAGTHTV LRLVECLEGA VHLRSETRFR
FDYGSIVPWV RRVDEHTMSA VAGPDAVTLR TTAPMEGHDM ATYADFHVAA GQSVPFSLTW
TPSHQTPPPS HDVRRMITLT EAWWSDWMAG CTYDGQWQPA VRRSLITLKA LTYAPTGGIV
AAVTTSLPEH IGGVRNWDYR YCWLRDATIT LLALLDAGFT SEATAWREWL LRAVAGDPSR
VQIMYGVAGE RRLPEYEIPW LPGYENSAPV RVGNAAVDQF QLDVYGEVLD ALHVARVAVA
NRRQTVPGLA LPGGQTITES HADDSWPLQT KLMDFLETGW RKTDEGIWEV RGPRRHFVHS
KVMAWVAADR AVRGIVESRL PGPVDRWSAL RDEIHAEVCT RGFDSERNTF TQFYGSKELD
AALLYMPLVG FLPATDPRAV GTVAAIEREL MEDGFVLRYP TAEDGAVDGL PAGEGAFLAC
TFWLADNYAL SGRVHEAQEL FERLLSLRND VGLLAEEYDP RLGRMTGNFP QAFSHVPLVN
TARTLTDALR GTPRSRTDRA HPPGHFFG