Gene Francci3_3972 details


Gene Information

Locus tag: Francci3_3972
Symbol:
ID: 3906932
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4752239
End bp: 4754023
Gene Length: 1785 bp
Protein Length: 594 aa
Translation table: 11
GC content: 69%
IMG OID: 637881300
Product: glycoside hydrolase 15-related
Protein accession: YP_483051
Protein GI: 86742651
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3387] Glucoamylase and related glycosyl hydrolases
TIGRFAM ID:
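
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the unspliced coding sequence ends with a single stop codon. The function names gene_length and protein_length are hypothetical helpers for illustration only and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length = gene_length(4752239, 4754023)
print(length)                  # 1785
print(protein_length(length))  # 594
```

Both results agree with the Gene Length and Protein Length fields of this record.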


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.855212
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCGACA CCGACTATCC GCCCATCGAA GAGCACGCGG TCATCGGCGA CCTGCGTACC 
GTGGCCCTGG TCGCCACGGA CGGCACGATC GACTGGTACT GCCCGCAGCG CTTCGACGCG
CCGTCGGTTT TCGCGAGTCT GCTGGACGCC GACCGTGGCG GATCGTTCCG TATCCACTGT
CCGGACTCCC GGGCCAGGCA GCTCTACCTC CCCGACTCGA ACATCCTGAT GACCCGTTTC
CTCGCCCCCC GCGCGGTCGG TGAGGTCATC GACTTCATGG TCCCGGTGGA CAGCGACGTC
ACCGAGACAC CGCACGTCGT GGTCCGCCAG GCCCGAGCGG TGCGTGGAAC CGCCACGTTC
CGGCTGCGCT GCGACCCCCG GTTCGACTAC GGCCGGGCCC CGCACACCGT CACCCTGGTG
CCTGGTTCGG GCGCGGTGTT CGAGTCGACC GCCGGCACCC TTGTGCTGCG CACGGCCCTG
CCGCTGCGCG TCGAGGCCAC GGCCGTGGTC GCCGAGTTCG AGCTGGCGAT GGGCGAGTGT
GCCGACATCG TGCTCGAATG GAACTCGACC ATCCGCCCAT TGATCATCGG CGAGGCGGAG
ACGTTGTTCA CCCGGACGCT GAACTACTGG CAGGCCTGGA TCCGGCGCGG GCGCTATCAC
GGCCGGTGGC GGGAGATGGT GCTACGCAGC GCTCTCGTCC TCAAACTGCT CGTCTACCGG
CCCACCGGGG CGCTGGTGGC CGCGCCGACC ACCTCGCTAC CCGAGGAGCT CGGCGGCGTG
CGCAACTGGG ACTACCGCTA CACCTGGATC CGCGACGCGG CCTTCACGGT GTACGCGCTG
ATGGCGCTCG GATTCACCGA CGAGACCGCG GCGTTCATGG ACTGGCTCGA ACAGCGATGT
CACGAGGCAC CCAAGCACTG CGGGCTCTAC GTGCTCTACA GCGTGGACGG CAACGCCGAC
CTGGACGAGC TGGTCCTCGA CCAGCTGTCC GGCTATCGCG GCTCGAAGCC GGTGCGCATC
GGCAACGCCG CCGCCACGCA GCTCCAGCTC GACATCTACG GCGAGCTGAT GGACTCGGTG
TACCTGTACA ACAAGCAGGT CCCGATCTCC TTCCAGCTCT GGGAGGCGCT GGGCCGCCAG
CTCGACTGGC TGGCCAGGCA CTGGGACGAA CCGGACGAGG GCATCTGGGA GACCCGGGGT
GGGCGCCAGC GCTTCACCTA CTCGGCCGTG ATGACCTGGG TCGCCTTCGA ACGGGCCTGC
CGGATCTCGC GCCAGCGCGG CCTGCCCGGG CCGACGAACG AATGGAAGGA CTACGCCGGG
CGGGCCTACC GGTTCGTCCA GAACGAGGCG TGGAACCCCG CCCGCGGCGC CTACATGGAG
TTCCCCGGCT CACCACGGAT GGACGCATCC CTGCTGTGCA TGCCGCTGGT GAAGTTCTCC
GGCCCCACCG ACCCCCGGTT CCTGTCGACC CTGGAACGGT TCAGCGGGGA CCTGGTCAGC
GACAGTCTGG TGCGCCGGTA CGCCGCGGAC GGCAGCGACG GCCTCACGGG TGACGAGGGC
ACCTTCAACC TGTGCTCGTT CTGGTACGTG GAGGCACTGA CCCGGGCCGG TCGGGTCGCC
GAGGCCCGGA TGGTCTTCGA GAAGATGCTC ACCTACGCCA ACCATGTGGG GCTCTACGCC
GAGGAGATCG GTTCCTCCGG GGAAGCGCTC GGCAACTTCC CGCAGGCGTT CACCCACCTC
GCCCTGATCA GTGCCGCGAT CCATCTCGAC CGCGCGCTGG GGTGA
 
Protein sequence
MPDTDYPPIE EHAVIGDLRT VALVATDGTI DWYCPQRFDA PSVFASLLDA DRGGSFRIHC 
PDSRARQLYL PDSNILMTRF LAPRAVGEVI DFMVPVDSDV TETPHVVVRQ ARAVRGTATF
RLRCDPRFDY GRAPHTVTLV PGSGAVFEST AGTLVLRTAL PLRVEATAVV AEFELAMGEC
ADIVLEWNST IRPLIIGEAE TLFTRTLNYW QAWIRRGRYH GRWREMVLRS ALVLKLLVYR
PTGALVAAPT TSLPEELGGV RNWDYRYTWI RDAAFTVYAL MALGFTDETA AFMDWLEQRC
HEAPKHCGLY VLYSVDGNAD LDELVLDQLS GYRGSKPVRI GNAAATQLQL DIYGELMDSV
YLYNKQVPIS FQLWEALGRQ LDWLARHWDE PDEGIWETRG GRQRFTYSAV MTWVAFERAC
RISRQRGLPG PTNEWKDYAG RAYRFVQNEA WNPARGAYME FPGSPRMDAS LLCMPLVKFS
GPTDPRFLST LERFSGDLVS DSLVRRYAAD GSDGLTGDEG TFNLCSFWYV EALTRAGRVA
EARMVFEKML TYANHVGLYA EEIGSSGEAL GNFPQAFTHL ALISAAIHLD RALG
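
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that clean-up, together with a simple GC fraction calculation. The helper names clean_sequence and gc_fraction are hypothetical, and the example uses only the first displayed line of the gene sequence, so its GC fraction is not expected to match the 69% reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block separators, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence above (six blocks of ten bases).
first_line = "ATGCCCGACA CCGACTATCC GCCCATCGAA GAGCACGCGG TCATCGGCGA CCTGCGTACC"
fragment = clean_sequence(first_line)

print(len(fragment))               # 60
print(fragment.startswith("ATG"))  # True
print(gc_fraction(fragment))       # GC fraction of this fragment only
```

Applying clean_sequence to the full gene sequence block and then gc_fraction to the result would give a value comparable with the GC content field of the record.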