Gene Francci3_4442 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4442 
Symbol 
ID3907418 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5308495 
End bp5310402 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content68% 
IMG OID637881774 
Productglycoside hydrolase 15-related 
Protein accessionYP_483517 
Protein GI86743117 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.807083 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.234797 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACATT CACCGGTGCG GCGCAGCGGG GGCAGTCCCT TCCCGCCGAT CGCCGAGTAC 
GGATTCCTGT CCGACTGTGA GACGTCCTGT CTCGTTGCCC CCAGCGGCAA CGTCGAGTGG
ATGTGTGTGC CCCGGCCCGA TGCCCCGAGT GTCTTCGGGT CGGTGCTGGA CCGGTCGGCC
GGGGGCTTCC GTTTCGGTCC CGAGCGTACG CAGATCCCGG CCGGCCGGCG CTACCTCCCC
GGGACGAACA TTCTCGAGAC CACCTGGCAG ACGCCGAACG GCTGGCTCAT CGTCACCGAT
TGCCTGGTGG TCGGGCGCTG GCATCGCACC CACAAGCGTT CGAACACGCA CCGTCGGACG
CCGAGCGACT GGGACGCCGA CCACGTGCTG CTCCGGCTGG CTCGCTGTGA GCACGGCTCG
GTGGATCTCA GCCTGGTGTG CGAGCCGAAC TTCGACTACG GCCGTGAGCC CGAGTCCTGG
CAGTACGAGG GAGAGGATTA CTCCTCCGGC GTGATCAGCC ATGCGGGCAC CAACGTCACG
CTGCGGCTGC GTACCGACCT GCGGCTCGGT TTCGACGGCC GGCGGGCACT GGCTCGCACC
ACGCTGCGGG AGGGGGACAC CGCGTTCGTC GCGCTCACCT GGCGCCCGGA GGAGCCCCTG
CTTCCGGACA CCTACCTGCA GGCGTGCGTC GCCGTCGACC GGACAACGGA GTTCTGGCGC
CAGTGGCTGT CGCGGGGAAT GTTTCCCGAC CATCCCTGGC GGCGCCATCT GCAGCGCAGC
GCCCTCGCGC TGAAAGGACT GACCTACGCG CCGACCGGTG CTCTGCTCGC CGCCTCGACC
ACCTCGTTGC CGGAGACCCC GCGCGGCGAA CGGAACTGGG ACTACCGATA CAGCTGGATC
CGCGATTCCA CCTTCGCGCT GTGGGGCCTG TACACCCTCG GGCTGGACTA CGAGGCCAAC
GATTTCTTCT CGTTCATCGC CGACGTCGCC GAGAACGATG ACGACGACAT CCAGGTGATG
TACCGGGTGA GCGGCGAGCC GAAGATCGAC GAGGAGGTCC TCGGTCACCT GTCCGGCTAC
GAGGGAGCCT ACCCGGTACG GATCGGCAAC GCAGCGGCAC TGCAACGTCA ACACGACGTC
TGGGGTGTCG TTCTCGACTC CGTGTACCTG CACACCAAGT CCCGTGACTA CCTCTCCGAA
CGGCTCTGGC CGGTGCTCGT GCGCCTCGTC GAGGCGGCGG CCACGCACTG GCGGGAGCCG
GACCGCGGGA TGTGGGAGGT GCGCGGGGCG CCGCAGCACT TCACCGTGTC CAAGATGATG
TGCTGGGTTG CCCTGGACCG GGGTCGGCGG CTGGCGCAGA TGCGGGGGGA CGCCAAGACG
GCGGCGCGCT GGCGGGCCGT GGCGGAGGAG ATCCACGCCG AGGTGTGCGA GAAGGGTGTG
GATCATCGCG GTGTGTTTAC CCAGTACTAC GGATCGAAGG CGCTCGACGC CTCACTGCTC
CTGATTCCGC TGCTCGGGTT CCTGCCGGCG GCCGACGAAC GCGTGAAGGC CACCGTGCTC
GCGATCGCCG ACGAGCTGAC CGTTGACGGG CTGGTCCTGC GCTACCGCAC GGAGGAGACC
GACGACGGGG TCTCCGGCAC CGAGGGCGCC TTCCTGATCT GCTCGTTCTG GCTGGTCTCC
GCCCTGGTGG AGATCGGCGA GCTGACCCGT GCCCGGCAGC TGTGTGAACG GCTGCTGAGC
CTCGCCAGCC CGCTGGATCT CTATGCCGAG GAGATCGATC CGGTCAGTGG CCGCCATCTC
GGAAACTTCC CCCAGGCGTT CACCCATCTG GCCCTGATCA ACGCGGTGAT GTACGTGATC
AGGGCCGAGG ACGCCGAGGC CTACGCCCGG CCGTCCCCCT CGACCTGA
 
Protein sequence
MAHSPVRRSG GSPFPPIAEY GFLSDCETSC LVAPSGNVEW MCVPRPDAPS VFGSVLDRSA 
GGFRFGPERT QIPAGRRYLP GTNILETTWQ TPNGWLIVTD CLVVGRWHRT HKRSNTHRRT
PSDWDADHVL LRLARCEHGS VDLSLVCEPN FDYGREPESW QYEGEDYSSG VISHAGTNVT
LRLRTDLRLG FDGRRALART TLREGDTAFV ALTWRPEEPL LPDTYLQACV AVDRTTEFWR
QWLSRGMFPD HPWRRHLQRS ALALKGLTYA PTGALLAAST TSLPETPRGE RNWDYRYSWI
RDSTFALWGL YTLGLDYEAN DFFSFIADVA ENDDDDIQVM YRVSGEPKID EEVLGHLSGY
EGAYPVRIGN AAALQRQHDV WGVVLDSVYL HTKSRDYLSE RLWPVLVRLV EAAATHWREP
DRGMWEVRGA PQHFTVSKMM CWVALDRGRR LAQMRGDAKT AARWRAVAEE IHAEVCEKGV
DHRGVFTQYY GSKALDASLL LIPLLGFLPA ADERVKATVL AIADELTVDG LVLRYRTEET
DDGVSGTEGA FLICSFWLVS ALVEIGELTR ARQLCERLLS LASPLDLYAE EIDPVSGRHL
GNFPQAFTHL ALINAVMYVI RAEDAEAYAR PSPST