Gene Francci3_2494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2494 
Symbol 
ID3904872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2941957 
End bp2943768 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content75% 
IMG OID637879824 
Productglycoside hydrolase 15-related 
Protein accessionYP_481590 
Protein GI86741190 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3387] Glucoamylase and related glycosyl hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCGG AGCGGAGCGA GCGGGACGAT AATCCGGGGC TGGTGCCGAC GCTGCGTGAG 
TTCGCCCTGC TCGCGGACGG GGAACGGGGT GCGCTGATCG GCCCGGACGG CCGCGTGTGT
TGGATGTGCG TGCCCTCGTG GGCGGACGAC GCGGTGTTCT CGACGCTGAT CGGTGGCGCC
GGCTCCTACT CGGTGACTCC GGCGACCGCC TTCACCTGGG GCGGTTACTA CGAGGACCGG
TCGTTGATCT GGCACAGCCG GTGGGTGACG CGGGCGGGCG TGGCGGAGTG CCGGGAGGCT
CTCGCCCTCC CCGGGCGGCG GCACCGTGCG GTGCTGCTGC GCCGGATTCT CGGGGTCGAG
GGGACCGTGC CGGTGCGGGT GACGCTGGAT CCGCGTCCCG GGTTCGGCCG GCGGGCGGTG
GCGGGGCTGC GGCAAGACGG GCTGGACGGG CCGGACGGGG TGGAGGCCTG GTGCGGGCGA
GTCGGCGAGC TGGAGCTGCG CTGGTCGGGA CGGCTCGCGG CGGTCGCCCG GATCGACCAT
GATGCGGGCG GTGGCAGTCT CGTCGCGGAG CGCCTGGTGC GGGCCGGCGA CTGTCTCGAT
CTTGTGCTGG AGCTCGGTGA GCTGCCGGCG GACCGTCCGG TCCCGGACGT GCTGTGGGCC
GAGACGAAGC GGGCCTGGCG GTCCGCGGTA CCGGAGCTGT CGAGCCTGGC CGGTGTCCGC
GACGCTCAGC ACGCGGTCGC GGTGCTGCGC GGGCTGACCG GCGGCGGGCA CGGGATGGTG
GCCGCGGCCA GCACCTCCCT GCCCGAGCGG GCCCGTCAGG GCCGCGACTA CGACTATCGG
TATGTCTGGA TCCGAGATCA GTGCTACGCG GGGCTGGCGG CCGCGCGGGC CGGCCTCGAC
GGGCTGCTCG ACGACGCCGT CACGTTCGTC ACCGAACGGC TGTTGGCGGA CGGCCCGAAC
CTGCGTCCGG CCTACACCGC CCGCGGTGGC CCGATCCCGG ACGAGCGCCG ACTCGATCTG
CCGGGGTACC CCGGCGGCGA CGACGTGGTC GGGAACAAGG TCGGCCACCA GTTCCAGCTG
GACGTCTTCG GGGAGGCGGC GCTGCTGCTG GCGGCCGCGG CGCGGCGGGG CCGGCTGACC
GATGACGCCC GGCGGGCCGC GGCGGTGACC GCCGGGGTGA TCGCGGCACG CTGGCGGATC
CCGGACACAG GGGTCTGGGA GCTGGAGCCG CGGCACTGGA CGCACAGCCG GTTGACCTGC
GCGGCAGGGC TGCTCGCCTT GGCCGGCGTG GACGGCGCCG GGGTGGGGGA GGCCGACGGC
TGGCGGGCGC TGGCAAGGCG GCTGCTCAGC GCCGCCCGCG AGGAGATGCG GCATTCGAGC
GGGCGCTGGC AGCGGGCGAC CGACGACCCG CGGGTGGACG CGGCGCTGCT GCTGCCCGCG
GTGCGGGGAG CGGTCGACGT CCGCGACCCG GTGTCGGTGG CGACGCTGCG GGCGGTGGCC
GAGCAGCTGA CCGACGACGG CTACGTCTAC CGATTCGATC ATGCCGGGAT CCGGCTCGGG
GTGGCCGAAG GGGCCTTCCT GCTGTGCGGC TTCTGGCTGG CGCTGGCCTA CCACCGTGCC
GGCCGGCACG ATCAGGCCCG GCGGTACTTC GAGCGTGGCC GCGCCGCCTG CGGGCCGCCC
GGCATCTACG CCGAGGAGTT CGATGTCACT CAGCGCCAGC TGCGTGGCAA CATTCCCCAG
GCGTTCGTCC ATGCCCTGGT TCTGGAGACT GCCGCCACCC TCCGACCCCT CGACGGCACC
GACCCGGGAT GA
 
Protein sequence
MTAERSERDD NPGLVPTLRE FALLADGERG ALIGPDGRVC WMCVPSWADD AVFSTLIGGA 
GSYSVTPATA FTWGGYYEDR SLIWHSRWVT RAGVAECREA LALPGRRHRA VLLRRILGVE
GTVPVRVTLD PRPGFGRRAV AGLRQDGLDG PDGVEAWCGR VGELELRWSG RLAAVARIDH
DAGGGSLVAE RLVRAGDCLD LVLELGELPA DRPVPDVLWA ETKRAWRSAV PELSSLAGVR
DAQHAVAVLR GLTGGGHGMV AAASTSLPER ARQGRDYDYR YVWIRDQCYA GLAAARAGLD
GLLDDAVTFV TERLLADGPN LRPAYTARGG PIPDERRLDL PGYPGGDDVV GNKVGHQFQL
DVFGEAALLL AAAARRGRLT DDARRAAAVT AGVIAARWRI PDTGVWELEP RHWTHSRLTC
AAGLLALAGV DGAGVGEADG WRALARRLLS AAREEMRHSS GRWQRATDDP RVDAALLLPA
VRGAVDVRDP VSVATLRAVA EQLTDDGYVY RFDHAGIRLG VAEGAFLLCG FWLALAYHRA
GRHDQARRYF ERGRAACGPP GIYAEEFDVT QRQLRGNIPQ AFVHALVLET AATLRPLDGT
DPG