Gene Francci3_1668 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1668 
Symbol 
ID3903055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2002166 
End bp2003401 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content75% 
IMG OID637879006 
Productglycogen synthase 
Protein accessionYP_480773 
Protein GI86740373 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR02149] glycogen synthase, Corynebacterium family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTCG CCATGCTCAC CCGGGAGTAC CCCCCGGACG TCTACGGCGG GGCGGGTGTG 
CACGTCGAGT ACCTGAGCCG GGAGCTCGCC CGGCTTGTCG ACCTCACCGT CCACCGCGAG
GGGCCGGCGC CGGGCGGCGG GTCGGCGGGC GGCGGGCCCG CGGGTGCGGC GCCGTCGACC
GGCGCCGTCG ACCGACCGGT GGTCGAGAGG GCGGGTGCCC CGGGGGTCGC CGCGGTCGCG
GCGCACCGGG GCTGGGCTGC GCTCGCCGAC GCGAACGACG CGCTGCGGAC CGTGTCGATG
GACCTGTCCA TGGCCGCCGC GGCGGTCGGC GCGGACGTGA TCCACTCGCA CACCTGGTAC
GCGAATCTCG GTGGCCATCT CGCCGCGCTG CTCGGCGGCG TCCCGCACGT GATGACCTCG
CACTCGCTGG AGCCCCGGCG TCCCTGGAAG GCCGAGCAGC TGGGCGGGGG CTACCGGGTG
TCGTCCTGGT GCGAGCGGGT CGCGATCGAG TCGGCGGCCG CCGTGGTCGC GGTCAGCGCG
GGCATGCGCG TGGACATCCT CGATGCCTAC CCGGCGGTCG ATCCGGCCCG TGTGCACGTC
ATCCGCAATG GCATCGACAC CGACGAGTAC CGTCCGGACA CCGCGACCGA CGTGCTGGAA
CGTCACGGGG TGGATCCGGC CCGGCCGACG GTGATCTTCG TGGGGCGGAT TACCCGGCAG
AAGGGGCTGC CGGTGCTGCT GCGGGCGGCT GCGGCGATCG ACCCCCGGGC GCAGCTCGTG
CTCTGTGCCG GGGCCCCGGA CACCCCTGCG CTGCTCACGG AGATCACCGA CCTGGTCGAG
GGCCTGCGCG CAGGCCGCGA CGGCGTGGTC TGGCTGCCCG GGATGCTGAC GAAGCCGGAG
GTGATCCAGC TGCTCAGCCA TGCCACCGTG TTCGTCTGCC CGTCGGTCTA CGAGCCGCTC
GGCATCGTCA ACCTGGAGGC GATGGCGTGC GGCACCGCGG TCGTCGCCTC CCGGGTCGGT
GGCATTCCCG AGGTCGTCGA CGACGGCGTC ACCGGTCTGC TGGTGCCGCC CGGCGACCCC
GGCGCGCTGG CCGGGGCCGT CAACGAGGTG CTCGCGGACC CGGTCCGGGC GGCCGCGATG
GGCCATGCGG GCCGGGACCG GGCCGTCACC GAGTTCGGCT GGGCGGCGAT CGCCGAGCGC
ACCGCGCGGC TGTACGCATC GCTGACCGGC GGCTGA
 
Protein sequence
MRVAMLTREY PPDVYGGAGV HVEYLSRELA RLVDLTVHRE GPAPGGGSAG GGPAGAAPST 
GAVDRPVVER AGAPGVAAVA AHRGWAALAD ANDALRTVSM DLSMAAAAVG ADVIHSHTWY
ANLGGHLAAL LGGVPHVMTS HSLEPRRPWK AEQLGGGYRV SSWCERVAIE SAAAVVAVSA
GMRVDILDAY PAVDPARVHV IRNGIDTDEY RPDTATDVLE RHGVDPARPT VIFVGRITRQ
KGLPVLLRAA AAIDPRAQLV LCAGAPDTPA LLTEITDLVE GLRAGRDGVV WLPGMLTKPE
VIQLLSHATV FVCPSVYEPL GIVNLEAMAC GTAVVASRVG GIPEVVDDGV TGLLVPPGDP
GALAGAVNEV LADPVRAAAM GHAGRDRAVT EFGWAAIAER TARLYASLTG G