Gene Francci3_2035 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2035 
Symbol 
ID3906752 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2396439 
End bp2397497 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content71% 
IMG OID637879372 
ProductNAD-dependent epimerase/dehydratase 
Protein accessionYP_481138 
Protein GI86740738 
COG category[G] Carbohydrate transport and metabolism
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0451] Nucleoside-diphosphate-sugar epimerases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0499801 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCTGC TGATCCTGGG AGGAACCTGG TTCGTCGGCC GTGTCCTGGC TGAGGACGCG 
GTGGGCCGTG GCTGGGCGGT CACGACGTTC AACCGGGGTA GGTCCGGACC GGATGTCGCA
GGGGTCCACC CGTTGCGCGG CGACCGAACC GACGTCCAGG ATCTTGAACG CCTCGCGGCG
GCGGGGCCGT GGGATGCCGT GGTGGACGTC GGCGGAGCGG AGCCCCGCTC GGTCGGCCTG
GCCGCTCAGG TTCTGGGCGC GCAGGCCGGT CGGTACGTGT TCGTGTCGAC CGTCTCGGTG
TATCGCGACT GGCCCGCGTC CCCGGTCGAC GAATCCTCAC CTCTACATCC GGGAAACCCC
GATCTTGTGG TGGAAGATCC TCGCTGGGAC GCGGTGCGGT ACGGCCCCCA CAAGGCCGGG
TGTGAGGCCG CGGTCCGGCG GAGCGTTTCC CCGGATCGGC TGCTCATGGT GCGGCCGGGG
GTGGTTCTCG GCCCGTACGA GTACGTCGGA CGGTTGCCGT GGTGGCTTCG GCGGATGGCG
CGCGGCGGGC GGGTGCTGGC CCCCGCACCC GCCGACAGGC CGATCCAGCC TGTGGACGTG
CGTGACCTCG CGTCGTTCCT GCTCGACCTG ATCGGGCGGT CGGCCAGCGG CATCTTCAAC
GTCGCGGCGC CCACCGGCCA CGCGACCTAC GGCCGGATGC TGGACGCGTG CGCTGCGGCG
ACGCGGGACG TCCGAGGCGC AGATGAGATC GAGGTTGTCT GGGCGGAACC CGATTGGTTG
GTCGAACAGG GGGTGCGTCA GTGGACGGAG ATCCCGCTGT GGCGGGTGCA GCCAGGGACA
TGGCGCCTGG ATGCGACCCG CGCGGCGGCG GCGGGCCTGC GTTGCCGGCC GATCGAGAAG
ACGGTCCTGG CCACGTGGGC GTGGCTGGCG GCCGGTGGCG CTCCGGTCCG GCATGAACGT
CAGGACGAGC ACGGTTTCGA CCCCGACAGA GAGCGCCGCC TCGTCGACCT GTGGGAGTGC
CGGTCACAGG CCGCCTCCGG CGAGAAGGGC CTGGTGTGA
 
Protein sequence
MRLLILGGTW FVGRVLAEDA VGRGWAVTTF NRGRSGPDVA GVHPLRGDRT DVQDLERLAA 
AGPWDAVVDV GGAEPRSVGL AAQVLGAQAG RYVFVSTVSV YRDWPASPVD ESSPLHPGNP
DLVVEDPRWD AVRYGPHKAG CEAAVRRSVS PDRLLMVRPG VVLGPYEYVG RLPWWLRRMA
RGGRVLAPAP ADRPIQPVDV RDLASFLLDL IGRSASGIFN VAAPTGHATY GRMLDACAAA
TRDVRGADEI EVVWAEPDWL VEQGVRQWTE IPLWRVQPGT WRLDATRAAA AGLRCRPIEK
TVLATWAWLA AGGAPVRHER QDEHGFDPDR ERRLVDLWEC RSQAASGEKG LV