Gene Francci3_3098 details


Gene Information

Locus tag: Francci3_3098
Symbol:
ID: 3904224
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3670095
End bp: 3671225
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 72%
IMG OID: 637880419
Product: glycosyl transferase, group 1
Protein accession: YP_482184
Protein GI: 86741784
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
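
The Start bp, End bp, Gene Length and Protein Length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3670095, 3671225)
print(length)                  # 1131
print(protein_length(length))  # 376
```

Both values agree with the Gene Length and Protein Length fields listed in the record.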


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.034289
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.933214
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGTGC TGGCCGTCAC GAACGATTTC CCGCCCCGTC CCGGCGGGAT CCAGGCCTAC 
GTGCACAACT TCGCCTCGCG GCTGCCCGAG GGCGAGATCG TCGTCTACGC CCCGGCCTGG
AAGAACGCCG CCGCCTTCGA CGCCGAACAG AACTTCCCGG TCGTGCGGCA CACCACGTCG
CTAATGCTGC CGACACCGGA CGTCCTGCGC CGAGCCAGAG AGATCGCCCG GGCGGAGGGA
TGCGACACGA TGTGGTTCGG CGCCGCGGCA CCGCTCGGGC TGCTTGGAGC CCGGCTGCGC
CGCGACACGG CCATGCGCAG GATGGTCGCG AGCACCCATG GTCACGAGGT CGGCTGGGCG
GCGCTGCCGG GGGCGCGTCA GGCGTTGCAC AGCATCGGCA CCGCGGCCGA CGTCATCACC
TATCTGACCG ACTACACCCG GGCCCGGATC CGGCCCGCCT TCGGCGGCCA TCCCACCTTC
GCCCGGCTGC CCAGCGGAGT CGACCCCTCG CTGTTCCATC CCGGTCACGG GCGCGAGGAG
ATGCGCCGGC GCCACGGGCT GACGGGCCGC CGGGTGGTGG TGTGCGTAAG CCGGCTGGTC
GCCCGCAAGG GCCAGGACAT GCTGATCAGG GCGCTGCCCA TGGTACGGCG CCGCGTACCG
GACGCCGCGC TGCTGATCGT CGGCGGCGGT CCCCGGCGGG GTGACCTTGA ACGGCTCGCC
CGGGAGAACG ACGTCGCCGA GCATGTGATC ATGACTGGTT CGGTGCCGTG GGAGGAACTG
CCGGCGCACT ATGCGGCGGG CGATGTGTTC GCGATGCCCT GCCGCTCCCG CCTCGCCGGC
CTGGAGGTCG AGGGGCTCGG CATCGTCTTC CTCGAGGCGT CGGCGACCGG CCTGCCGGTG
GTGGCCGGCC GCAGTGGGGG TTCCCCCGAC GCCGTCCTGC ACCAGCACAC CGGCATCGTG
ATCGACGGTA CCGATCTGGC GCAGGTCGTG ACGACCATCG GTGATCTTCT TGCCGACCCC
GACCGGGCGG CGTCGATGGG TGCCGCGGGG CGGGCGTGGG TCGAGCTGCG CTGGCGGTGG
GACGTCCTCG CGCAGGACCT GCGCACGCTG CTCGCCGGCC CGGACGGTTA G
 
Protein sequence
MRVLAVTNDF PPRPGGIQAY VHNFASRLPE GEIVVYAPAW KNAAAFDAEQ NFPVVRHTTS 
LMLPTPDVLR RAREIARAEG CDTMWFGAAA PLGLLGARLR RDTAMRRMVA STHGHEVGWA
ALPGARQALH SIGTAADVIT YLTDYTRARI RPAFGGHPTF ARLPSGVDPS LFHPGHGREE
MRRRHGLTGR RVVVCVSRLV ARKGQDMLIR ALPMVRRRVP DAALLIVGGG PRRGDLERLA
RENDVAEHVI MTGSVPWEEL PAHYAAGDVF AMPCRSRLAG LEVEGLGIVF LEASATGLPV
VAGRSGGSPD AVLHQHTGIV IDGTDLAQVV TTIGDLLADP DRAASMGAAG RAWVELRWRW
DVLAQDLRTL LAGPDG
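
The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling, of how the blocks could be joined, the GC percentage computed, and the gene translated. It assumes that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in permitted start codons, which this sketch ignores); all function names are hypothetical.

```python
# Codon table in TCAG order; amino-acid assignments of the standard code,
# which translation table 11 is assumed to share.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text):
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna):
    """Percentage of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError if a codon contains a character other than A, C, G, T.
    """
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence as printed above.
print(translate("ATGCGCGTGC TGGCCGTCAC GAACGATTTC"))  # MRVLAVTNDF
# Last four codons of the gene sequence, ending in the TAG stop codon.
print(translate("CCGGACGGTTAG"))  # PDG
```

The two fragments reproduce the first ten and last three residues of the protein sequence listed above. Passing the full gene sequence text to `gc_content` gives a value that can be compared with the GC content field.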