Gene Francci3_1907 details


Gene Information

Locus tag: Francci3_1907
Symbol:
ID: 3906856
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2239756
End bp: 2241036
Gene Length: 1281 bp
Protein Length: 426 aa
Translation table: 11
GC content: 74%
IMG OID: 637879245
Product: glycosyl transferase, group 1
Protein accession: YP_481012
Protein GI: 86740612
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
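
The coordinate and length fields above are consistent with one another, and the relationship can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive positions and that the unspliced CDS ends with a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2239756
end_bp = 2241036


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1281, matches "Gene Length"
print(protein_length(length_bp))  # 426, matches "Protein Length"
```

Under these assumptions, the 1281 bp span corresponds to 427 codons, of which the last is the stop codon, leaving 426 amino acids.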


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.278952
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTGCC ACCGTGCTCG TGTGCGCATA GCGGTGATCA CCGAGTCGTT CCTCCCCCAT 
GTTGACGGGG TGACGAACAC GGTCTGTCGA GTGCTGGAAC ATCTGCGGGA CCGGCAGCAC
GAGGCGATGG TCATCGCGCC GGCTCCGGCG CCCGCCGCCC GACGCGCGGC GGCGCGCAGC
CACGCCGGCG CACCGGTGCT GTGGGCACCG TCGGCTCCCC TACCGGGTTA CCCGGCGTTT
CGCTTCGCAG TCCCGTGGCC CGGGCTTCCC GCCGCCTTAC GGGAGTTCAA CCCGGACATC
GTCCATCTGG CGGCGCCCGC CGGCCTCGGG GCGCAGGCGG TGTTCGCGGC GCGGCGCCTC
GGCATACCGA GCATCGCCGT CTACCAGACG GACATCGCCG CGTTCGCAGC CCGCTACGGG
CTGGCCACCG CGGAACGCAC GATCTGGCAC TGGCTCGCCA TCGTACATCG GCTCGCGGCC
CGGACCCTCG CGCCGTCCTG GGATGCCGTC GACACCCTCC TGAGCCAGGG CGTGCAACGG
GTCGCCCGCT GGAGTCGGGG GGTGGACCTC GAACGTTTCC ACCCGGCGCA CCGGGACGAC
GAGCTGCGCC GCCGCCTCGC TCCGAACGGC GAGGTCCTGG TCGGCTACGT CGGCCGGCTG
GCCCGGGAGA AGCGGGTGGA GCTGCTCGGA GCGGTGTCCG ACATCCCGAA CACCCGGCTC
GTCGTCGTCG GTGACGGCCC CTCCCGTCCC ACCCTGGCCC GGTCGATGCC GAACGCGGCG
TTCCTCGGAT TCCGCGCCGG GCAGGAGCTC TCGGCCGCGG TCGCGAGCCT CGACGTCTTC
GTCCACACCG GCATCCACGA GACATTCTGC CAGGCGGCGC AGGAGGCCAA GGCGAGCGGG
GTACCGGTGG TCGCCCCTGC GGCGGGCGGG CTGCTTGACG TTGTCGAGCA CGGCCGCACC
GGGCTGCACT ACACTCCCGG CGACCCCGCG GCCCTGCGGG CGCAGGTCGC GGCCCTGACC
GACGACCTCC CTCGTCGGGT GGCGATGGGT GCGGCGGCCC GGGAGTCGGT CGCGGGATGC
GGGTGGAGCG CGATCGGCGA CGAGCTGCTC GGCCACTACC GTGACGTGCT CGGCACCGGC
GGCGGGGCCG GTCGCTTCGG CCGCCTCAGG CACCCCGGCC CGGTCATCGA GGGACGGAGC
GGACGGCGGG GCGGACGACG GGACGGATGG GACGCACGCA AGCGGCCCGG GACGGGAAAC
GACGACGGGT GGTCAGCATG A
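
The gene sequence is printed in space-separated blocks of ten bases, so it has to be joined before its length or base composition can be compared with the Gene Length and GC content fields. The following is a minimal sketch of that step; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used here as sample input.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied from this record.
first_line = "ATGCCCTGCC ACCGTGCTCG TGTGCGCATA GCGGTGATCA CCGAGTCGTT CCTCCCCCAT"
seq = clean_sequence(first_line)

print(len(seq))          # 60 bases in the first line
print(gc_fraction(seq))  # GC fraction of this line only
```

Passing the complete sequence block to `clean_sequence` instead of a single line would give a string whose length and GC fraction can be compared with the values reported under Gene Information.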
 
Protein sequence
MPCHRARVRI AVITESFLPH VDGVTNTVCR VLEHLRDRQH EAMVIAPAPA PAARRAAARS 
HAGAPVLWAP SAPLPGYPAF RFAVPWPGLP AALREFNPDI VHLAAPAGLG AQAVFAARRL
GIPSIAVYQT DIAAFAARYG LATAERTIWH WLAIVHRLAA RTLAPSWDAV DTLLSQGVQR
VARWSRGVDL ERFHPAHRDD ELRRRLAPNG EVLVGYVGRL AREKRVELLG AVSDIPNTRL
VVVGDGPSRP TLARSMPNAA FLGFRAGQEL SAAVASLDVF VHTGIHETFC QAAQEAKASG
VPVVAPAAGG LLDVVEHGRT GLHYTPGDPA ALRAQVAALT DDLPRRVAMG AAARESVAGC
GWSAIGDELL GHYRDVLGTG GGAGRFGRLR HPGPVIEGRS GRRGGRRDGW DARKRPGTGN
DDGWSA