Gene Francci3_0735 details

Gene Information

Locus tag: Francci3_0735
Symbol:
ID: 3905862
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 850033
End bp: 851163
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 72%
IMG OID: 637878068
Product: glycosyl transferase, group 1
Protein accession: YP_479848
Protein GI: 86739448
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
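
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(850033, 851163)
print(length)                     # 1131
print(protein_length_aa(length))  # 376
```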


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.605031
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGGACCCC GGGTGCTCGT CGACGCCACT TCGGTCCCGG CCGACCGTGG TGGCGTAGGG 
CGTTATGTCG ACGGGTTGGT CGCGGCGCTG GGCGCGACCG GCGCGGACAT GGCGCTCGTC
TGCCAACGGT CGGATGAGGA ACGCTACAGC CGGATGGCCC CACACGCCAC CGTTCTGTCC
GGGCCGGCGG CCATCGCCCA CCGGCCCGCG CGGTTGGCCT GGGAACAGAC CGGGCTGCCA
CTGGTCGCCG AACAGGTGAA TGCCGATGTC ATCCATTCAC CGCACTACAC GATGCCGTTG
CGGGCGCACC AGCCCGTGTG CGTGACGATT CACGATGTCA CCTTCTTCAC CGAGCCCGAC
ATGCACACCG CGGTGAAGGG CACGTTCTTC CGGTCCGCGA TGCGCACCGC GGTGCGCCGG
GCGGCCCGGA TCATCGTCCC CTCGAAGGCC ACCCGGGACG AGTTGGTGCG GGTGCTCGAC
GGGGAGTCGA CGACCACCGA CGTCGCCTAC CACGGCGTCG ACACGTCGAC CTTCCATCCG
CCGACCGAGG AGGACAAGCG GCGGGTGCGC CGCCGGCTCG GGCTCGGCGA CTCGCGCTAC
GTCGCGTTCC TCGGGATGCT GGAGCCGCGC AAGAACGTGC CGAACCTCAT CCGGGGCTGG
GCCGAGGCGG TGCACTGGCG GGACGAGCCG CCCGCGCTCG TCCTGGCCGG CGGCTCCGGC
TGGGACGACG ACGTCGATGC CGCCGTCGCC TCGGTGCCGA GCCACCTGCG GGTGATCCGG
CCGGGTTATC TGCGCTTCTC CGACCTGCCG GGCTATCTGG GAGGTTCGTC ACTGGTGGCG
TATCCCTCGC ACGGTGAGGG CTTCGGGCTG CCGGTGCTGG AGGCGATGGC CTGCGGCGCC
CCGGTTCTCA CCACGCCGCG TCTGTCGCTG CCGGAAGTCG GCGGCGACGC CGTCGCCTAC
ACTCAGCCGG ACGCCGGCTC GATCGCCCGG GAGATGGGCG CCCTGCTCGA TGACGCCGAG
CGGCGCAGGC AACTCGGTGA GGCCGGTCTC GCCCGGGCCC GGGAGTTCAC CTGGGCGGCC
AGCGCCGAGG CGCATCTCGC GAGCTACGCG CGGGCGGTCG CCGAGCACTG A
 
Protein sequence
MGPRVLVDAT SVPADRGGVG RYVDGLVAAL GATGADMALV CQRSDEERYS RMAPHATVLS 
GPAAIAHRPA RLAWEQTGLP LVAEQVNADV IHSPHYTMPL RAHQPVCVTI HDVTFFTEPD
MHTAVKGTFF RSAMRTAVRR AARIIVPSKA TRDELVRVLD GESTTTDVAY HGVDTSTFHP
PTEEDKRRVR RRLGLGDSRY VAFLGMLEPR KNVPNLIRGW AEAVHWRDEP PALVLAGGSG
WDDDVDAAVA SVPSHLRVIR PGYLRFSDLP GYLGGSSLVA YPSHGEGFGL PVLEAMACGA
PVLTTPRLSL PEVGGDAVAY TQPDAGSIAR EMGALLDDAE RRRQLGEAGL ARAREFTWAA
SAEAHLASYA RAVAEH
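
The protein sequence can be checked against the gene sequence by translating it under translation table 11. The sketch below is a minimal, dependency-free illustration: it strips the block spacing used in the listing, computes the GC fraction, and translates codons with the standard amino acid assignments, which table 11 shares. As a simplifying assumption, only ATG, GTG and TTG are treated as start codons read as methionine. All function and variable names here are hypothetical.

```python
BASES = "TCAG"
# Standard amino acid assignments in TCAG order; table 11 uses the same ones.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: only the most common bacterial start codons are handled.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at the first stop."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    return "".join(residues).split("*")[0]


# First 30 bases of the gene sequence, copied with the original spacing.
first_30 = "GTGGGACCCC GGGTGCTCGT CGACGCCACT"
print(translate_cds(first_30))  # MGPRVLVDAT
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` should allow a comparison with the listed protein sequence and GC content, provided the sequence contains only unambiguous A, C, G and T bases.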