Gene Francci3_0730 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0730 
Symbol 
ID3905857 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp845016 
End bp846110 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content66% 
IMG OID637878063 
Productglycosyl transferase family protein 
Protein accessionYP_479843 
Protein GI86739443 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0463] Glycosyltransferases involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.578304 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.224353 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACCACC GCCACGGACC GGCGATCGGA GCGGGAACGA CGACGTCGGA CCGCCAGGAC 
GCCGGCCCGG GACGCCGCTG GGCCGGAACG ACGACCGGGA GCCACACCGA CGTGAGTGTC
AGCGCCATCC ACCGACCTGA ACATCCGCAG GCCGGCACCC CAGTGCAGGC GGCTTCCGGG
GACGGCGTCG CCGCCGTCGG TCCGGACGCC CCCTACGTGA CCATCGTGCT GCCCTGCTAC
AACGAGCAGG ACCATGTGCT GCTCGAACTC GAGCGCATCA CCGCCGCGAT GGACGCGAGC
GGTTACTCCT ACGAGGTCCT GGCGATCGAC GACAAGTCGA CCGACAACAC GCTCGCGGTG
CTGCGCGAGG TGGCCCCGCG GTTTCCGCGC ATGCGGGTGA TGCCGTTCCG CCGCAACGGT
GGTTCCGGCA CGGCCCGCCG GATCGGGACC CAGGAGGCCC GCGGCAAGAT CGTCGTCTGG
ACCGACGCGG ACATGACGTA CCCGAACGAG CGCATCCCGG AGTTCGTCCG TTACCTGGAC
GACAACCTCG ACGTCGACCA GGTCGTCGGG GCGCGCCGCA CCGAGGAGGG CACTCACAAG
TGGGCCCGGG TGCCGGCGAA ATGGTTCATC CGGATGATCG CCCAGCGGCT GTCCGGCATG
AAGATTCCCG ACCTCAACTC CGGGCTTCGC GCCTTCCGCC GGGACGTCTC CCTGCCCTAT
CTGCGGCTGC TGCCCCCCGG TTTCTCCTGC GTCACCACGA TCACGATGTC CTTCCTGTCG
AACCAGCATC CGGTGGATTA CATCCCGATC GATTACGCAA AACGGTCCGG AACGTCGAAG
TTCCATCCCT TCCGGGACGC GCGGCGTTAC ATCCTTCAGG TGCTGCGGAT GGTGATGTAC
TTCGATCCGA TCAAGGTTCT CATGCCGGTC GCCCTGTGGA TCATGGGCTT GGGTTTCGTC
AAGCTGATCG TGGACCTGAT CCGCTACGAC TTCCATGTGG CGACGTCAAC GCTGCTGGCG
ATTCTGGTCG GCTTCCAGAT CGTCGTGCTG GCGTTGATCG GCGATCTGGT GGCCCGCTCG
CGCAGCGACA CCTGA
 
Protein sequence
MHHRHGPAIG AGTTTSDRQD AGPGRRWAGT TTGSHTDVSV SAIHRPEHPQ AGTPVQAASG 
DGVAAVGPDA PYVTIVLPCY NEQDHVLLEL ERITAAMDAS GYSYEVLAID DKSTDNTLAV
LREVAPRFPR MRVMPFRRNG GSGTARRIGT QEARGKIVVW TDADMTYPNE RIPEFVRYLD
DNLDVDQVVG ARRTEEGTHK WARVPAKWFI RMIAQRLSGM KIPDLNSGLR AFRRDVSLPY
LRLLPPGFSC VTTITMSFLS NQHPVDYIPI DYAKRSGTSK FHPFRDARRY ILQVLRMVMY
FDPIKVLMPV ALWIMGLGFV KLIVDLIRYD FHVATSTLLA ILVGFQIVVL ALIGDLVARS
RSDT