Gene Francci3_0965 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0965 
Symbol 
ID3903872 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1141010 
End bp1142944 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content71% 
IMG OID637878299 
Productcell wall biosynthesis glycosyltransferase-like protein 
Protein accessionYP_480078 
Protein GI86739678 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACATCCG ATTCTGATTC CGAAGTCCCG CGTGCGCGGG CGCTCCCCCG CGGCGCGCGT 
CGGCATGCCC ATGCGCACCG AGATCCCCGC GTGCCCCGAC GCTCACCTCG GCTGTGGATG
GTAGCGGCCG CGGGGTGGTG GGCCGCGGGG TGGGGGGCCG CCCTGCACAC CACGTCGGCC
CTCGTGTTGT CCGCGTTGGT CGGTCTGGAC GGGAGCATGT CTTACCTCTT GCCCGGCGCA
CTCGCGGTCG TCCTGCCTCG GATGTTCGGC CGGGGTGGCC GGGGCCGGCG GATCCCGTGG
CTCGCCCGGC TCGCGTGCGA GTCGGGCGGC TGGTACGTCT ACATGTTCGC CGGTTCGCCG
GACATCAGGC TGGTCTCGTA CGGCGGTCCG CGCCTGGGCC AGCCGTTCGC CGTGGTACTG
GGCGCGGTGG CGGCGGTGAC CGGCGGCCTG GTCCTGTCCG GTCGGCTCCG GGCGCGGACC
CGCGGGTCGA TCCCCGGCGC CAGCCGCCGG CGGCGCTGGG TGTGGGCGCT GGTCAGCGTG
CCCCTCGCGG TGACGCTGGG GGTGCGTGCG GCCGAGCATC ACCACGTCGT GCTACTCGCC
GCATGGACGA TCGTCTCGTT GCTGATGCTG CTGGTCGCCA GCCTGACCCT CGTGCGCGGC
CAGTACAGCT GGCAGCGTCC CGAGCGGGTG CATCACGTGG ACCTGACCGG CAACCTCGCG
CCCCGCAACC GCTTCTCCCT CATCGTCCCC GCCCGCGACG AGCCGGTGCT CGGGCGGACC
CTGACCCAGA TCCTTGCCGG CGACTACCCC GGCGACCTGG TGGAACTGGT CGTGATGGTG
TCGTATGACG AGGTCGACCA GGAGACCCGA CGGGTCGCCG AGGAGATCGC CGGCCGGCAT
TCCAACGTGC GGGTCGTCGC GCCCGAAGGC AGCCGGCGTA GTAAACCCCT GTCCCTGGAG
GACGCCCGCC GGCACTGCAC CGGCGACCTG GTCGGCGTCG TCGACGCGGA GTCCCTGCTC
GCCGGGGGAC TCCTGCGTTA CGTGAACACG CTGGCCCTGC GCCACGCCGA CGTCGGCATC
TTTCAGGGCG GCGTGCAGCT CATGAACGCG CGTGCCACGG CGTGGCGGCG CGCCGCGGAC
CACTCACCGA TACGGGCGTT CCTCCACTGG TTGGACGCGG GCACCTCCTG GTGGCGGGCC
CGCAACTGCC TCGAATACTA CATCTGGTTC ATGTCGCGGC TGCGTTTCCA GGCGCGCGCG
CGTTTCATCC CACTCGGCGG CAACACGGTG TTCATTCGGC GAACGGTGCT GGAAAGGCTC
GGTGGGTGGG ACGTCTCCTG CCTCACCGAG GACTGTGACC TCGGGGTGCG GGCTTCGGCC
GCCGGAATCC CGACCGCGGT CTTCTACCAT CCGGATCTGA CGACCCGGGA GGAGACCCCG
GAAAGCCTCA CGAAACTCAT CATCCAGCGC ACCCGCTGGA TGATGGGGTT CATGCAGGTC
CTCTTCAAGG GGGACTGGCG GGCGCTGCCG GGAGCACGTC AGCGCATCAT GGCGGTGGAG
ATGCTCACGA TGCCGTTCTT CCAGGCACTC GCCGGCGTCC TGCTGCCGGT GTCGCTGGTC
CTCACGTTGT TTCTGGCGGC GCCGACCGGT CTGGTCATCG TCTTCTGGCT TCCGTTCGGT
GCGACCGTCA TGACGGTTTT CTCCGAGCAG GCGGCGTTCC GCGAGTTCGC CGAGGCGTAC
GGGCTCGATC TGCGTCGCTG GGACAGCGTG CGGCTCGTGC TCTGCGCCCC GCTGTACCAG
CTCGCGCTCT CCGCCGCCGC GGTGCGCGCC ACGGCCCGGT TGCTGCGTGG CCGGGTCGAG
TGGGAGAAGA CCTCCCACTC CGGGGCCCAC CACAGCACCG GCTCCGGCCG CTTCGAGCTG
GAGGCGGCGT CGTGA
 
Protein sequence
MTSDSDSEVP RARALPRGAR RHAHAHRDPR VPRRSPRLWM VAAAGWWAAG WGAALHTTSA 
LVLSALVGLD GSMSYLLPGA LAVVLPRMFG RGGRGRRIPW LARLACESGG WYVYMFAGSP
DIRLVSYGGP RLGQPFAVVL GAVAAVTGGL VLSGRLRART RGSIPGASRR RRWVWALVSV
PLAVTLGVRA AEHHHVVLLA AWTIVSLLML LVASLTLVRG QYSWQRPERV HHVDLTGNLA
PRNRFSLIVP ARDEPVLGRT LTQILAGDYP GDLVELVVMV SYDEVDQETR RVAEEIAGRH
SNVRVVAPEG SRRSKPLSLE DARRHCTGDL VGVVDAESLL AGGLLRYVNT LALRHADVGI
FQGGVQLMNA RATAWRRAAD HSPIRAFLHW LDAGTSWWRA RNCLEYYIWF MSRLRFQARA
RFIPLGGNTV FIRRTVLERL GGWDVSCLTE DCDLGVRASA AGIPTAVFYH PDLTTREETP
ESLTKLIIQR TRWMMGFMQV LFKGDWRALP GARQRIMAVE MLTMPFFQAL AGVLLPVSLV
LTLFLAAPTG LVIVFWLPFG ATVMTVFSEQ AAFREFAEAY GLDLRRWDSV RLVLCAPLYQ
LALSAAAVRA TARLLRGRVE WEKTSHSGAH HSTGSGRFEL EAAS