Gene Francci3_3949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3949 
Symbol 
ID3906908 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4728330 
End bp4729445 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content70% 
IMG OID637881276 
Productglycosyl transferase, group 1 
Protein accessionYP_483028 
Protein GI86742628 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAGGTCC TGGTGCCTTC CCGGGAGAAC CTGGATATCG CCGTGTACGA GGAGGCCGTG 
CCGGGTTCGG CCGTCATCCG GCTGCCTCGT GACGACAGCC CACGGGCACA CCTGACGGCC
CGGCCATTCA TGCACGCATC CCGGCCGGTC GGTCCGTTGC GCCGCGCGCT GGAGTCGACA
CCGCCACGGG CCGACGCCGT GATCAGTTAC AGCTGCCGGA CGTCGCATCT CGGCGAGGAG
GTCGCCCGGA TCTGGAAGCT GCCGCATCTG GTGCGCGCGC ACAACGTCGA CTCGGAGTTC
TTCCGGGTGC TGGCCCGCAA CAGCTCGGGG CCGCGCGCCG TCGCCTACGA GCTGGAGTAT
CACAAACTGC GGCTGGCGGA GCTTTCGATG CACCACTCTC CGCTCATCAC CTGCATCGCG
GACATCTCGC TGGAAGACCA CAACTGGCGG CGTACGCGGG CGAGCATCCC AACGTTCCAC
CTGCCGCCGT TCCTGCCCGT CGGCGCCCTC ACCGCACCGA CGGGCACCGG GTCGGCCGGT
ACCGAGCCGG CCCCGCCGGG CCGGAAGCCG GCGCCTCGGG ACAGCGCCGG CAAAAAGCTG
ATCTTTGTCG GCTCGCTGGA CACCCCGACC AACATCGAAG CCCTGCGCTG GTTCCTCGGC
GGCTGCTGGC CGACGATCCG GGTCCGCCAT CCCGACGCGA CCTTCCAGGT GGTGGGGCGG
CGTCCGGAGG AGGGCCTGGC GGCATGGCTC GGTGGCTTCG AGGGCGTGGA GTTGCACACC
GACGTGCCGA GCGTCGCCGG TTACCTCGCC ACCGCCACCC TGTCGGTGAA CCCGATGCGT
TCGGGATCCG GCGTGAACAT CAAGGCGATC GAGGCAATGG CGGCGGGAAC GCCGGTCGTG
AGCACACCCA CCGGCAGCCG CGGCCTCGGC TGGACGCCCG GTACCCACCT CCTCGTCGCC
GAGGACGCGG GCGCCTTCGC CGACGCCGCC TGCAGCCTGC TGGAACAGCC CTGGAAGGCG
GCCGAGATCG GAACAGCCGG CCAGGCCTAC GTGCTACAGG AACTCGACCA CACGGCGCTC
ATCGGTCGCA TCCAGGAACA TCTGACGCCA GCCTGA
 
Protein sequence
MQVLVPSREN LDIAVYEEAV PGSAVIRLPR DDSPRAHLTA RPFMHASRPV GPLRRALEST 
PPRADAVISY SCRTSHLGEE VARIWKLPHL VRAHNVDSEF FRVLARNSSG PRAVAYELEY
HKLRLAELSM HHSPLITCIA DISLEDHNWR RTRASIPTFH LPPFLPVGAL TAPTGTGSAG
TEPAPPGRKP APRDSAGKKL IFVGSLDTPT NIEALRWFLG GCWPTIRVRH PDATFQVVGR
RPEEGLAAWL GGFEGVELHT DVPSVAGYLA TATLSVNPMR SGSGVNIKAI EAMAAGTPVV
STPTGSRGLG WTPGTHLLVA EDAGAFADAA CSLLEQPWKA AEIGTAGQAY VLQELDHTAL
IGRIQEHLTP A