Gene Francci3_0010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0010 
Symbol 
ID3902957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp13122 
End bp14525 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content73% 
IMG OID637877340 
Productglycosyl transferase, group 1 
Protein accessionYP_479133 
Protein GI86738733 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.785418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCGGATTG CGTTGTTGTC GTACCGGAGT CTGCCCACCT GCGGTGGCCA GGGTGTGTAC 
GTCCGACATC TGTCCCGCGA ACTGGTCGCC CTGGGGCACC GCGTCCAGGT GGTGAGCGGG
CCGCCGTACC CGGTGCTTGA GGAGGGTGTC GGCCTCACCG AACTGCCCAG CCTTGATCTC
TACCGGGACT CCGACCCGTT CCGGTGGCCC GGACTGGCCG AGTTCCACGG CCTGCCGGAC
GTCGTCGAGT TCGCCATGAT GCGGACCGGC CAGTTCTCCG AGCCGCTCGC CTTCAGCCTG
CGTGCCTGCC AAGCCCTGCG CCCCAGGGCA TCCGGGGGTC GGCCGCCGTT CGACATCGTC
CATGACAACC AGGGGCTGGG ATACGGCCTG CTGGCGCTGC GGGCAGCCCT GCGCCCGTAC
CGGATCCCGG TCGTGGGCAC CGTGCATCAT CCGATCACGG TCGATCGCCG CCTGCACCTC
GCCGCGGCGA CCACCTTCAC CTCCCGGCTG GGCCTGCGCC GGTGGTACTC GTTCCTCCCG
ATGCAGGCCC GGGTCGCGCG GGGGCTGGAT GGCATCGTCA TCCCGTCGGA GAGCTCCCGG
CGAGAGATCA TCGCGGACAT GAACCTCCCC CCGACGGTCA TGCGCACGGT TCCCTTGGGA
GTCGACGCCG ACGTCTTCAC GCCGGCGCCC GCGGGCAATC CGGCGGTTCC GGGCCGTGTC
GTCGTCGTGA CCAGCGCCGA TGTTCCGCTT AAGGGCCTGC TTGTCCTGCT CGAGGCGCTC
GCGAAGCTGC GGGTGGATCG CTCGGCGCAC CTGGTCTGCG TCGGCAAGGT CCGCGAAGGG
GGAACGGCCC AGCGCCAGGT TGCCGAGCTG GGCCTGGCCG ACGCCGTGAC GTTCCGTTCC
AACATGCCGG AACCGGAGCT GGTGGACCTG TTGCGCTCCG CCGAGGTCGC GGTCGTTCCC
TCGCTTTACG AGGGGTTCAG CCTGCCCGCC GTCGAGGAGA TGGCCTGCGG GATCCCGCTG
GTCGCCACCA CCGCCGGGGC GTTGCCCGAG GTCGCCGGTC CGGACGGGGA GGCCGCGTTG
CTGGTCCCAC CGGGGGATGC GGGAGCCCTG GCGGACGCCA TCGGTTCCCT GCTCGATGAT
CCCGAACGAC GGGCCCGAAT GGGTGCCGCC GGGCGCCGCC GGGTGGAGGC GCGGTTCTCC
TGGCGGGCGG CCGCCGCGGC CACCGCGGAC TGGTACGCCG AGCGGATCGC GGCGGTCGGC
GGGACGCCCA CCTCCCCGGT CCCGGCCCCG GGACCGGGAC CGGGACCGGC GGCCCAGTGG
ACGCCGGCAC CGCTGTCCAC ACCCGGCGCC GTCACACCCG GCGCCGCGGC GTCCGCACCG
GCGTCGTCGA CGCTGACCGG CTGA
 
Protein sequence
MRIALLSYRS LPTCGGQGVY VRHLSRELVA LGHRVQVVSG PPYPVLEEGV GLTELPSLDL 
YRDSDPFRWP GLAEFHGLPD VVEFAMMRTG QFSEPLAFSL RACQALRPRA SGGRPPFDIV
HDNQGLGYGL LALRAALRPY RIPVVGTVHH PITVDRRLHL AAATTFTSRL GLRRWYSFLP
MQARVARGLD GIVIPSESSR REIIADMNLP PTVMRTVPLG VDADVFTPAP AGNPAVPGRV
VVVTSADVPL KGLLVLLEAL AKLRVDRSAH LVCVGKVREG GTAQRQVAEL GLADAVTFRS
NMPEPELVDL LRSAEVAVVP SLYEGFSLPA VEEMACGIPL VATTAGALPE VAGPDGEAAL
LVPPGDAGAL ADAIGSLLDD PERRARMGAA GRRRVEARFS WRAAAAATAD WYAERIAAVG
GTPTSPVPAP GPGPGPAAQW TPAPLSTPGA VTPGAAASAP ASSTLTG