Gene Franean1_6665 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6665 
Symbol 
ID5674980 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8095408 
End bp8096709 
Gene Length1302 bp 
Protein Length433 aa 
Translation table11 
GC content75% 
IMG OID641245516 
Productglycosyl transferase group 1 
Protein accessionYP_001510908 
Protein GI158318400 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.143503 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.818272 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGTGA ACGGGGGCGA CGGTGAAGCC CCGGTGAGCG GGGGCACGGC AGGCGGGAGC 
TCGGCGGGCG GCCCGCTCTC AGTGGCGTTG CTGACCTATT CCACCCGGCC CCGCGGCGGG
GTGGTGGCCA CCCTCGCGCT GGCCGAGGCG CTGGCGCGCG CCGGGCACCG CGTCAGCCTG
TGGACGCTGG CCCGCGGCGG GGACGCCGGC TTCTTCCGCC CGGTCGACCC GGCGGTCGAG
GTGGTGGCGG TGCCGTTCCC GGAGGTCGCC GACGAGACGG TCGGGAAGCG CATTATACGC
TCGATCGCCA TCCTCCGGGA TGCCTTAGAG GCGTTTCCCG GCGGGTACGA CATCGTGCAC
GCCCAGGACT GCATCGCCGC GAACGCCGTG GCCGACTGCG TCCGCACTGT CCACCACCTG
GACACCTTCA CCACCCCCGA GCTCGTCGCC TGCCATGAGC GGGCGCTGCG CCGGCCGTAC
GCGCACGTGT GCGTGTCGGC GGCCGTCGCG GTCGAGCTGG CCGCCGGCTG GGGGATCACC
GCGACGGTGA TCCCGAACGG CGTCGACGCG GCCCGCTTCA CCGCGGCTGC CGGCCCGGAG
GCTCCCGCCC GCGAGGCCCG CGGGCGCTGG CGTGCCCGGC TCGGCCGGTA CGTGCTCGCC
GTCGGCGGGA TCGAGCCGCG TAAGGGGACC GCCGACCTGG TCGAGGCGTT CGCGCTGCTG
CGGGAGCGGG TGACGCCGGT TTCCCTCGTC GTCGCCGGTG GGGAGACCCT GTTCGACTAC
CGCGGGTACC GCGAGCAGGT GCTGGGCCGG GCCGCGCAAC TCGGCGTCGA GCCCGTCATC
CTCGGGCCGG TGGCCCACGA GGAGCTGCCC GCCCTGGTCG CGGCGGCGGA CGTCTTCGCG
TTCCCGTCCG CGAAGGAGGG GTTCGGGCTG GCCGCGCTGG AGGCGCTGGC CGCGGGTGTC
CCGGTCGTCA CCCGTGACCT GCCGGTGCTA CGCGAGGTGC TGGCCGCGGC CGGGGACGCG
GTGTGCTTCG CCTCGACGCC GACCGAGTTC GCGGCCGCGC TGGAGGCGTT CCTCGACTCC
GCGCAACGGC GGCCGGCCGC GGGCGCCGAG CAGCCGCAGG CGGGAGCGGG CGCCGGACAT
CCGCGGGTGG CGGCCGGGCG GGCAGTCGCG CGAGGGTACA GCTGGGCGAC CGCGGCCGAC
CGGCATGTCG CCCTTTACCG CGAACTGATC AGAACACAGG TCACCGGACG GCCGACGCGC
CGAATCAGCG TCACCACAGA TAAATCATCC ATCTCATTGT AA
 
Protein sequence
MTVNGGDGEA PVSGGTAGGS SAGGPLSVAL LTYSTRPRGG VVATLALAEA LARAGHRVSL 
WTLARGGDAG FFRPVDPAVE VVAVPFPEVA DETVGKRIIR SIAILRDALE AFPGGYDIVH
AQDCIAANAV ADCVRTVHHL DTFTTPELVA CHERALRRPY AHVCVSAAVA VELAAGWGIT
ATVIPNGVDA ARFTAAAGPE APAREARGRW RARLGRYVLA VGGIEPRKGT ADLVEAFALL
RERVTPVSLV VAGGETLFDY RGYREQVLGR AAQLGVEPVI LGPVAHEELP ALVAAADVFA
FPSAKEGFGL AALEALAAGV PVVTRDLPVL REVLAAAGDA VCFASTPTEF AAALEAFLDS
AQRRPAAGAE QPQAGAGAGH PRVAAGRAVA RGYSWATAAD RHVALYRELI RTQVTGRPTR
RISVTTDKSS ISL