Gene Franean1_2160 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2160 
Symbol 
ID5670560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2590153 
End bp2591706 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content71% 
IMG OID641241081 
Productglycosyl transferase group 1 
Protein accessionYP_001506502 
Protein GI158313994 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCAGACG ACTCCCGCCC ATCGCCCGTC GCCCCGGCGC GGCCATGGCC GTCCACGACC 
AGGTCGGCGC CAACGCCGTC GCGGCAGGTC AGCAGCGGCC CCCCCAGTGG ATGGCCACGC
ATCCTGCTCG TGACGCACTA TTTCCCCCCC GAGGTCGGGG CACCGCAGGC CCGGCTCTCG
GAGACGGCGC GGGCCTGGGC ACAGGCCGGC GCGGATGTCA CCGTGCTGAC CGGCATGCCC
AACCATCCGA CCGGCATCGT GCCACCGTCC TACCGCGGCG CGGCCCGGCG AGTGGAGCAC
AGCGACGGCT ACCGGATAGT GCGGACCTGG CTTTATGCGA CCCCGAACGA AGGCGTGCTG
CGCAAAACGA TCGGCCACAT CTCTTTCACA CTCAGCTCGG TACTGCTGGG CGGCCGGCTC
GCCGGGCCGG CTGACGTCGT CGTCGTCTCC TCGCCCACGT TCTTCCCCCT GGGCTCGGCG
TGGTGGCTGG CCCGCCGATG GCGGGCCCGG CTAGTCGTCG AGGTACGGGA CCTGTGGCCG
GCGATCTTCA CTCAGCTCGG AGTGATCAAG AACCGCCGCG TCATCGCCGC GCTGGAACGA
CTGGAGCTGG CCGCATACCG GGCCGCGGAC GCGGTTGTCA CCGTAACCGA CGGATTCCGG
GACGACATCG TGCGCCGCGG CATCCCGGCG GAAAAAGTAC ACGTCATTCC CAACGGCGTG
GACCTCGACC GCTTCCAGCC GGGCGAACCG GCATCCGCCG AGGTACGGGC GAGGCTGGGA
GCCGGCCCGG ACGACATTCT CGTGCTGTAC GTCGGCGCAC ACGGCATCTC GCAGGGGCTC
ACCTCGATCG CGGACGCCGC GGCCCGACTG GCCGAGAAGG CTCCGGCGAT CCGATTCGCC
TTCGTAGGCG AGGGAGCCGA CAAGCAGCGG CTGACCGACC ATGTCGGGCA GCTCGGCCTG
ACAAACACCA CCCTGGCGCC GGCGGTCCCT CGCGCGGACA TGGCCACGCT TCTCGCCTCC
GCCGACATCT GCCTGGTCCC GCTGCGGGAC GTGCCGTTGT TCGACACCTT CATCCCGTCG
AAGATGTTCG AGCTGCTGGC GGCGGGGCGC CCGGTGATCG GCTCGGTGCG CGGCGAGGCG
GCCCGCATAC TCGCCGAGGC CGGCGCGGTC GTGGTGCCTC CTGAAGACCC TGACGCGCTC
GCCGAGGCAG TGTTGGATGC GGCAACCGAT CCGGGGCGGG ACGTCGACAT GGGCCGCACG
GCCCGTCAGT ACGTCGCACA ACACTTTGAC CGGTCGATGC TGGCCCAGCG CTACCACGAC
CTGCTCCTGG GACTCCTGAC CGGGCGGCCG GGCGACGCGG TGTCCAGGGA TGAGGCGGCC
CTGGGCCCAG GGTCGCCGCG AGACCAGGGG GCGCGCAACG AACCGGGTCA GGAGCGCGTG
GTCGCTCCGA TGCCGCCGTT GGAGGAGCAG TCACCGGTTC CCAGTCCCCG CCCCGCTCGC
CCCGGACCGA CGGACCCACA TCTCGCGATC GACACCCGAG GGAGATCGGC ATGA
 
Protein sequence
MPDDSRPSPV APARPWPSTT RSAPTPSRQV SSGPPSGWPR ILLVTHYFPP EVGAPQARLS 
ETARAWAQAG ADVTVLTGMP NHPTGIVPPS YRGAARRVEH SDGYRIVRTW LYATPNEGVL
RKTIGHISFT LSSVLLGGRL AGPADVVVVS SPTFFPLGSA WWLARRWRAR LVVEVRDLWP
AIFTQLGVIK NRRVIAALER LELAAYRAAD AVVTVTDGFR DDIVRRGIPA EKVHVIPNGV
DLDRFQPGEP ASAEVRARLG AGPDDILVLY VGAHGISQGL TSIADAAARL AEKAPAIRFA
FVGEGADKQR LTDHVGQLGL TNTTLAPAVP RADMATLLAS ADICLVPLRD VPLFDTFIPS
KMFELLAAGR PVIGSVRGEA ARILAEAGAV VVPPEDPDAL AEAVLDAATD PGRDVDMGRT
ARQYVAQHFD RSMLAQRYHD LLLGLLTGRP GDAVSRDEAA LGPGSPRDQG ARNEPGQERV
VAPMPPLEEQ SPVPSPRPAR PGPTDPHLAI DTRGRSA