Gene Franean1_5300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5300 
Symbol 
ID5673634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6376412 
End bp6378109 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content77% 
IMG OID641244157 
Productglycosyl transferase group 1 
Protein accessionYP_001509564 
Protein GI158317056 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.225407 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.649548 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCACAG CCGCGCCCCG GCCCGTTCAC GCCCTCACCG GGCGTCACCT GGTCTTCCTC 
AACTGGCGCG ACAACGCCCA TCCGCAGGCC GGCGGGGCGG AGCTGTTCTG CCACTCGGTC
GCGGAGCGGT TCGCCGCTGC CGGCGTCCGC GTCACCCTGC TCACCTCCCG CCCGCCGGGC
GCCGCCGCCG CGACCACGGA CGGCGGGGTC GCCGTGCGCC GCGGCGGGGG CACCTTCGGG
GTGTACCCGT CGGTGCTCGC CCGGCTGGCG CGGATGGTCC GCTCCGGGGA GCGGGTCGAC
GCCGTCGTCG ACTGCCAGAA CGGCATCCCG TTCTTCAGCC CGCTCGTGCT GCCGAGCCGG
ATTCCGGTGG TGCAGGTGCT GCACCACGTC CACCAGAAGC AGTTCCCGCT GTACTTCCCG
CGGCCGGTGG CGCGGATCGG CCAGCTACTC GAGACCCCGG GCAGCCGGTG GGTCTACGGC
CGCCGGCCGG TGGCCGTGGT CTCGCCGTCC ACCAGGGACG AGGCCCGTGG GGTGCTGGCG
CTGCCCGGGG CCCGGTTCCT CGTCCCGAAC GGCGTCACCA TCGCCGGCGG TGATGGTGAC
GCCGGCGGCG ATGCCGTCGC CTCCGGCGGC GCGGGCGGGA CGGACGGCCC GTTCGGTGCC
GATGGCGCCA TCGGGGCGAG GGCGGCGGCG CCCACGATCG TGTGTGTGGG CCGGCTCGTC
CCGCACAAGC GCCTGCACCT GCTGATCGAG GCGCTGCCCG TGCTGGTCGG GCGGCACCCG
GGCCTCAGCC TGCACCTCGT CGGCGACGGG CCGGACCGCC GCCGCCTCGC CGACACCGCC
GCCCGGTTGA TGCTCACACA GGGCGACGGC TCGTCGGACG CGACCGTGCG CTGGCACGGA
TTCGCGGCTC CCGAGGTCCG CGACGCCGTG CTGGCGTCGG CCTGGCTGAC GGTGAACCCC
TCCCACGGCG AAGGATGGGG CCTGTCGGTA CTCGAGGCGA ACGGGATGGG GGTACCGGCG
GTCGCGTTCC GGGTCCCGGG ACTGCGCGAC TCCGTCCGCG ACGGGGTGAC CGGCTGGCTG
GTGGACGAGC CCGGGCAGCT CTCCGACGCG GTCGACCGCG CGTTGACCCT GCTCGCCGAC
CCGGCGCGGG CCGGGGAGAT CCGCGCGGCC GCGCGGGCGT GGGCCGGCGG CTTCAGCTGG
GACACCAGCG CCGATCTGCT GGCCGCCGTC ATCGGCTCCG AGCTCGACCG GCTCGCCGGG
GCCGGCGCGG GCGGCCGCTC GGCCGTGGGC CCGGGCGAGC CTCCGGCCCG GCACGCGCCA
GCACGCCCAC TCGCCACCCG CCCGCCGCGG GACCGGCGGC GCCGCGACGA CCAGGCGACC
TGGGTCGAGT TCGATCTGGC GCCGGGTGCG GAGGTTCCCG TGCTGCGCCG GACCGATCTG
GTCTTCGAGG TCGCCCCGGA ATCCGGTGCG GCCGGCCCGG AGCCGGGGCT GCCGGGCCCG
GGGCGCCGGT TCGTCGCGCT GTTCTACGGG GCCGACAGCA CCGGGGCCCG CACCGCGCTG
GCCCGTCGCG GGCTGCGGCC GGCGCAGCGG CCCCGCGCGG CCACCGGCGA GGACCTGCTC
CTCGCCGCCA CGCACGCCGG CCCGAACCAC GCCAGCGCCA GCGCCAGCGG CGTACGGCTA
CGGGACATGG CCGGCTGA
 
Protein sequence
MTTAAPRPVH ALTGRHLVFL NWRDNAHPQA GGAELFCHSV AERFAAAGVR VTLLTSRPPG 
AAAATTDGGV AVRRGGGTFG VYPSVLARLA RMVRSGERVD AVVDCQNGIP FFSPLVLPSR
IPVVQVLHHV HQKQFPLYFP RPVARIGQLL ETPGSRWVYG RRPVAVVSPS TRDEARGVLA
LPGARFLVPN GVTIAGGDGD AGGDAVASGG AGGTDGPFGA DGAIGARAAA PTIVCVGRLV
PHKRLHLLIE ALPVLVGRHP GLSLHLVGDG PDRRRLADTA ARLMLTQGDG SSDATVRWHG
FAAPEVRDAV LASAWLTVNP SHGEGWGLSV LEANGMGVPA VAFRVPGLRD SVRDGVTGWL
VDEPGQLSDA VDRALTLLAD PARAGEIRAA ARAWAGGFSW DTSADLLAAV IGSELDRLAG
AGAGGRSAVG PGEPPARHAP ARPLATRPPR DRRRRDDQAT WVEFDLAPGA EVPVLRRTDL
VFEVAPESGA AGPEPGLPGP GRRFVALFYG ADSTGARTAL ARRGLRPAQR PRAATGEDLL
LAATHAGPNH ASASASGVRL RDMAG