Gene Franean1_6549 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6549 
Symbol 
ID5674864 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7965342 
End bp7966625 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content71% 
IMG OID641245398 
Productglycosyl transferase group 1 
Protein accessionYP_001510792 
Protein GI158318284 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.245344 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACACCGC TACTGACACC ACTGGCGACG GACCTCGACG GCCACGGCCA CGGCCCGGCG 
GGCCCGCCGG CGAACCGACC GATGGCGCAG CCGCTCCGGG TCCTCTACAG CTTCCCGCAT
CCGCTCGGTC TACCCGGGAT CGGCACGACT GCCTACCACC AGGTCATCTC GCTGTGGCGA
CAGGGCATCG AGGTGCAGGT CTACTGCACG AGCGTCGCTC GTCCGCTCCC CCCTGGCCTG
CCGGTGCGCC AGACGATGGC GCTCGGGGGG CAACGCCTGC CGCCCAGAGC GGTCGGAGTG
AAACGCGCCA GGTACTGGCA CGACCGGGTG GTGGCGACCG CACTGGCCCG GGAGTACTTC
GATGTCGCGC ATGTCTGGCC AGGTGCTGCC GTTCACACGC TGCGCGCATG TCGACGACTG
GGCATCCCGG GACTGCGCGA GGCCCCCAAC ACCCACACCG CCCATGCCTG TGACGTTGTC
GCTCGGGAGA CGGCACGCCT GGGTCTGACC ATGCAAAGGA ACTCCAGCCA TGCGCCGAAC
CCGCGTTCGC TGCGGCTGGA AGACGCCGAG TACGGCGCGG CGACCGCGCT GCTGGTCCCG
TCCGATGTCG CCGCCGAGAC GTTCGTGGGA CGAGGTATGC CGGCCGGCCG GCTCGTCCGG
CATCGGTACG GGTTCGATCC GCGCACATTT CCGGCGCCCC GGGCCGAGGA GATGGAAAGA
CCAGGCACCC GGCCACTGCA TGTGGTCTTC GTGGGCCGGT GTGAGCCACG CAAGGGACTG
CATCTACTGC TCGAGGCCTG GCGGAGGTCG GGTCTCGCGG GACGGGCCCG CCTGACGATC
TGCGGGTCGT TCTGGTCCTC GTACCGAGCC CTGCTCGCTC CCGCACTGGC CCAGCCCGGT
GTCGAAACAC CCGGTTTCGT AACGGACGTG CCGGGCCTGC TGCGGTCCGC CGACGTGCTG
GCCCTTCCCT CCCTGGAAGA GGGCAGCGCG CTGGTCACCT ACGAGGCCCA GGCGAGCGGG
TGCGCGCTGC TGGTTTCCCG GCAGTCCGGT GCCGTCCTCA CCCATGGTGA GCAGGGGCTG
CTGCACGAGG CGGGCGATGT CGACACGCTG GCCGCGCACC TGCGGCAGCT GGAACACGAC
CGGAGCCTGC TCGAACGGCT ACGCTGCCGA GCCCTGGCCG CGCGGAAGTC GCTGACCTGG
AACCACGCCG GAACCATTCT CCACGCGGCC TATGAGCGGT CCAGGGCGGC CGCGGTAGAC
GGCGCGGGGA CGGACGCGGC ATGA
 
Protein sequence
MTPLLTPLAT DLDGHGHGPA GPPANRPMAQ PLRVLYSFPH PLGLPGIGTT AYHQVISLWR 
QGIEVQVYCT SVARPLPPGL PVRQTMALGG QRLPPRAVGV KRARYWHDRV VATALAREYF
DVAHVWPGAA VHTLRACRRL GIPGLREAPN THTAHACDVV ARETARLGLT MQRNSSHAPN
PRSLRLEDAE YGAATALLVP SDVAAETFVG RGMPAGRLVR HRYGFDPRTF PAPRAEEMER
PGTRPLHVVF VGRCEPRKGL HLLLEAWRRS GLAGRARLTI CGSFWSSYRA LLAPALAQPG
VETPGFVTDV PGLLRSADVL ALPSLEEGSA LVTYEAQASG CALLVSRQSG AVLTHGEQGL
LHEAGDVDTL AAHLRQLEHD RSLLERLRCR ALAARKSLTW NHAGTILHAA YERSRAAAVD
GAGTDAA