Gene Franean1_5611 details


Gene Information

Locus tag: Franean1_5611
Symbol:
ID: 5673938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6808950
End bp: 6810140
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 74%
IMG OID: 641244464
Product: glycosyl transferase family protein
Protein accession: YP_001509868
Protein GI: 158317360
COG category: [C] Energy production and conversion
              [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR01426] glycosyltransferase, MGT family
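
The length fields in this record can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 6808950, End bp 6810140.
cds_bp = gene_length_bp(6808950, 6810140)
print(cds_bp)                     # 1191
print(protein_length_aa(cds_bp))  # 396
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.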


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.126524
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000566285
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGCCGTG TTCTCTTCGC CGTGCCGCCG CTGACCGGGC ACGTGAACCC GGCGGTGGGC 
ATCGCCGGCG AGCTGGCGGC CCGCGGGCAG GAGGTGGCCC TGGTCGGCCA CGCGAGCGTC
GTCGGGCCGC TCGTCCCGCC GTCCGTCCCG CTCATCGCGC TGCCAGGGGA GATATCGGCC
GACCAGCGGG CCGAGCTGGA GGCACGGTCC CGGCCGCTGC GCGGGCCCGC GTCACTGAAG
TTCCTGTGGG ACGAGTTCCT GCTGCCGCTG GGCGCCTCGA TGGCGCGGGA CGTCGGTGCC
GTCGTCGAAC GGTGGCGCCC GGATGTGATC GTCGCCGACC AGCAGGCGGT CGGGGTCGCC
ATGGTCGCCC GTCGGCGCGG CATCCGGTGG GCCACGCTCG CCACCACGTC GGCGGAGCTC
GACGACCCCT ACGCCGTGCT CGCCGGGGTC GGGAACTGGG TGTCGGAGCG GCTGCGGGAC
TTCCAGGTCG CGAACGGCGT CCCGGCGGAG GAGGCGGCGC GCGGTGACCT GCGCTTCTCT
GAGGACCTCA CTGTGGTCTG CTCGGTGCCC TCGTTGCTGC GTACTGCCAG TCATCCGTCC
CATCACGTGT TCGTCGGCTG CGCCGCCGGA CTGCGCCGGT CGGCCCCGGA GTTCCCCTGG
GAGTGGCTCG ACCGGGACCG CCGCACCGTG CTCGTCTCGC TCGGCACGGT GACCCGGGAG
GCCGGCGGGC GTTTCCTGCG CGCGGCCGCG GAGGCGCTGG TGGGGATGTC CGACCGGGTG
CAGGCCGTGA TCGTCGCGCC TCCCGGCCCG CTGGACGACC TCGCCGGCCA GGTTCCCGAC
GACCTGCTGG TCCGTCCGTT CGTGCCGCAG GTGGACCTGA TGGCCGGACT GGACGCGATA
GTGTGCCACG CGGGCAACAA CACGGTGTGT GAGGCTTTGT CGCGGGGAGT GCCGCTGGTG
GTCGCGCCGG TTCGTGACGA CCAGCCGATC ATCGGCGAGC AGGTGGTGCG GGCCGGTGCC
GGTGTGCGGG TGCGCTTCGG GCGCTCGACC CCGGTGACGC TGGCCACCGC GATCGGCACC
GTGCTCGACG AGCCGTCCCA CCGGGTCGCG GCGCGGCGGC TGCAGGGCGA GTTCAGCGCG
GCGGGCGGTG TCGTGGCCGC CGCCGACCAC ATTGAGAAGC TGCTGCCGTA G
 
Protein sequence
MGRVLFAVPP LTGHVNPAVG IAGELAARGQ EVALVGHASV VGPLVPPSVP LIALPGEISA 
DQRAELEARS RPLRGPASLK FLWDEFLLPL GASMARDVGA VVERWRPDVI VADQQAVGVA
MVARRRGIRW ATLATTSAEL DDPYAVLAGV GNWVSERLRD FQVANGVPAE EAARGDLRFS
EDLTVVCSVP SLLRTASHPS HHVFVGCAAG LRRSAPEFPW EWLDRDRRTV LVSLGTVTRE
AGGRFLRAAA EALVGMSDRV QAVIVAPPGP LDDLAGQVPD DLLVRPFVPQ VDLMAGLDAI
VCHAGNNTVC EALSRGVPLV VAPVRDDQPI IGEQVVRAGA GVRVRFGRST PVTLATAIGT
VLDEPSHRVA ARRLQGEFSA AGGVVAAADH IEKLLP
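
The GC content field can in principle be recomputed from the gene sequence listed above. The sketch below is an illustration only. It assumes that GC content means the percentage of G and C among the A, C, G, and T bases, and that the spaces and line breaks in the listing are layout rather than sequence. The function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G, or T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First 10-base block of the gene sequence in this record.
first_block = "ATGGGCCGTG"
print(round(gc_percent(first_block), 1))  # 70.0
```

Passing the full gene sequence as one string, with its spacing left in place, would give the value to compare against the GC content field.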