Gene Franean1_6162 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6162 
Symbol 
ID5674483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7495980 
End bp7497284 
Gene Length1305 bp 
Protein Length434 aa 
Translation table11 
GC content74% 
IMG OID641245014 
Productglycosyl transferase group 1 
Protein accessionYP_001510412 
Protein GI158317904 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.189514 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCATCTGG TCGGAGGCGC GCCGCCGGAA CATCCCGAAC CAGCCCGGCC GAAGCCAGCC 
TCGCGGGTCG CGATGCTGTC GATGCACACG TCGCCCCTGG AGCAGCCCGG CACCGGCGAC
GCGGGCGGGA TGAACGTCTA CGTCATCGAG CTCGCCCGGC AGCTGGCTGC GCTCGGCACC
GAGGTGGAGG TCTTCACCCG GGCCGTGAGC AGCCGGCTCC CACCCGCCCT GGAGATCGCC
CCCGGGGTCA TCGTCCGGCA TGTGCCCGCC GGCCCGTTCG AGGACATCGG CCGCGAGGAG
CTGCCCGCCT GGCTGTGCGC CTTCACGGCG GACGTCCTGC GCACCGAGGC CGGCCACGCA
GCGGGCTGGT TCGACGTCGT CCACTCGCAC TACTGGCTGT CCGGGCAGGT CGGGCTGTCG
GCCGCCCGCC GGTGGGGCGT GCCTCTGGTG CACACGGCCC ACACCCTGGC GCGGGTCAAG
AACGCCTCTC TCGCCGACGG CGACCGCCCC GAGCCGGAAC CGCGCGTCCA GGGCGAGCAG
GAGATCATCA AGGCGGCGAC GCGGCTCATC GCGTCGACCG ACACCGAGCG CCGCCACCTC
ACCCAGCTCT ACGGCGCGGC GCCGGGCAAG GTGGACGTGG TCGCCCCCGG CGTCGACCTT
GACGTCTTCC GCCCCGGCGA CCCCCGGGCC GCCCGCAAGC GGGTCGGCCT CGACCCCGAC
ACCCAGCTCC TCCTCTTCGT CGGACGCATC CAGCCGCTCA AGGCACCGGA CGTCCTGCTC
GCCGCCGCGG CCGAACTCAT CCACCGCGAC CCCGACCGCC GCGGACAGCT GGCGGTCGTC
GTCGTCGGCG GCCCGAGCGG CTCCGGGCTC GAACGCCCGG ACTCTCTGGT CAAGCTCGCC
GCCGAACTCG GCATCACCGA CATCGTCCGT TTCCAGCCGC CGGTACCCCA GGAGCAGCTC
GCCCACTGGT ACCGCGCGGC GACAGCGGTC GTCGTCCCCA GCCACAGCGA GAGCTTCGGC
CTGGTCGCGG TCGAGGCCCA GGCCTGCGGC ACCCCCGTGG TGGCCGCCTC CGTCGGCGGC
CTGCGCACCG CCGTCGCCCA CGGGACGTCC GGAGTCCTCG TCCACGGCTG GGAGCCCGCC
GACTACGCCG ACGCCCTGGA ACGCATCCTC ACCGAGGAAC GCTGGCGCCG GCACCTGTCG
ACAGGCGCCC GCCTGCGCGC CGCGAGCTTC GGCTGGACAG CGACCGCGAA AGGCGTCCTC
GCGAGCTACC AGGCGGCGAT CTCACCAGCG GCCGTCGCCG TCTGA
 
Protein sequence
MHLVGGAPPE HPEPARPKPA SRVAMLSMHT SPLEQPGTGD AGGMNVYVIE LARQLAALGT 
EVEVFTRAVS SRLPPALEIA PGVIVRHVPA GPFEDIGREE LPAWLCAFTA DVLRTEAGHA
AGWFDVVHSH YWLSGQVGLS AARRWGVPLV HTAHTLARVK NASLADGDRP EPEPRVQGEQ
EIIKAATRLI ASTDTERRHL TQLYGAAPGK VDVVAPGVDL DVFRPGDPRA ARKRVGLDPD
TQLLLFVGRI QPLKAPDVLL AAAAELIHRD PDRRGQLAVV VVGGPSGSGL ERPDSLVKLA
AELGITDIVR FQPPVPQEQL AHWYRAATAV VVPSHSESFG LVAVEAQACG TPVVAASVGG
LRTAVAHGTS GVLVHGWEPA DYADALERIL TEERWRRHLS TGARLRAASF GWTATAKGVL
ASYQAAISPA AVAV