Gene Franean1_0383 details


Gene Information

Locus tag: Franean1_0383
Symbol:
ID: 5668807
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 459444
End bp: 460775
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 72%
IMG OID: 641239315
Product: glycosyl transferase family protein
Protein accession: YP_001504755
Protein GI: 158312247
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID:
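
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates and a gene length that includes the stop codon. Neither convention is stated in the record, but both are consistent with its numbers. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(459444, 460775)
print(length)                     # 1332
print(protein_length_aa(length))  # 443
```

Both results match the Gene Length (1332 bp) and Protein Length (443 aa) fields.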


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.138846
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0930893
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAACCTGC TCGACGCCGA CGGCCGCGAC GCGCGCCTGC TCGCCGCGGC CATGATCACC 
TTCGGGATGG TCTACCTGTT CGCGATGCTG GTGCTCTCCC GTGTCCACCG ACCCCGGACG
GGCACGCCCC CCGACGGCCT CTTCTTCGTC TTCGTGATGC CCTGCCTCAA CGAGGAGGCG
GTCATCGAGG CGAGCCTGCG CCGGCTCCTG CTCTCCCCCG CCACGAACCG CCGGGCCCTC
GTCGTCGACG ACGGCTCGGA CGACCGGACC TCGCTGATCG TCCGCGGGGT GGCCGACGAC
CGGGTGTGGC TGCTGCGCCG CGAGCCACCG GACGCCCGAC GGGGCAAGGG TGCCGCGCTG
AACGCCGCCG TGGCCCATCT GGCGACCCGC CCGGAGATCG CCGCCCGCGA CCCCGACGAC
GTGATCATCG CCGTGGTCGA CGCGGATGGC CGTCTCGACC CGCACTCGGT GGAGGCGGTC
GCCCCCTACT TCGCCGATCC CCGCACCGCC GGCGTGCAGA CGGGCGTGCG CATCAACAAC
CGGCACACCA GCCTGCTCGC CCGGCTCCAG GACATGGAGT TCGTGATCTA CACCGATGTC
TTCCAGCGCG GACGCGGCCA ACTGGACAAT GTCGGCCTCG GTGGCAACGG CCAGTTCGTC
CGGCTCTCGG CGCTGCGGTC CCTCGGCGGC GACCCGTGGT CGCACAGCCT GACCGAGGAT
CTCGATCTCG GTGTCCGGCT GCTGCTGACG GGCTGGCGCA ACCAGTTCTG CCCGCAGGCG
GACGTCCACC AGCAGGGGGT CGTCCGGCTG GGACGGCTGC TGCGGCAGCG GTCGCGCTGG
TTCCAGGGCC ACCTGCAGTC GTGGGCGCTG ATGCCCCGCG TGCTGCGGCA GGCGCGGGCC
CGGGCGCTGC CCGACATGCT GTTTCACCTC TCCAGCCCGC TGCTGATCCT CCTCGCGTCG
CTGCTGACGG CCGCCTTCGT GCTCAGCACG GTGGGCGTGC TGACGAGCTG GCTCGCCGGC
GGGCCGGCGC CGGACCCGCG CTACTTCCTC GGCGCCTACC TGATGGCGGC GGGACCGGCG
CTGGTCTGCG CGCTGATCTA CCGGTCACGC GAGCCGCTTG TTGGCTTCGG CGTGGTCCGC
CTAGCCGGCT ATGCGCACCT CTACATGCTG TACGCGCTGG TGTGGTTCGT CGCCGGCTGG
TGGGCGATGG GACGGGTGGT CAGCGGCCGG ACGAGCTGGC ACAAGACCGC CCGAACCCCG
GAGAGCGCGC CACCCACGCC CCTCCAGCCA GTGTCCGCCG CTCCCGCCGC TCCCGAGGGA
CCGGACCGGT GA
 
Protein sequence
MNLLDADGRD ARLLAAAMIT FGMVYLFAML VLSRVHRPRT GTPPDGLFFV FVMPCLNEEA 
VIEASLRRLL LSPATNRRAL VVDDGSDDRT SLIVRGVADD RVWLLRREPP DARRGKGAAL
NAAVAHLATR PEIAARDPDD VIIAVVDADG RLDPHSVEAV APYFADPRTA GVQTGVRINN
RHTSLLARLQ DMEFVIYTDV FQRGRGQLDN VGLGGNGQFV RLSALRSLGG DPWSHSLTED
LDLGVRLLLT GWRNQFCPQA DVHQQGVVRL GRLLRQRSRW FQGHLQSWAL MPRVLRQARA
RALPDMLFHL SSPLLILLAS LLTAAFVLST VGVLTSWLAG GPAPDPRYFL GAYLMAAGPA
LVCALIYRSR EPLVGFGVVR LAGYAHLYML YALVWFVAGW WAMGRVVSGR TSWHKTARTP
ESAPPTPLQP VSAAPAAPEG PDR
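
The protein sequence can be reproduced from the gene sequence using translation table 11, the bacterial, archaeal and plant plastid code. The minimal sketch below is not the database's own pipeline. It builds the codon table from the standard NCBI ordering of bases (TCAG) and reads the initiator codon as methionine, which is why the GTG at the start of this gene yields M rather than V. The name `translate_cds` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in NCBI order (first base varies slowest).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    protein = ["M"]  # the initiator codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, spaces included as in the record.
print(translate_cds("GTGAACCTGC TCGACGCCGA CGGCCGCGAC"))  # MNLLDADGRD
```

The output matches the first ten residues of the protein sequence. Passing the full 1332 bp gene sequence to the same function gives a complete translation that can be compared with the record.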