Gene Franean1_6548 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6548 
Symbol 
ID5674863 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7964128 
End bp7965345 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content73% 
IMG OID641245397 
Productglycosyl transferase group 1 
Protein accessionYP_001510791 
Protein GI158318283 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.252141 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCTCG GGGACGGGAA CGCGGAGGGC GGACGTGCGG GGTCGCTGCG GGTCCTCTAT 
TCCATCTCCC ACCCGCTCGG CCGGCCCGGC ATCGCCACCA CGGCTCTGTA CCAGGTCAGA
GGCCTGATCG CGGCCGGTTT CCAGGTGACC GTGTACTGCA CCAGCCTTGC GATCCCGGTA
CCCGCCACCC ACCGGGTCGT GCAGACCATG GTCGCCCGAG GCGTGCGGAT CCCGAACCGG
GCGGTCGGAG TGCGGCGGGC GTACGCCTAC CACGACTGGT GTTTGGCCAG GGAGCTCGCC
CGGCGCCCGC ACACCTACGA TGTCGTCCAC GCCTGGCCAC GGGGGTGCGT CCGTACCCTG
CGCACCGCCC GCCGTATCGG GTTGCCCGCC TTCCGCGAGG TCTGCAGCCC GCACAGCCGG
GCCGCATTCG ACCTCGCCGG CCGGGAGGCG GCCGCGACGG GGGTCCGGCT GCCGCGACGG
CACGCGCAGC GGGGCCGGTC CTGGCGGCTG CGGCTGGAGG AGGCGGAATA CGCGGCGGCG
TCCTGGCTGC TGTGCCCGTC TGACCATGTC GTACGGACCT TCATCGAACA CGGTGTGGAC
GCCGGCCGCC TCGTCCGCCA CCAGTACGGT TTCGACCCCG ACCGCTTCCG GCCGGCGCTT
GGTCCGCGGC CCGCGGATCG GCCGTTCACC GTCGCCTTCG CCGGGCGGGG CGAACCGAAC
AAGGGGCTGC ACTACGCCCT GCGGGCCTGG CGGGACGCCG GAAGCCCGGG AACCTTCCTG
ATCTGCGGTG TGATCATGCC TGATTACCGG GCCCGCCTCG GCGAGCTGCT CGGGCTGCCC
GGGGTCCGGG AACTCGGCTT CGTCAGTGAC CTGGACCGGG TGCTGCGCGA GTCCGACGCG
CTCGTGCTGC CCAGCGTGAG TGAAGGCAGC GCGCTGGTCA CGCTGGAAGC GGCGGGCTCG
GGGTGCATCC CGGTGGTCTC CGACGCCTGT GGATCGCCTA CCCGGCATCT CATCGACGGA
CTTGTGCACC GTGCGACCGA CACTGCCGAG CTGACCCGTC ACCTGCGACT GCTCGCAGAG
GAACCGACCA CTCGCGCCCG GCTGCGGGCT GCCTGCATCG CCGGCCGGGA TGCTCACACC
TGGGCGCGGG CCGGGGAACG GCTCGCGGAG ATCTACCAGG CCGCGGTGCG TGCCCGCCCC
GGAGGCGCTC CCGGCTGA
 
Protein sequence
MRLGDGNAEG GRAGSLRVLY SISHPLGRPG IATTALYQVR GLIAAGFQVT VYCTSLAIPV 
PATHRVVQTM VARGVRIPNR AVGVRRAYAY HDWCLARELA RRPHTYDVVH AWPRGCVRTL
RTARRIGLPA FREVCSPHSR AAFDLAGREA AATGVRLPRR HAQRGRSWRL RLEEAEYAAA
SWLLCPSDHV VRTFIEHGVD AGRLVRHQYG FDPDRFRPAL GPRPADRPFT VAFAGRGEPN
KGLHYALRAW RDAGSPGTFL ICGVIMPDYR ARLGELLGLP GVRELGFVSD LDRVLRESDA
LVLPSVSEGS ALVTLEAAGS GCIPVVSDAC GSPTRHLIDG LVHRATDTAE LTRHLRLLAE
EPTTRARLRA ACIAGRDAHT WARAGERLAE IYQAAVRARP GGAPG