Gene Franean1_5894 details


Gene Information

Locus tag: Franean1_5894
Symbol:
ID: 5674216
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7157747
End bp: 7158895
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 79%
IMG OID: 641244743
Product: glycosyl transferase group 1
Protein accession: YP_001510145
Protein GI: 158317637
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
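
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default, that the
    final codon is a stop codon that does not contribute a residue.
    """
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(7157747, 7158895)
print(length, protein_length_aa(length))
```

With the Start bp and End bp values from this record, the sketch reproduces the listed Gene Length (1149 bp) and Protein Length (382 aa).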


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.262799
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.416223
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCCGAAAC TGCGCGTCGT CCTCGACGGA ACCCCGCTGC TCGGGCCGCG CACCGGCGTC 
GGCCGATACA CGGCGGCGCT GCTGGCCGGC CTCGTCGAGC TGCCCGGGGC GGATCTCGAC
GTGGCGGCCA CCGCCTTCAC CTGGCGCGGC GGGGCGGGGC TCGACGCCGC CCTGCCCGCC
GGGGTCCGCC CGGCGGCGCG GCGCGTCCCG GCGCGGCTGC TACAGGACGC CTGGACACGC
TCGGAGCGCC CGCCCACCGA GTGGCTCACC GGGCGGGCGG ACATCGTGCA CGGGACGAAC
TTCGTCCTCG GCCCGCTGTC ATCGGCGCGC GGTGTGCTGA CCGTGCACGA CCTGTCGTAC
CTGCGTACCC CGGACACGGT GTCGGCTGCC TCCGCGCGCT ACGCGACGCT GGTGCCGCGC
GGGCTGCGCC GCGCCGCCGC GGTGCTCACC CCCAGCCGCG CCGTCGCCGA CGAGGTGATC
GCCGCCTACC GGCTCGACCC GGACATGGTC ACCCCGACCC CGCTCGGCGT CGACGCCGCC
TGGTTCGACG CCGCTCCCCC GGCCCGCGGC TGGCTCGCCG CGCGCGGGCT GCCCGAGCGG
TACCTGCTGT TCGTCGGGTC GGCGGAGCCG CGCAAGAACC TGCCGGTGCT GCTGGAGGCG
CTGCGCCGGC TGCGCGCCGA CGCGCCCGAC ACCCCGCCGC TGGCGCTCGT CGGCCCGCCC
GGCTGGGGCC CGGCGCTCGA CACCTCGGGC CTGCCCGCGG ACGCCGTCGT CACCGTCGGC
TACCTCGACG ACGCCGAGCT GCGCTCCGTG GTCGCCGGCG CGGCCGCGCT GTGCTTCCCG
TCCCGCTACG AGGGCTTCGG GCTACCGCCG CTGGAGGCGC TGGCCGCCGG TACCCGGGTC
GTGGCCGCCG ACATCCCCGC GGTGCGCGAG GTGGTCGGCG CCGCCGCCGG TGTCCGCCTG
GTCACCCCCG GCCGGTGGGA CGTCTTCGCC GACGACCTCG CCGGAGCCCT CGGCGCCGCG
CTCGCCGAAC CGACCGGCAC CGCCCAGTCC GCCACCCAGG CCGCCGCCGG CCGCGAGCAC
GCCCGCGCGT TCACCTGGCG GCGCACCGCC GAGCTGACCG CCGCCGTCTA CCGCCGCGTC
GCCGGCTGA
 
Protein sequence
MPKLRVVLDG TPLLGPRTGV GRYTAALLAG LVELPGADLD VAATAFTWRG GAGLDAALPA 
GVRPAARRVP ARLLQDAWTR SERPPTEWLT GRADIVHGTN FVLGPLSSAR GVLTVHDLSY
LRTPDTVSAA SARYATLVPR GLRRAAAVLT PSRAVADEVI AAYRLDPDMV TPTPLGVDAA
WFDAAPPARG WLAARGLPER YLLFVGSAEP RKNLPVLLEA LRRLRADAPD TPPLALVGPP
GWGPALDTSG LPADAVVTVG YLDDAELRSV VAGAAALCFP SRYEGFGLPP LEALAAGTRV
VAADIPAVRE VVGAAAGVRL VTPGRWDVFA DDLAGALGAA LAEPTGTAQS ATQAAAGREH
ARAFTWRRTA ELTAAVYRRV AG
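
The relationship between the gene sequence, the GC content field, and the protein sequence can be sketched in a few lines of Python. This is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, the codon assignments are those of the standard code (which translation table 11 shares), and the set of alternative start codons is the one listed for table 11 in the NCBI genetic code tables. An alternative start codon such as GTG is read as M only at the first position.

```python
from itertools import product

# Codon table built in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons assumed for translation table 11 (bacterial, archaeal, plastid).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 to 1.0)."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is always read as Met
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGCCGAAAC TGCGCGTCGT CCTCGACGGA"))  # MPKLRVVLDG
```

Applied to the first three blocks of the gene sequence, the sketch yields MPKLRVVLDG, which matches the first block of the protein sequence; the leading GTG is translated as M because it is the initiator codon.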