Gene Francci3_3848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3848 
Symbol 
ID3905596 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4609963 
End bp4611090 
Gene Length1128 bp 
Protein Length375 aa 
Translation table11 
GC content63% 
IMG OID637881174 
Productglycosyl transferase, group 1 
Protein accessionYP_482927 
Protein GI86742527 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.454248 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGACGG CACTGAGGGT GCTCCAGGTG GCGGCACGAT TCTTCCCGGA CATGGGTGGT 
ACCGAGACGC ATGTCTACGA GACGAGTCGT CGACTCAACG CTACTAGTGA CATTACCGTG
GAGATATTGA CGACCGACCG CAGCGGCAAG CTTCCCAGCA GGGAGAATGT CGCGGGAACG
GTGGTGCACC GGGTCGCGGC GTGGCCGCAA GAGAAAGACT ATTATTTGGC GCCGGCCGTT
GCCAAAGTGG TCGGATTCGG TTCTTATGAT CTTGTTCATT GTCAGGGAAT TCACAATCTC
GTGCCGCCGG TGGCGATGGC GGCGGCTCGC CTGAGGGGTA TTCCTTATAT TGTCTCACCG
CATACCGGCG GTCATTCCTC GCAGGTCCGG AACACGGCGC GCCGAGTACA GTGGGGCCTG
CTCGGGCCAC TGATCAGGAA CGCCCGCCGC GTCATCTGCG TCGCGGAGTT CGAGTCCCAC
ATCTTCATGC GCCAAGCCGG GGTCGCTGCA GACCGGATCT CGGTCGTCCC GAACGGAGTG
TCCATCGTTC CACCGAGTGG CCATGTCAAA CCCGATACGA GTGAGCCGCT TGTCGTCTGC
GTCGGCCGGT TGGAGAAGTA CAAGGGACAG CGGCACCTTG TCCGCGCTCT GCCGAGTCTG
ATTACTCTAG TTCCTGACGT ACGGCTGATG CTGGTGGGGC GGGGCCCTGA TGAGCCGGAG
TTGAGGCGCC TGGCTGACCG GCTGGGTGTG GTGGACCGGG TGAGTTTCAC CTCGATACCA
CCGGAGGACC GGCAGGCAAT GTCTGACTGT ATCGCGAGAG CGGGTGTCGT GGCCCTGCTT
AGCGAGTATG AGGCTCATCC GGTCGCGGTC ATGGAAGCCG TGGCCCTGGG TAGGCCGGTG
GTGGTGGCGC CCACTGCCGG ACTGGGAGAG CTGGCTGCGG CAGGGCTCGC GCAGAGCGTT
GCGGATCCGG CCGATGAACA ACTCGTGGCG AAGACGCTGG GCATCTACCT GCTCGCCAGC
GCTGGGGACT CACCGAGCGA GACGAGGCCG ACTCCCGAAA TCTCCACCCT GCCGACCTGG
GACGGCTGCG CGGAAGCCTT GGCGAGGATT TATCGTGAGT CGGTCTGA
 
Protein sequence
MGTALRVLQV AARFFPDMGG TETHVYETSR RLNATSDITV EILTTDRSGK LPSRENVAGT 
VVHRVAAWPQ EKDYYLAPAV AKVVGFGSYD LVHCQGIHNL VPPVAMAAAR LRGIPYIVSP
HTGGHSSQVR NTARRVQWGL LGPLIRNARR VICVAEFESH IFMRQAGVAA DRISVVPNGV
SIVPPSGHVK PDTSEPLVVC VGRLEKYKGQ RHLVRALPSL ITLVPDVRLM LVGRGPDEPE
LRRLADRLGV VDRVSFTSIP PEDRQAMSDC IARAGVVALL SEYEAHPVAV MEAVALGRPV
VVAPTAGLGE LAAAGLAQSV ADPADEQLVA KTLGIYLLAS AGDSPSETRP TPEISTLPTW
DGCAEALARI YRESV