Gene Francci3_1301 details


Gene Information

Locus tag: Francci3_1301
Symbol:
ID: 3904350
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1556212
End bp: 1557264
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 71%
IMG OID: 637878634
Product: glycosyl transferase family protein
Protein accession: YP_480407
Protein GI: 86740007
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0463] Glycosyltransferases involved in cell wall biogenesis
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGACGC AGACCCACCG AATCCCGCCG CGCCAGGGCC CGTCGTTCCC CGCGGCACCA 
CCGGCCGCAC CAGAACCGGC AACGCCAGCA CCGGCCGCGC CAGAACCAGC CGCGCCAGAA
CCAGCAACGA CGCGAACCCC GCGCCGAGCG AACTGGCCGA CCCCCTATGA CACCGCCGCC
CCCGCCGGAT CGACACAGCC ACAGCCGGCG CCGGGATCGG TTCCCGTCAC CGCGATCATC
CTCGCCCACA ACGAGGCACC AAACATCGTC CGGGCGATCA GGTCCGCCGG TTGGTGTCGA
CAGGTCGTCG TCGTGGACTC GGGTTCCACC GACGGCACCG CGGATCTGGC CCGCGCCATC
GGTGCGACCG TCTGGCACGA GCCCTGGCGT GGGTTCGCCG GCCAGCGGCA GTGGGCGATG
ACCAACCCGG GGATCGCCCA CGACTGGGTG TACTTCCTCG ACAGCGACGA ATGGGTGTCG
ACCGCGCTCG CCGCCGAGAT CGCCGCCCGG CTGGGGACGG CGGACTGCGC GGCCTACAGC
CAACGACGCC GGCTGGTGTT CGAAGGCCGC TGGATCGCGC ATTGCGGGTG GTACGCGAAC
AGCTGGCAGG CGCGGCTGCT CGATCGGCGG GTGGCGTACT TCGACGCCGC CGTCACCTAC
AGCGAACGGG CCGTGGTCAC TGGTGAGGTC GGACGGCTGT CCGCCGACCT GATCGACGAG
GACCACAAGG GACTCGCCGC CTGGCTACGC AAGCACGTGC GCTACGCCGA ACTGGAGGCG
GCGCGCCGCG TGACGCAGCC CGCTGTCCGG GAGCGGTTGG CGCGGGTCCG CGAGGAGGTG
CGCCGGCCCA CCGGGTCGAC CCGACCGCTC ACCCGGACCA TCGCGAGGGA CGTGATCTTC
CCGCTGGTTC CGGCCAAGCC GGCGGTCCTC TTCTGCTACA TGTACCTGCT ACGGAGCGGA
TGGCGGGATG GGCGGCAGGG GCTGCTGTTC TGTCTCTACT ACGCCTGGTA TGAGCTCACG
ATCGGCGCAC TGACTCGGTC CGTTCACCGG TGA
 
Protein sequence
MTTQTHRIPP RQGPSFPAAP PAAPEPATPA PAAPEPAAPE PATTRTPRRA NWPTPYDTAA 
PAGSTQPQPA PGSVPVTAII LAHNEAPNIV RAIRSAGWCR QVVVVDSGST DGTADLARAI
GATVWHEPWR GFAGQRQWAM TNPGIAHDWV YFLDSDEWVS TALAAEIAAR LGTADCAAYS
QRRRLVFEGR WIAHCGWYAN SWQARLLDRR VAYFDAAVTY SERAVVTGEV GRLSADLIDE
DHKGLAAWLR KHVRYAELEA ARRVTQPAVR ERLARVREEV RRPTGSTRPL TRTIARDVIF
PLVPAKPAVL FCYMYLLRSG WRDGRQGLLF CLYYAWYELT IGALTRSVHR