Gene Francci3_1562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1562 
Symbol 
ID3904794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1874078 
End bp1875469 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content65% 
IMG OID637878899 
Productglycosyl transferase, group 1 
Protein accessionYP_480667 
Protein GI86740267 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.515981 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGACAGA CACGCGACGG GACTATCTCG TTGGCTACGG GCGCGAATAC TCTCACGTCG 
GCGCCTGACA CCGACTCGAC CGAATCTCCC GACATCGGCG ACGCAGGACG GGTTCTGATC
GTAGTTCAGA ACCTGCCGAT TCTGATCGAC CATCGGACGT GGCAGGAATG CCAGGCGCTT
CGCAACCAGG GCTACCAGGT TGCCGTAATA TGCCCACGGG GCCCGGGGCA GCGGCGTCAC
CAGATCGTTG ACGGTGTCAG CATCTGGACG TATCGTGGCG CACCCCGGAC CCAGGGTGTG
CTCAGTTACG TCTTCGAGTT CGGCTACTGC TGGGTGCGAA CCGCTGTCCT GTCCGTCGCC
GTGGCCCGCC GGCACGGATT CGACGTGATC CAGGCATGTA ACCCGCCGGA CACGTACTGG
GCGCTCGCGC TGTTCTACAA GCCATTCGGC AAGAAGTTCG TCTTCGACCA TCATGATCTC
TGTCCGGAAC TCTATCGCTC ACGGTTCGGT AAGGACGGGG GGCTGTTGCT CCGCGCGTTG
CTGCTGCTTG AACGGATGAA TCAGCGGGTC GCTGACCACG TGATCGTGAC CAACGAATCT
TACCGGCGGC TGGCCCTGAC CAGGGGGCGG CTGCCGTCGG AACGGGTGAC CGTCGTCCGA
AACGGACCGG ACCCCGAACT GATGAAGCCC GCCGCGGAGC GGCCCGAATT GCGGCGCGGC
CGGCAGCACC TTGCCTGTTA TCTGGGAATA ATGGGCCCGC AGGACGGTGT CGACCGACTC
CTCGATGCCA TACATCACTA TGTCCACGTT CTCGGGCGCA CCGACTGCGC CTTCGCCCTG
CTGGGATTTG GTGACTGTCT GGACGATCTG CGGAAACAGT CGAGCCGGCT CGGTCTCGAC
GACTGGGTCG AGTTCACCGG GCGGGCCGAC GACCAGATGA TCCGGGACTA CCTGTCCACC
GCCACGGTCG GATTGTCGCC GGACCCGCGG AGCCCGCTGA ACGAGGTGTC CACCATGTCC
AAGACGCTCG AATACATGGC CTATGCGCTG CCGGTGGTGG CCTATGACCT GACCGAGACC
CGGGTGAGCG CCGAGGACGC CGCCGTCTAT GTACCCTCCG ACACGGTGGC GGACTTCGCC
CGGACCCTCG CCGAACTGCT GACGGACCCG GACCGACGCC GGACCCTTGG CGCCCGGGGT
CGGGAGCGCA TTGTGAACGA ACTGTCCTGG CGGTATTCGG AGCCCCGCTA CGTCGGGGTC
CACGACCGGC TACGTGGTCG CCAGGCCGGC CACCCCAGTA TCCCGGTCCC TCGTCAGGGC
GGCTTGACCC GCGTGAGCGG CTCGGCAGCG CCGGTTCACG CGACCGGCTC CCGGGAACGA
GCGGAACGAT GA
 
Protein sequence
MGQTRDGTIS LATGANTLTS APDTDSTESP DIGDAGRVLI VVQNLPILID HRTWQECQAL 
RNQGYQVAVI CPRGPGQRRH QIVDGVSIWT YRGAPRTQGV LSYVFEFGYC WVRTAVLSVA
VARRHGFDVI QACNPPDTYW ALALFYKPFG KKFVFDHHDL CPELYRSRFG KDGGLLLRAL
LLLERMNQRV ADHVIVTNES YRRLALTRGR LPSERVTVVR NGPDPELMKP AAERPELRRG
RQHLACYLGI MGPQDGVDRL LDAIHHYVHV LGRTDCAFAL LGFGDCLDDL RKQSSRLGLD
DWVEFTGRAD DQMIRDYLST ATVGLSPDPR SPLNEVSTMS KTLEYMAYAL PVVAYDLTET
RVSAEDAAVY VPSDTVADFA RTLAELLTDP DRRRTLGARG RERIVNELSW RYSEPRYVGV
HDRLRGRQAG HPSIPVPRQG GLTRVSGSAA PVHATGSRER AER