Gene Francci3_0829 details


Gene Information

Locus tag: Francci3_0829
Symbol:
ID: 3905106
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 967427
End bp: 969034
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 70%
IMG OID: 637878162
Product: glycosyl transferase family protein
Protein accession: YP_479942
Protein GI: 86739542
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1215] Glycosyltransferases, probably involved in cell wall biogenesis
TIGRFAM ID: [TIGR03469] hopene-associated glycosyltransferase HpnB
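
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 967427
end_bp = 969034
gene_length_bp = 1608
protein_length_aa = 535


def span_length(start, end):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by an unspliced CDS that ends in one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return cds_length_bp // 3 - 1


coordinates_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coordinates_consistent, protein_consistent)
```

Under these assumptions both checks agree with the record: 969034 - 967427 + 1 = 1608 bp, and 1608 / 3 - 1 = 535 aa.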


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.759846
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGGCGC GGCAGGAGAG TTTCGGGGGG GAGATCTCCA TACCGGCTCG GCCAGACCGG 
GACGGAAGCC TTGTAACCCC GCATTACCCG GGGCGCCCGC ACGCCCTTGA GGGGCCCGTG
GCAGTATCAC TTTCTGTGAC AGTGCTGCTG TGGGTCGCCG TTGCCTCTCT GCTCGCCTGG
CTCTACCTGA CCGTCGGCCA TGGGTTCTTC TGGCGGACCG ATCAGCGCCT GCCATCCCGG
CAGGCGCCGA CCGCTTGGCC CAGTGTGGCG ATCGTCGTGC CGGCTCGGGA CGAGGCCGAC
GTGCTTCCCG TGACGCTTCC GACGCTGCTT GCCCAGGACT ATCCCGGTCC TGTCCAGCTG
ATCCTGGTGG ACGACGGTTC CACCGACGGG ACCACGGAGG TTGCCCGGGA CCTGGCGGAA
CAGGCCGCCA CGCGTGGTGA GACCAACGTG ACGCCGGCCA TGACCACATC CACCGAACCC
CCCACTGGCT GGACCGGGAA GCTGTGGGCG CTGCGGCGGG GCATCGAGTG CGCTGGCGAC
GTCGACTTCC TGCTCCTCAC CGACGCGGAC ATCTCCCACC GCCCGGGATC GCTGACCGCG
CTCGTCGAGT CGGCGACCTC CCAGAAGCTG GACATGGTGT CGCAGATGGC CGTGCTACGG
GTGCAGACCG GCTGGGAGCG TCTGATCGTC CCGGCCTTCG TGCACTTCTT CGCGATGCTG
TACCCGTTCC GGTGGTCGAA CCGGCCGGGT TCGCGGGTGG CCGCCGCGGC CGGCGGATGC
TCCCTGATCC GCCGGGAGGC CCTTGCGGCG GCCGGTGGAC TGGCGGAGGT GCGCGGCGCC
GTCATCGACG ACGTGGCGAT CGCGCGGATC ATCAAGCGCT CGGGCGGGCG GACGTGGCTC
GGACTCGCCG AGCAGGTCCA CAGCCAGCGC CCGTATCCGC GGCTCGCGGA TCTGTGGAAG
ATGGTGTCCC GCAGCGCCTA TGCGCAGTTG CGGCACTCCC CGTCGCTGCT GGTCGGGACC
GTGTTGGGAC TGAGCCTGGT CTTCGTCATC CCAGTGGTCG CGACCATCGT GGGCATCGTG
ACCGGCGACG TCGCCACGGC TCTCGTCGGT GGGATCGCTT GGTTGATCAT GACTGTCACC
TATCTGCCGA TGACCCGCTA CTATTGTCAG CCGCTGCCGC TGGCCCTGCT GCTCCCCGGA
GTGGCCGTGC TGTACCTCGC GATGACGGTG GACTCGGCGC GGCTCAAGCG GGCCGGGCGG
GGAGCGGCCT GGAAGGGACG TACCTACCAG GATCACGGTG CCCCGGCCGC CGCCCCCGAG
TATCCGAACG GCCCGGGTGA ACGGCGTGAG GGTTCGGTGG CCGGGCCGCC CGACTCCGGC
GGCTCATCGG CCTCCGCCGG CGGCTCATCG GCTTTGCCAT CCGCCTCGTC CACGGTGCCC
GCAACACCCC TGGTAATGGC GACATCCTCG GCGGTCCCCG CGCCGGCCGT GGCGCCGGCT
TCCCCCACGC CTACGTCCAT ACCGACGCCC ACTCCCACGC GGTGGCCCGC ATCGTCATCG
GCCTCGCCGC CGGTCCGGGG CGGATCGACC GATCAGCCCC GGACCTAG
 
Protein sequence
MVARQESFGG EISIPARPDR DGSLVTPHYP GRPHALEGPV AVSLSVTVLL WVAVASLLAW 
LYLTVGHGFF WRTDQRLPSR QAPTAWPSVA IVVPARDEAD VLPVTLPTLL AQDYPGPVQL
ILVDDGSTDG TTEVARDLAE QAATRGETNV TPAMTTSTEP PTGWTGKLWA LRRGIECAGD
VDFLLLTDAD ISHRPGSLTA LVESATSQKL DMVSQMAVLR VQTGWERLIV PAFVHFFAML
YPFRWSNRPG SRVAAAAGGC SLIRREALAA AGGLAEVRGA VIDDVAIARI IKRSGGRTWL
GLAEQVHSQR PYPRLADLWK MVSRSAYAQL RHSPSLLVGT VLGLSLVFVI PVVATIVGIV
TGDVATALVG GIAWLIMTVT YLPMTRYYCQ PLPLALLLPG VAVLYLAMTV DSARLKRAGR
GAAWKGRTYQ DHGAPAAAPE YPNGPGERRE GSVAGPPDSG GSSASAGGSS ALPSASSTVP
ATPLVMATSS AVPAPAVAPA SPTPTSIPTP TPTRWPASSS ASPPVRGGST DQPRT
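
The gene and protein sequences above can be compared by translating the DNA codon by codon. The following is a minimal sketch, not the method used by the source database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons, which does not matter here because this gene begins with ATG. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence and the matching start of the protein.
first_line_dna = (
    "ATGGTGGCGC GGCAGGAGAG TTTCGGGGGG GAGATCTCCA TACCGGCTCG GCCAGACCGG"
)
protein_start = "MVARQESFGG EISIPARPDR"
print(translate(first_line_dna) == clean(protein_start))
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the full protein sequence and the reported GC content.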