Gene Francci3_1580 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1580 
Symbol 
ID3903715 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1894480 
End bp1896276 
Gene Length1797 bp 
Protein Length598 aa 
Translation table11 
GC content72% 
IMG OID637878917 
Productglycosyl transferase family protein 
Protein accessionYP_480685 
Protein GI86740285 
COG category[G] Carbohydrate transport and metabolism
[R] General function prediction only 
COG ID[COG0726] Predicted xylanase/chitin deacetylase
[COG1216] Predicted glycosyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.47169 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCCCC CGGCCGGCCC GCGGCTCTCG GTGGTCATCT GTGCCTATAC CGAACGGCGC 
CGGCACGATC TGCAGCGCGC GGTGACCAGC ATCGCCGAAC AGACCAGGAA GGCCGACCAG
CTGATCCTCG TCATCGACCA CAATGACCGG TTGCGCCGGT GGGCCGAGGC TGCGTACCCC
GGTGCCACCG TGATCCCCAA CACCGGCCGC CGTGGCCTGT CCGGTGCCCG CAACTGCGGC
GTCGGGGCCG CCACCGGCGA CGTGGTCGCC TTCCTGGACG ACGACGCCCA CGCCGAGCCG
GACTGGCTGG CCCAGTTGGC CGCCCACTAC ACCGACCCCC GGGTCGCCGG GGTCGGCGGT
GCCGCCATGC CGGTGTGGCC GCACCGCCGA CCCCGCTGGT TCCCCCCCGA GTTCGACTGG
GTGGTCGGAT GCAGCTACGT CGGGCTGCCG ACCGACACGG CTGCGGTCCG TAACCCGATC
GGAGCGGGGA TGTCGTTCCG GCGGGCCGTG TTCGATCGCG TCGGCGGGTT CACCGAGGGG
CTTGGGCGGG TCGGCACCAC ACCGCTGGGC TGCGAGGAAA CCGAGTTCGG TATCCGCCTT
CAGGCGATCC TCCCGGACGC TGTCGTCCGC TACGAGCCAC GGGCACGGGT CTGGCACCAG
GTAACCGGCG ACCGGGCCTC CCTGCGGTAC TTCCTGGCAC GCTGCCATGC CGAAGGCCTG
TCCAAGGCCG CGGTAGCCGA CCGGGCCGGG GCGGACGCGG CGCTCGCCAC CGAACGGCAG
TACCTGAGGC GGACGCTGCC CCGAGCACTG GCTCGTGACC GCCACAGCCT GGGCACCTGG
CCGCGAGCCG GGGCCGTGCT GGTCGGCACC GGGTCCACGG CCATCGGCTA CGCACGCGGC
CGGCTACGCC TCGCCGCCGG CCGGCGTGAC ACGCCACGTC AGCCGATCCC GTCGGAGGTA
GCTGTGATCC CGATTCTGCT GTACCACTCG GTGACCGATT ATCCGGTGGC GAGCTATCGA
CGTTGGACGG TCGATACCGC AACCTTTGTC CGCCATCTCA CCCTCATTGC CGGTTCCGGC
CGGGTACCGC TGACGGTGTC CGAGTATGTC GAACGGAGGC GGCACCAGAC CCTGCCTCCG
CGACCGGTCC TCATCACCTT CGACGACGGA TTCGCCGACA ACCTGGCCGC CGCTCGTGAG
GTCGTCGCGC ACGGGCTCAC CGCGACCTGC TACGTCGTCA CCGACTGGAT CGGCCAGGTC
GGCATGCTGC GCGGCGCCGA CCTGCGGACC CTGGCCGGCC TCGGGGTCGA GATCGGTGGT
CACAGCCACA CCCATCCCCG GCTCGACGAG CTTCGCCCCG ACGAGGCGCG CCGGGAGATC
AGCGACTGCA ACGCCCGGTT GACCGCTGCG ATCGGCGCAC CGGTGGGCTC CTTCGCCTAC
CCCCACGGCA ATTACGACCA TGCCGTTCGG CGGCTCGTCG GGCAGGCCGG CTTCACGTCC
GCCTGCGGGG TGCGCAACAT GATGTCGCAC GGCGCCGACG ACCCGTTCGC CCTTGCCCGA
CTGACCGTGA CGGTCGACAC CCCGGACCGG CAGATCAGGG CGTGGCTGGA CGGGGCCGGC
CGCGCGGCCC CCGCTCGGGA GTTGCTGCGT ACGCGGGGCA GCCGGCTCTC CCGTCGAACG
AGGGCTCGGC TCCTCGGCCC GCGTCTACCC TACCGACCAA TCGTCACCGA TCTGCCCGTA
CCAGCCGATC TGCCCGTACC AGCCGTCCCG GCGGCGTTCG GGGAGGTGCG GCCGTGA
 
Protein sequence
MTPPAGPRLS VVICAYTERR RHDLQRAVTS IAEQTRKADQ LILVIDHNDR LRRWAEAAYP 
GATVIPNTGR RGLSGARNCG VGAATGDVVA FLDDDAHAEP DWLAQLAAHY TDPRVAGVGG
AAMPVWPHRR PRWFPPEFDW VVGCSYVGLP TDTAAVRNPI GAGMSFRRAV FDRVGGFTEG
LGRVGTTPLG CEETEFGIRL QAILPDAVVR YEPRARVWHQ VTGDRASLRY FLARCHAEGL
SKAAVADRAG ADAALATERQ YLRRTLPRAL ARDRHSLGTW PRAGAVLVGT GSTAIGYARG
RLRLAAGRRD TPRQPIPSEV AVIPILLYHS VTDYPVASYR RWTVDTATFV RHLTLIAGSG
RVPLTVSEYV ERRRHQTLPP RPVLITFDDG FADNLAAARE VVAHGLTATC YVVTDWIGQV
GMLRGADLRT LAGLGVEIGG HSHTHPRLDE LRPDEARREI SDCNARLTAA IGAPVGSFAY
PHGNYDHAVR RLVGQAGFTS ACGVRNMMSH GADDPFALAR LTVTVDTPDR QIRAWLDGAG
RAAPARELLR TRGSRLSRRT RARLLGPRLP YRPIVTDLPV PADLPVPAVP AAFGEVRP