Gene Francci3_1629 details

Gene Information

Locus tag: Francci3_1629
Symbol:
ID: 3905908
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1958268
End bp: 1959296
Gene Length: 1029 bp
Protein Length: 342 aa
Translation table: 11
GC content: 70%
IMG OID: 637878967
Product: glycosyl transferase family protein
Protein accession: YP_480734
Protein GI: 86740334
COG category: [R] General function prediction only
COG ID: [COG1216] Predicted glycosyltransferases
TIGRFAM ID:
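
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the protein length excludes the stop codon; the function names gene_length and protein_length are illustrative and not part of any database tooling.

```python
# Values copied from the Gene Information fields above.
start_bp = 1958268
end_bp = 1959296


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1029 342
```

Under these assumptions the computed values agree with the listed Gene Length (1029 bp) and Protein Length (342 aa).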


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGCACA GGCTGTTGGC GGTCGTCGTT TCTCACGGGG GATCGACGTC TCTGTACCGG 
TTGCTGACCA CACTGGATGC CATGGCGGAA TGTCGGGTGT TTCTGGTAGA AAATGATGGG
AAATCCCGAC ATGATGCGTT GCCGGACGGT GTGCGCGTGG TCCAGGGGCA CGGGAACGTC
GGATACGGCA CCGCGGTGAA CCTCGCTGTC CGCCGTGCCC TGGAGGACGG GCTGCGGCCG
GAGTGGATCC TGGTGGTCAA CAGTGACGTG ACGGTCCCTG CCGACACCGC GACGATGATC
CCGAAGCTGC TCGCCTGGGC CCCGTCGTCC GCCGACGTGG TCGGCTTCCC GATCCGCGGC
ACGGCGGGCG AGCGGGGACG GGCGAGTGCC GTCCTGCCGC GCCCGCGGAC GAACGCCTAC
ACGGCGGTAC GGGGCGAGAT CGCCGCGGTG GAGAGGTGGC CGGAACTGCG CTATCCGGTC
GGCGCCTTCT TCGCCGTCCG CTCGGAGATT TTCCTACGGC TGGGCGGATT CGACCCGTCG
TACTGGATGT ACTACGAGGA GACGGATCTG TTCGCCCGGT TGCACGCCGC GGGTGGGCGC
ATCGTCTGGG CGGACGACGC CTGGCCGGTC GTTCATGTGG GCGGGGAGAC CGTCGGGCGG
TCCGGGCTGC TGTACGCCGA ACTCGGCCGG TCGGCAGCCA CCTATGCCCG GCGGCACCGT
CACGACGTGG GCCGGTCGTG GACCGCGGTG CACGCGGCCC AGTTGACCGT CCTCGCCGCG
CGCAAATTGG CCGTGGGCCG GTCGCACGAC GCGTTGCGCG CGGTCCGGAT CCTCTCCGGG
CTGGTGAGCG GGCTGGCCCG GCCAGGCTGG GAGCCCGCGG TCAGCTCACG GTGGCACGCC
GTCCCGGCCG AGACGCGGCT GCGTCTCGGC CATCTCCGCC CGGTCCCGCG GACGCCGCGG
CAGCGGCAGG ACGATCTGAT CGATGATCTC GCCGACGGCT CCCCCGGCTC CTCTGGCCAG
CGGACGTAG
 
Protein sequence
MMHRLLAVVV SHGGSTSLYR LLTTLDAMAE CRVFLVENDG KSRHDALPDG VRVVQGHGNV 
GYGTAVNLAV RRALEDGLRP EWILVVNSDV TVPADTATMI PKLLAWAPSS ADVVGFPIRG
TAGERGRASA VLPRPRTNAY TAVRGEIAAV ERWPELRYPV GAFFAVRSEI FLRLGGFDPS
YWMYYEETDL FARLHAAGGR IVWADDAWPV VHVGGETVGR SGLLYAELGR SAATYARRHR
HDVGRSWTAV HAAQLTVLAA RKLAVGRSHD ALRAVRILSG LVSGLARPGW EPAVSSRWHA
VPAETRLRLG HLRPVPRTPR QRQDDLIDDL ADGSPGSSGQ RT
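
The relationship between the gene sequence and the protein sequence can be checked by translating codons. This is a minimal sketch, not the database's own method: it builds the codon assignments from the conventional TCAG-ordered amino acid string, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The helper name translate is illustrative.

```python
from itertools import product

# Codon assignments in TCAG order; identical for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGATGCACA GGCTGTTGGC GGTCGTCGTT TCTCACGGGG GATCGACGTC TCTGTACCGG"
print(translate(first_line))  # MMHRLLAVVVSHGGSTSLYR
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence (spaces and line breaks are ignored) should yield the complete protein under the same assumptions.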