Gene Francci3_1304 details

Gene Information

Locus tag: Francci3_1304
Symbol:
ID: 3904353
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1560118
End bp: 1561443
Gene Length: 1326 bp
Protein Length: 441 aa
Translation table: 11
GC content: 72%
IMG OID: 637878637
Product: glycosyl transferase, group 1
Protein accession: YP_480410
Protein GI: 86740010
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
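
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1560118
END_BP = 1561443
GENE_LENGTH_BP = 1326
PROTEIN_LENGTH_AA = 441


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1326
print(expected_protein_length(GENE_LENGTH_BP))  # 441
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields.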


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCCCCT CTCCTGTCCG GTCGCGCGTC CTGCTGGTGA CGCACTACTT CCCCCCGGAG 
ACCGGAGCTC CCCAGTCCCG GTTGTCGGAG ACGGCACGCG CGTGGGCGGC GAACGGGCTC
GACGTCACCG TACTCACCGG CATGCCGAAC CACCCGACGG GCAAGATCCC TGCCGCCTAT
CGCGGCGCCT GGCTGCGGAC CGAGCGGGTC GACGGGTACC GCGTCGTACG CACCTGGCTC
TATGCCACTC CCAACGAGGG GATCGCCCGC AAGACCCTCG GTCATCTGTC GTTCATGGTC
ACCAGCGTCC TGCTCGGCGG CCGGCCCGCC GGCCCGGCCG ACGTGGTGGT GGTGTCCTCG
CCGACCTTCT TTCCCCTCGG CTCGGCCTGG CTGCTCGCCA GGCTGCGCGG CGCCCGGCTG
GTTGTGGAGG TCCGCGACCT GTGGCCGGCC ATCTTCGAGC ACCTCGGCGT CCTCACCGAC
CGGCGGGTCC TCGGCGTTCT CGAACGCCTC GAACTCGCCG CCTACCGGGC CGCCGACGCC
GTTGTCACGG TGACGGAGGG GTTCCGGGAG GACATCGTGC GACGAGGCAT CGCGCCGCGC
AAGGTGCACG TGATTCCCAA CGGCGTGGAC CTCCGCCGGT TCCACCCGAC GACCGCGGCC
TCGGCCGACA TCCGGGCCTG GCTGGGCGCC ACCGACGGCG ACACCCTCGT GCTCTACCTC
GGCGCCCACG GCATCTCGCA CGGACTGACC TCGATCGCCG ACGCGGCCGC CCGGGTGACC
GGCCGGCCGA TCCGGTTCGC CTTCGTCGGT GAGGGGGCCG AGAAACGCAG GCTCGTCGGG
CACGTCGAGA GCCTGGGACT GGCCAACACG GTGCTGCGCG ACGGGGTTGC CCGCGAGGAG
GTACCCGCCG TCGTCGCGAC CGCCGACATC TGCGTTGTCC CACTGCGGGA CGTGCCGATG
TTCGACACGT TCATCCCATC GAAGATGTTC GAGTTCCTCG CCGCGGGCCG CCCGGTGATC
GGGGCGGTCC GCGGCGAGGC GGCCCGGATC CTCCTTGCCG CCGGGCAGAT GGTCGTGCCC
CCCGAGGACT CGGCCGCGCT GGCGGAGGCG ATCCTGGTCC TGGCGGCGGA CCCGGACCGC
CGGGCGCGGA TGGCCCGCGG CGGGCGGGCG CACGTCGAGG CCCACTACGA TCGCGACGAT
CTGGCCCGCC GGTACCAGAC GCTGCTGTTC GACAACGCAC CGTTCCCGGC GCCGCCTCCA
CCGACGACCC TTTCCCCACC GGTGCAGGTA CCTGCACCGG TACCTGCACC GGACGTGGTC
GCATGA
 
Protein sequence
MVPSPVRSRV LLVTHYFPPE TGAPQSRLSE TARAWAANGL DVTVLTGMPN HPTGKIPAAY 
RGAWLRTERV DGYRVVRTWL YATPNEGIAR KTLGHLSFMV TSVLLGGRPA GPADVVVVSS
PTFFPLGSAW LLARLRGARL VVEVRDLWPA IFEHLGVLTD RRVLGVLERL ELAAYRAADA
VVTVTEGFRE DIVRRGIAPR KVHVIPNGVD LRRFHPTTAA SADIRAWLGA TDGDTLVLYL
GAHGISHGLT SIADAAARVT GRPIRFAFVG EGAEKRRLVG HVESLGLANT VLRDGVAREE
VPAVVATADI CVVPLRDVPM FDTFIPSKMF EFLAAGRPVI GAVRGEAARI LLAAGQMVVP
PEDSAALAEA ILVLAADPDR RARMARGGRA HVEAHYDRDD LARRYQTLLF DNAPFPAPPP
PTTLSPPVQV PAPVPAPDVV A
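
The gene sequence starts with GTG while the protein starts with M. This is consistent with translation table 11, where GTG is one of the permitted initiation codons and an initiation codon is read as methionine. The following is a minimal sketch of that translation and of a GC-fraction calculation. The helper names (`translate_cds`, `gc_fraction`) are hypothetical, and treating the first codon as a start codon is an assumption of this sketch, not something taken from the database's own tooling.

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiation codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(dna)
    return sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence above.
print(translate_cds("GTGGTCCCCT CTCCTGTCCG GTCGCGCGTC"))  # MVPSPVRSRV
```

Passing the full gene sequence to `translate_cds` and `gc_fraction` is one way to compare the result against the Protein sequence and GC content fields of this record.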