Gene Francci3_3969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3969 
Symbol 
ID3906929 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4748950 
End bp4750659 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content72% 
IMG OID637881297 
Productglycosyl transferase family protein 
Protein accessionYP_483048 
Protein GI86742648 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1928] Dolichyl-phosphate-mannose--protein O-mannosyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.858554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCGA CGACGGCTCA CCTGCCCTGG ACGGCGACCG GTCGCGGGGA CGCGGCGCGG 
GTGACCGGCC CGGGTTCCGC CGGCCCGGCC GGCGGTCGGG CGCTCTCGGT GCGTGAGCGG
CTGTGTCCAC CGATGCCGCG GTCGCCGCTC ATGGGGTGGC TGGCGACCCT GTCCGTCGCG
ATCGTCGCCG GGCTGCTGCG TTTTTGGCAC CTCACGGAGC CCCGCGGGAT CTACTTCGAC
GAGGTCTACT ACACCAAGGA CGCCTGGGGC CTGCTGACGG CCGGGTATGA GATCAACTCG
ACGACCTGTT CCGGTCCGGC CTACGTCGTC CATCCGCCGT TCGGCAAATG GCTGATGGCC
GCCTCCGAGG GCCTGTTCGG CTATACCGAC TGTGCCGGGG TCCCGCACGG CAGTCCGGAG
CTCGGCTGGC GGTTCTCGTC CGCGCTGTTC GGGACGCTCG CCGTCCTGGT CCTCGCCCGC
GCCGCCCGAC GGATGTTCCG CTCCACGCTC CTGGGCTGCT TCGCCGGTCT GCTCCTCGCC
CTCGACGGGC TGGAGTTCGT GCAGAGCCGC ATCGGGATCC TCGACATCTT CCTGATGACC
GGCGTCGTCG TAGCGCTGGC CTGTCTGCTG CTCGACCGGG ACGACGGCCG CCGCCGGCTC
GCCGACCGGC TGGAACCGGT TGGTTCCAGC CCCGGCTCAG CACCCCTGGG GGACGAGGCC
GCCGGCCCGG ACGGCGGTAC GGACGCCGGC CCGGACGGCG GCCCGACCAC GACGGCTCCG
CCCACCCGGG CCGACCGCCT CCTCGACATG TACGGGCCGC GGCTGGGGTT CCGCCCCTGG
CGGCTGGCCT GCGGAGCGGC CCTGGGACTG TCCATGGGGG TGAAGTGGAG CGCCCTCTAC
ACGATCATCG GCTTCGCGGC GCTCGCCCTC GCGTGGGACA TCGGAGCACG GCGCACCGGC
GGAGCCCGCC GACCCGTGCT GGGAGCACTG CGCCGGGACA GCCCGGCCTG GTTCGCCGCG
TTCGTACTCG TCCCCATCGT GACCTTCCTC GCCACCTGGA CCGGCTGGTT CGTCACCGAC
GGCGGCTACT ACCGCCACTA CTACGGCAAC GGTTTCGGCG CCGCGTGGCA CGGCTGGTGG
AAGTACCAGA TGGCGGTGCT GAAGTTCCAC GAGGGACTCC ACGAGGGGCA CGCCTTCGCG
TCGCATCCGA TGAGCTGGCT GGTGATGGCG CGGCCGGTCG CGTACTACTA CTCCTCGCCG
GCGTACGGGA CCTCGGGCTG CCACGATCCG GCCGGCTGCT CCCGGGAGGT CATCGCCCTG
GGCAACCCGG CGATCTGGTG GGTCGGCACG GCGGCACTGG TCGCGATGCT CGCCTGGTGG
GTCTCCCGGC GGGACTGGCG GGCGGCCCTG GTGCTGGTCG GGTTCGGCTC GGCCTTCGTG
CCGTGGCTGC TGTTCCCGAA CCGCACGATG TTCTTCTTCT ACGCGCTGCC GTCGCTGCCG
TTCCTGGTCC TCGCCATCAC GGCGCTGGCC GGGCTCGTCC TCGGCCCGCG CGAGGCGTCG
GAGACCCGCC GCCTGGTCGG GGCGCTGTCG GTCGGGGTCT ACACGATCAT CGTCGTGCTG
CTCTTCGCGT ACTTCTACCC GATCCTGGCC GCCGAGGTGA TCCCCTACTC CTCGTGGCGC
GCCCGGATGT GGTTCCCGGG CTGGATCTGA
 
Protein sequence
MTATTAHLPW TATGRGDAAR VTGPGSAGPA GGRALSVRER LCPPMPRSPL MGWLATLSVA 
IVAGLLRFWH LTEPRGIYFD EVYYTKDAWG LLTAGYEINS TTCSGPAYVV HPPFGKWLMA
ASEGLFGYTD CAGVPHGSPE LGWRFSSALF GTLAVLVLAR AARRMFRSTL LGCFAGLLLA
LDGLEFVQSR IGILDIFLMT GVVVALACLL LDRDDGRRRL ADRLEPVGSS PGSAPLGDEA
AGPDGGTDAG PDGGPTTTAP PTRADRLLDM YGPRLGFRPW RLACGAALGL SMGVKWSALY
TIIGFAALAL AWDIGARRTG GARRPVLGAL RRDSPAWFAA FVLVPIVTFL ATWTGWFVTD
GGYYRHYYGN GFGAAWHGWW KYQMAVLKFH EGLHEGHAFA SHPMSWLVMA RPVAYYYSSP
AYGTSGCHDP AGCSREVIAL GNPAIWWVGT AALVAMLAWW VSRRDWRAAL VLVGFGSAFV
PWLLFPNRTM FFFYALPSLP FLVLAITALA GLVLGPREAS ETRRLVGALS VGVYTIIVVL
LFAYFYPILA AEVIPYSSWR ARMWFPGWI