Gene Francci3_4526 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4526 
Symbol 
ID3907503 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5400766 
End bp5403963 
Gene Length3198 bp 
Protein Length1065 aa 
Translation table11 
GC content71% 
IMG OID637881859 
Productglycosyl transferase family protein 
Protein accessionYP_483601 
Protein GI86743201 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0744] Membrane carboxypeptidase (penicillin-binding protein) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCTCTC CCTACGATCA GGGCTCTAGT CGGGGTGGTC CTGCACCTGA CCGAGCATGG 
CCTAACCCGG ACGGCCAGTC CCGGGTACCC GCGGACCGGG TACCCCCGGA CCGGGTACCC
CCGGACCGCG CCCGGCGGCC GGAGCAGGCC AGGCCCGGCC AGGCCCAGCC AGGCCAGGCC
CGGGCAGGCC AGGCCCGGGC AGGCCAGGCC CGGGCAGGCC AGGCCCAGGG ACGCCGGCTG
GTGCCGCCGA CGGGCCGGCC CACCGGCGGG AGAGGGGAAC GCACCGCTCT CGTGCCACCC
GCGGCTCCCT CCGCCGCGGC GAGGACGAAC ACGCCGCCGG ACGGTGGGAC CGGTCGGGGT
GCCGGTGCCA CCGTCGGGGG GTGGGTGCCG GACCGTGGCA TCGGGAAGTC CGACGCGCCC
GGCCGTGAAC CGTCCAGCGC TCGTCCCGGC ATGCCCGTCG GCGAGGCGAC CGGTCGTCGC
CGAGGCGGCG GCGCGCCCTC CGGTCGGGCG ACCCCCACCC GGCGGGTTCC CGGCCGTGGG
TTCTCCGATC GGACCGCCTC TGACCGCGGC GTCCCCGATC GTGGGCCGGA CCAACCGGCA
GACCGGACAG CCGGCGAGGT CAACACCCTG TACGACACCG TCGACCAGGA AGCCCTACGA
CGGGTCGATG TCCGGGAACG CACCGTGGAC GCCACCGTTT ACCACAAGGT GATGCCGGAG
CGGGACTCCC GGAATGGCCG CCGCAACGGT GTGTCGGACC GCGACCGCAT GACCGAGGTG
ACGCCCGCTC AGCGGGTGGC CGGTCGGCGC GACCTCACCC GCGCCACCGA ACTGATGGCC
GTCCGGAGCC GGGCACGGGA GGGAGACGGA TCGGCCGCTA CCCGCCGGAT GGCGGGTGGG
CGACCGGGCG CGGATCGGAT CGAGCCCCGC CGGACCGGTC CAGGCCGGCC CGGTCCAGGC
CGGGCCGGCC CGGGCCGATC CTCCCGGGCG GAGATGGGTG TCCCGGGCGG GGCGGGAGGA
CCCGGGCGCC GCAACGGGAA GGCCCCCCGG CCGGGCCAGG GGCGACACGG CCCGATGTGG
TGGCGACGGC GCCCGCGCTG GCTGCGCCGT CTGGTCTTCG CCAGCTTCTT CTCCGGTCTT
CTCCTGTTCG CCGCCGGCAT CGGCGTCATC TACGCGGCGA CCCGGGTCCC GCTGCCTGCC
GAGGTGAAGA CGGACCAGAC GTCGATCATC TTCTACGGAC CGCCGGCCGG CTCGCACCAG
GACAACGGTG AGGAGCTTGC CCGGATCGGC ACCGTGAACC GCACCGACGT GCCGCTCGCC
GAAGTGTCCG TGGACATGCA GCACGCGGTG CTCGCGGCCG AGGACAAGAA CTTCTATCAC
GAACCCGGTA TCTCTCCCAA GGGCATTGCC CGGGCGCTCT ACGTGAACGT CACCGGCGGG
GAGCTGCAGG GTGGTTCGAC CATCACCCAG CAGTACGCCA AGAACGCGTA CCTGTCGCAG
GAGCGGACCT TCACCCGCAA GATGCGCGAG ATCGTGCTCG CGGTGAAGCT GGGCCAGAAG
TACTCCAAGG GACAGATCCT GGAGTTCTAC CTGAATACGA TCTACTTCGG CCGGGGCGCC
TACGGTATCG AAGCGGCGGC GAAAGCCTAC TTCAACACCT CGGCCGCCAA ACTCACCCCC
GCCCAGGGGG CGGTCATCGC GGGCCTGATC CGATCACCCA ACTACCTCGA CCCGGCCAAG
AACCCGGGTC CGGCGGCGAA CCGCTGGCAC GACGTGGTCG CGACGATGGT CGCCGAGGGC
TGGGCTCCGC CGGGGCTCGC CCAGCAGAGC CCCCCACCCG TGGCCGCGAA GGCGCAGGAC
GCCGCCGCCT CGTCCGACCA GATCGCGTAC ATCCGCGATC AGGTCAAACG TGAGCTTACG
ACCGTCGGGA AGATCACCGA GGATCAGATC AACCGCGGTG GACTGCGCAT CACGACCACG
ATCGACAAGG GGCGGCAGTT CAAGGCCTTC CAGGCGGTGA GCGACGTGCT GGGCCCGGCG
TACGCCGCCG TGCCGGATCT GCGTACCGGG CTCACCGCGA TAGAACCCGG GACCGGCAGG
ATCCTCGCCT GGTACGGCGG GTCGCTCTAC GGCAAGGACG CACAGGGCCA CGAGCAGTAC
GTCGACAACG TGTCAGGGGC GCAGGTGCAG TCCGCCTCCA CCTTCAAGAC GATCACTCTG
GTCGCCGCGT TGCGTCAGAA CATCAACCTC AAGTCGACCT TCGCCGCTCC CGCCAAGATC
ACACTGCCGG GGAATTACGT GGTCAGCAAC GACGAGGGTG AGGCGGGCGA TCTCGGATAC
AAGAACCTCA TCGAGGCGAC GGCGGGCTCC ATCAACACGG TGTACGTACC GCTGGGGCAG
AACATCGGCG TCTCGAACAT CATCAAGACC GCGCGCGACC TTGGCATCCC GGCCAGTACC
CAGCTGCGCA ACGAAGCGGG TATCACGCTG GGCCAGGACG ACGTGCACGC GGTGGATATG
ACCACGGTCT ACGCGACCCT CGCCGGCGCC GGAGTGCGGG CCACACCGCA CATCGTGGAC
AAGGTGGTCG ACGGCAACGG GCAGGTGATC TATTCGGGTA CCCCGGATGT GAAGCAGGTC
ATCCCGGCCA CGGTGGCCCG CGACGCCACG TACGCCCTGC AGAGCGTGTT GACGGATTCG
AGTGGCACCG GAAAGCGGGC CCGGCTGGAC GGCGGGCGGG AGGCCGCCGG CAAGACCGGG
ACGTCGACCA ACTTCCGGTC GGCCTGGTTC TGCGGGTACA CCAGGGAACT CGCGTCCTGC
GTGAACATGT TCCGGGGCAA GGGCACCGAG CAGGATGTGC TGAAGGGCAT TCCCGGCGCC
GAGAAGGGCG TCTACGGGGG TACCTACCCG GCCAAGGTGT GGAAGGCGTT CATGGACGCC
GCGCTGACGG GGGTGCCGCC GTCGAAGTTC GATCCGCCGG CCTTCGGTGG CCTCGTGCAG
GATAACGAGC CCGAGCCGAC TCCCACACCC ACCCCGGCGC CGAGCGCCTC ATCGAGCCAG
CCCGGCGATA CCGGCGTCAA CCTGGGCGAT CTCCTGAACC CCAGTGGCAA CGGCAACGGC
GGTGGTCAGC AGCAGGGTGC CGGCCAGGCG GGCCGGCCGG CTCGACAGAC GGGGATCTTC
TCCGACCCGT TCAACTGA
 
Protein sequence
MSSPYDQGSS RGGPAPDRAW PNPDGQSRVP ADRVPPDRVP PDRARRPEQA RPGQAQPGQA 
RAGQARAGQA RAGQAQGRRL VPPTGRPTGG RGERTALVPP AAPSAAARTN TPPDGGTGRG
AGATVGGWVP DRGIGKSDAP GREPSSARPG MPVGEATGRR RGGGAPSGRA TPTRRVPGRG
FSDRTASDRG VPDRGPDQPA DRTAGEVNTL YDTVDQEALR RVDVRERTVD ATVYHKVMPE
RDSRNGRRNG VSDRDRMTEV TPAQRVAGRR DLTRATELMA VRSRAREGDG SAATRRMAGG
RPGADRIEPR RTGPGRPGPG RAGPGRSSRA EMGVPGGAGG PGRRNGKAPR PGQGRHGPMW
WRRRPRWLRR LVFASFFSGL LLFAAGIGVI YAATRVPLPA EVKTDQTSII FYGPPAGSHQ
DNGEELARIG TVNRTDVPLA EVSVDMQHAV LAAEDKNFYH EPGISPKGIA RALYVNVTGG
ELQGGSTITQ QYAKNAYLSQ ERTFTRKMRE IVLAVKLGQK YSKGQILEFY LNTIYFGRGA
YGIEAAAKAY FNTSAAKLTP AQGAVIAGLI RSPNYLDPAK NPGPAANRWH DVVATMVAEG
WAPPGLAQQS PPPVAAKAQD AAASSDQIAY IRDQVKRELT TVGKITEDQI NRGGLRITTT
IDKGRQFKAF QAVSDVLGPA YAAVPDLRTG LTAIEPGTGR ILAWYGGSLY GKDAQGHEQY
VDNVSGAQVQ SASTFKTITL VAALRQNINL KSTFAAPAKI TLPGNYVVSN DEGEAGDLGY
KNLIEATAGS INTVYVPLGQ NIGVSNIIKT ARDLGIPAST QLRNEAGITL GQDDVHAVDM
TTVYATLAGA GVRATPHIVD KVVDGNGQVI YSGTPDVKQV IPATVARDAT YALQSVLTDS
SGTGKRARLD GGREAAGKTG TSTNFRSAWF CGYTRELASC VNMFRGKGTE QDVLKGIPGA
EKGVYGGTYP AKVWKAFMDA ALTGVPPSKF DPPAFGGLVQ DNEPEPTPTP TPAPSASSSQ
PGDTGVNLGD LLNPSGNGNG GGQQQGAGQA GRPARQTGIF SDPFN