Gene Francci3_2954 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2954 
Symbol 
ID3903769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3497641 
End bp3501324 
Gene Length3684 bp 
Protein Length1227 aa 
Translation table11 
GC content71% 
IMG OID637880275 
Productbacteriophage resistance gene PglY 
Protein accessionYP_482041 
Protein GI86741641 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.231929 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCTGA TCTCGGAACT CCTCGACATC CCCGAGGCCG TGTACCCGGG GGACTTCGTC 
CTCGACCTGT CCCGCGGCGT GACCCAGATC GATCGCACCC TGCGCGAGTA TGTCGTCACC
CCCGAGCTGG CCATCCGGTT CGGGGAGGCG CTGGGTCTGA TCAAGAGCGC GCTCACCGAC
GGCAGTAGCA AGGCCGCCTA CCTCGACGGC TCGTTCGGGT CCGGCAAGAG CCACTTCATG
GCTGTGCTCG GCGCGCTGGC GCGGAACCAC CCCGGGGCCC GGGCGATCGG CGAGCTCGCC
CCTGTCATCA CCGCATTCGA CGGTGACGTC ATCGGCAAGC AGTTCCTCGT CGTGCCGTAT
CACCTCGTCG GCAAGACGTC GCTGGAGGAG GCGGTCCTCG GCGGCTACCT GGCGCACGTC
CGCGCGCTGC ACCCGGACGC GCCGCTGCCC GCCGTCCTCA TCGACAAGCC GCTGCTCGAC
ACCGCGGTGG CGTTGCGCGG CCAACTCGGC GACCAGGCGT TCTTCGGCCT GCTCGGCGGC
GGCGACCGGA GCTCGGCTGC CGACCCGCGC TGGGGCACGT TGAACCACGG GGCCTGGGAC
GCCGACCGGT TCGAGCAGGC ACTGACCGCC GCTCCCGCCG CGCAGGAGCG CCGCGACCTG
GTCGGCACCC TGGAGAAGGC GCTGGTCGGG TTCGCCGAGC TCGCCCGGGG CGCCGCCACC
GGCTACGTCA ACATCGACGA CGGCCTGGCC GCGATCAGCC AGCATGCCGC CGGCCTCGGC
TACGACGGGC TGATCCTCTT CCTCGACGAG CTCATCCTGT GGTTCGCCAC CAGGATGGCC
GATCACGCCT GGGTGGCGAA CGAGGCTCCC AAGGTCGCCA AGCTCGTCGA GGCGGCCAAC
GCCGACCGAC CGGTGCCGAT CATCAGTTTC GTCGCCCGCC AGCGGGACCT GCGCGAGCTC
GTCGGCACCA ACCTGCCCGG CACCGAACAT CTTGCCTTCG CCGACAGTCT CAAGTGGTGG
GAGGGTCGGT TCGGCTCGAT CAAGCTCTCC GACAACAACC TGCCGCTCAT CATGTCGAAG
CGGGTCCTGC GGCCGCGCAG CGATGCCACC CGGGCACAGA TCGACACCTC GTTCGCCGCG
ATGGACCGGG TCCGAGCGGA CATCCGGGCC GCGCTGATGA CCGCCCACGC CGACCGGGCC
GCGTTCCGGC TGACCTATCC GTTCAGCCCG GCATTCGTCG AGACGCTCGT CGCGCTGTCC
GGGTTCCTCC AGCGGGAGCG GACCGCGCTG CGGGTCATGC AGCAGCTACT CGTTGACCAG
CGGGACACCC TGCAGCTCGG CCAGCTCGTC CCCGTCTCCG ACCTGTTCGA CGCGCTCATC
ACGGATGCGT CGCCGATCAG CGGCGAGCTC GGCGGGCTGT GGCGCAACGC GCAGAAGGTC
TACGGCGAGA TCCGTCGGCT CATCCTGGAA AGCCACGGGC TGACCGAGGA AACCGTCACC
GGGGTGGCCC CGAAACACGC CGTGCACATG GACGACCGGA TCGCCAAGAC CCTGGTGCTC
GCCGCCCTGC TCCCCGAGGT CCCGTCGATG CGGGATCTCA CCGCCCGCCG GATCGTGGCC
CTCAACCACG GGTACATCCG GTCGATGGTG CCGGGTCAGG AGACGGGCGC GGTCACCCAG
GTCGTCCGCA AATGGGCGGC CCGGCTCGGC GCGGTGCAGG TCAGCGGGGA CGACGCAAGC
CCGCTCCTGT CGGTGCGGCT CGAAGGCGTC GACATCGAGG GCATTCTGGA GAACGCCAAG
GGGGCCGACT CGCCCGGCGA CCGCCGGCTG ACCGTGCGAA CGCTGCTGTT CGACGCGCTC
GGCGCCACGA AACCGGAGGG TATCGACCGC GCGACGATCG TCACCGCCAC CTGGCGCGGC
ACCAAACGCA ACGTCGAGCT CATCTACGGC TATCTTCGCG ACCCCGGTGA CGTGGGCGAC
AGCATGTTCA CGCCCACCCA CAACGGCTGG CGGCTCGCCC TCGACGTCCC CTTCGATCCG
GACGACCACT CCCCGGCCGA GGCCCTCGAC CGGGTCTCGC GGCTGCGGGA CGCGCTCCCA
GCGCGCACCC TCTGCTGGGT CCCGCGGCAC TTCACCGCAC ACACCCAATG GAGCCTGGGC
CGCTACCTGC GGCTGGAATA CGCCGTCGGG CCGAGCTTCG ACCAACTCGC CGGGCACCTG
TCCGACAATG ACCGGGCTAT CGCCCACCAG CAGTTGACCG CGCAGCTCGA CCAGCTGCGC
AGCCAACTGA CGAACGCGCT GCTGCAAGCC TATGGCCTCG TCACCCCGGA CGAGACGGTC
GTCGATCCCG CGCACGGCGG GATCGACATG TTCGTCAGCC TCGAACCGGG ATTCACGCCC
CGGGTCCCCG CCGGTGCCGC CCTGCGCCCC GCCTTCGAAA ACCTGCTCGA CCAGGCACTG
AGCTGGGAGT TCCCGGCCCA CCCCCACTTC GACGGCGAGG TGCGCCCGGG TGACCTGGCA
AAAGTGCACG CCCAGGTGCG CCGGGCCATC GAACATCCCG AGCACCGGAT CATGGTCGAA
TCCAGCGAGC GGCCCGTCAT GCGGAAGGTG GCCAACCCGC TGGGGCTCGG CGAGCAGAGC
GACCAGTACT TCGTCCTCGG CCACCACTGG GAACGGCACC TCGGCCGAAA GATCATCGAG
GCCGAGGCGG CGGGCCGGCC GGTCACGGTC GGAGACCTGC GTGCCTGGCT GGACGAGCCG
AAGCGGATGG GTCTGCCCCA GGAGATCGCG GACCTTGTTG TGCTGGTCTT CGCCGAGCAG
ACCAACCGCG GCATCCTCCA CGGCCGGCCG CTCGACGTGA GCATCCCGCG CCCGCTGCCC
GCGGACGCGC GCGTTGTCGC CGAGCCGCTA CCCGATCGGG ACACCTGGGA GGAGGCCCGC
GGCCGGGCGC ACGCCCTCTT CCTGATCACT GACATCACCG AACTGTGTAC CGCGCGCAAC
GTCGCCTTGC TGGCCACTCG GGTACGATCA GCCGTCGCGG ACCGGCTTCC GAAGGTTCGC
GCGCTGCACG CGGCCCTCAT CCGCCGAGGA CCGACCGTGC TCCGGGACGG CACCGACCCG
GCGACGGAAC GAGTCCGGAT CGCCCAGGGC GCGCTGCGTC TCTGTGCGGA CCTGGCGACC
ATCACCGACA ATCTGGCCCT GGTCGAGCGG CTCGCCGCCT TCGACCTCCC CGCGGTCCCG
CTGCACACCG GTCGGAGCCT GACCACCGCC GCCGATCTCG ACGCGGCCAT CCGGGAGGTG
GACTGGAACA TCTTCACCAC CGTGGCCGAC TGGGGACCCG AGCATCCGCG GGGCGCGGAG
GCAGCCGCGT TGGTGACCGA GCTCGCCGCG GTCTGGGCGG CCAACGAGTA CGTCCAGCAC
CTCCAGCCGG CGCTCACCAA GGCGGACAGG AAGGCTCGCG GGTTGCTGCT GGAATGGAGC
CGACGGGTCT CCCGCCCAGC GAGCAGAACC GGCACCATCA ACCCGCCGGG CGTGGCCACA
CCGGGCGTGG CGACACCGGG CGGCGTGGCA ACCAGCGGCG CCCGCGAGGT CGATCAGGAC
TCGGTTGCCG AGTTCACGTC CACGCTCGCG GCGCTCACCG CCCAGGGCTG GCGGATCAAA
GTCACCTGGC AGGCCGTGCG GTGA
 
Protein sequence
MTLISELLDI PEAVYPGDFV LDLSRGVTQI DRTLREYVVT PELAIRFGEA LGLIKSALTD 
GSSKAAYLDG SFGSGKSHFM AVLGALARNH PGARAIGELA PVITAFDGDV IGKQFLVVPY
HLVGKTSLEE AVLGGYLAHV RALHPDAPLP AVLIDKPLLD TAVALRGQLG DQAFFGLLGG
GDRSSAADPR WGTLNHGAWD ADRFEQALTA APAAQERRDL VGTLEKALVG FAELARGAAT
GYVNIDDGLA AISQHAAGLG YDGLILFLDE LILWFATRMA DHAWVANEAP KVAKLVEAAN
ADRPVPIISF VARQRDLREL VGTNLPGTEH LAFADSLKWW EGRFGSIKLS DNNLPLIMSK
RVLRPRSDAT RAQIDTSFAA MDRVRADIRA ALMTAHADRA AFRLTYPFSP AFVETLVALS
GFLQRERTAL RVMQQLLVDQ RDTLQLGQLV PVSDLFDALI TDASPISGEL GGLWRNAQKV
YGEIRRLILE SHGLTEETVT GVAPKHAVHM DDRIAKTLVL AALLPEVPSM RDLTARRIVA
LNHGYIRSMV PGQETGAVTQ VVRKWAARLG AVQVSGDDAS PLLSVRLEGV DIEGILENAK
GADSPGDRRL TVRTLLFDAL GATKPEGIDR ATIVTATWRG TKRNVELIYG YLRDPGDVGD
SMFTPTHNGW RLALDVPFDP DDHSPAEALD RVSRLRDALP ARTLCWVPRH FTAHTQWSLG
RYLRLEYAVG PSFDQLAGHL SDNDRAIAHQ QLTAQLDQLR SQLTNALLQA YGLVTPDETV
VDPAHGGIDM FVSLEPGFTP RVPAGAALRP AFENLLDQAL SWEFPAHPHF DGEVRPGDLA
KVHAQVRRAI EHPEHRIMVE SSERPVMRKV ANPLGLGEQS DQYFVLGHHW ERHLGRKIIE
AEAAGRPVTV GDLRAWLDEP KRMGLPQEIA DLVVLVFAEQ TNRGILHGRP LDVSIPRPLP
ADARVVAEPL PDRDTWEEAR GRAHALFLIT DITELCTARN VALLATRVRS AVADRLPKVR
ALHAALIRRG PTVLRDGTDP ATERVRIAQG ALRLCADLAT ITDNLALVER LAAFDLPAVP
LHTGRSLTTA ADLDAAIREV DWNIFTTVAD WGPEHPRGAE AAALVTELAA VWAANEYVQH
LQPALTKADR KARGLLLEWS RRVSRPASRT GTINPPGVAT PGVATPGGVA TSGAREVDQD
SVAEFTSTLA ALTAQGWRIK VTWQAVR