Gene Francci3_3781 details


Gene Information

Locus tag: Francci3_3781
Symbol:
ID: 3906066
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4530305
End bp: 4533361
Gene Length: 3057 bp
Protein Length: 1018 aa
Translation table: 11
GC content: 67%
IMG OID: 637881108
Product: hypothetical protein
Protein accession: YP_482861
Protein GI: 86742461
COG category: [S] Function unknown
COG ID: [COG1615] Uncharacterized conserved protein
TIGRFAM ID:
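
The length fields in this record are consistent with the listed coordinates: if Start bp and End bp are 1-based and inclusive, the gene length is End - Start + 1, and the protein length is the number of codons minus the terminal stop codon. A minimal Python sketch of that check follows. The function names are illustrative and not part of any database API, and the inclusive 1-based coordinate convention is an assumption that happens to match the values shown here.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS: codon count minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4530305, 4533361)
print(length_bp, protein_length(length_bp))  # 3057 1018
```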


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.568022
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0630741
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCAACCA CCAGTCGCCG CCCGGTGCTG ACCCGCACCA AGGTGCTCGT GCCGGTGCTG 
CTTGTCATCG TCCTCGCGAT TACGATCATC GCCATCTTCA CCCGGCTATA CACTGACCTG
CTGTTCTACC GCTCGATCGA CTTCAGCAGG GTGTTCACCA CTGTCCTCTA CACCCGGATT
TTGCTGTTCG TCCTCTTCGG CGTGGTAATG GCGATCATCG TCGGGACCAA CATCGTACTT
GCGTACCGGC TACGGCCGCC GCTTCGGCCG TTGTCGACCG AGCAGCAGAA CCTGGAACGC
TACCGGTCGG TGATCGAGCC GTATATGCTG CTGATCCTGC TCGCGGTCAC GACGCTGTTC
GGCCTGGCCG CGGGCCTGTC CGCCGCCGGC CAGTGGCGGA CGTGGCTGCT GTGGGTGAAC
GGCGAGTCTT TCGGGACCCC TGATCTGCAG TTTCACCGGG ATATCAGCTA TTACGCGTTC
AGCTATCCCT TCCAGCGGTT CCTGCTCGGC TTCCTGCTGA CCGCGGTGCT GCTGTCGTTG
CTCGTCACCC TGCTGACCCA CTACCTGTTC GGCGGAATCC GGATGCAGAC CACGGGGGAA
CGGGTGACCC CGGCGGCGAA GGCGCACATC TCGGTGCTAC TCGGGCTGCT CGCCCTGCTT
AAGGCGTGGG CGTACTACCT CGACCGGTTC GGCTCGGTGT TCTCCTCGCG CGGGGTGGGG
ACCGGTGCGT CCTACACCGA TGTGCACGCC GTGCTGCCGG CGAAGCTGAT CCTGCTGTTC
ATCTCGCTGG CCTGCGCGGT GCTGTTCATC TACAACATCT TCCAGCGGGG CTGGACGCTG
CCGCTGCTCG GCGCGGGCAT CCTTGTCCTG TCCTCGGTGG TGATCGGCGG GATCTACCCG
GCGATCGTCC AGCAGTTCCA GGTCCGGCCG AACGAGGCGA CCCGCGAGGA GCCCTACATC
GCGCGCAACA TCGCGGCGAC CCGGGCCGCC TACGGCATCC AGGACGTCGA GCCGCAGGAC
TATGCCGCCA GCACCGACGT CACCGCCCAG CAGGTCGCCG CCGACACCGG CACGGTCCCC
AACATCCGTC TACTCGACCC CAGCAAACTC TCCCGGACGT TCCAGCAGCT CCAGCAGTTC
CGCGGCTACT ATGGTTTCCC GCCGACGCTC GACGTCGACC GGTACACCGT CACAGGGGCG
GACAAGAAGT CGACGACCCA GGACTACGTG GTATCGGTGC GGGAGCTGAA CCAGGCCGGG
CTGGGAACGC AGCAGCGCAA CTGGATCAAC GAGCACCTGA CCTATACCCA CGGCAAGGGA
TTTGTCGCGG CGCCGTCGAA CACCGTCGAC GTGGGGCGCC CCGACTTCGA TCACGGCGTG
AGCGGTTTCC CGCAGGACGG TACCTTCGGT ATCAAGGAGA ACCGGGTCTA CTTCGGGGAG
ATGTCGCCGT CGTATTCCAT CGTCGGCACC AGGCAGATGG AGATCGATGG GCCGGGGCCG
GGAGAAACCC AGGTCACCAC GACCTACCAG GGCGACGGTG GGGTCTCGAT CGGCTCGACC
TTCCGTCGGG CGCTGTTCGC GCTGCGGTTC GGGGAGAAGA ACATCCTGCT CTCCGGTGAC
ATCACGAAGA ACTCCCGGAT CCTGGACGAG CGAAACCCGC GGGACCGGGT GAGCAAGGCA
GCGCCCTGGC TCACCCTCGA CGGAGACCCG TACCCGGCGA TCGTCAATGG CCGGGTGACC
TGGATTCTCG ATGGTTACAC AACCTCCGAC GGCTATCCTT ACTCCGCCCG GCGCACCCTG
GGTGACGTGA CCGCCGACTC GGTGACCACC CAGAGTGGCA ACCGCACCCG GCAGGCCTCG
AACCAGGTCA ACTACATCCG TAACTCGGTG AAGGCGACGG TGGACGCCTA CAACGGCACC
GTCACCCTCT ACGCCTGGGA CGAGTCCGAC CCGGTGCTAC GGACCTGGAT GAAGGCGTTC
CCGGACACGG TGAAGCCGAA GAGTGACATC CCGCCGTCCC TCCGAGCACA CTTGCGTTAT
CCGGAGGACC TGTTCAAGGT CCAGCGCGAT CTGATCGGGC AGTACCACAT CTCCAACCCG
CGGGACTTCT ACTCCCAGGA GGACTTCTGG GATGTGTCCG ACTCGCCGGA TGGAAGCGGG
CAACCCCAGC CGCCGTTCTA CGTCTACAGC CAGCTTCCCG GGCGCAAAGA CCCGTCCTAC
AACCTCACCT CACCGTTGAT CTCCGCCCGG TCCTCGAAGC TCGCGGCGTA CATGGCGGTG
TCCAGTGACC CCGCCAACTA CGGCCGGTTC ACCCTGCTCA AACTGCCGGC GGGGAACACG
ATCAACGGTC CGGTCCAGGT GCAGAACGCC ATCGAGGGCA ACGGCGACGT GGCCAAACAG
CTGTCGCTAT GGCGCAGCGG GGGAGCGACC ACGATCGAGG GCAACCTGCT GACCCTGCCC
CTGGCCGGCG GCCTGCTCTA CGTCGAGCCC TACTACGTCC AGGCCAGAGG GAGCACCGGC
TACCCGACCC TGCAGGGGGT CGCCACGGCC TTCGGCGACC GCATCGGTTT CGGTGCCTCG
CTGGGTGAGG CCCTGGACAA GGTCTTCGGT GCCGGTGCCG GTGCCGCGGC GGCCGGGGCC
GGCATCGGGG CCACGACCAC GGACGCCGGT CAGGACGGCA CGCCGGCTCC CAGATCCGGG
CAGGGCGGGG CCGGCGTCCC GCCGCCGGGG CAGACCGCGC TGCAGGACGC CGTGGGCGAC
GCCGACCGCG CCTACCAGGC GGGGCAGGAC GCCCTGCGCA AGAGCCCGCC GGACTTCACC
GCCTACGGGA AGGCCCAGAG CGAGCTGGCT GATGCGCTGG GTCGCCTGCG TACTCTGTCG
AGTCCGGCGG CGACACCGCC CGCTGCCACG GCGACACGGG CCGGTGCGTC GGTGCCCGCG
TCACCGGTCC CCGCGTCACC GGCAGCCAAA CCACCGGCTC CGTCTCCTTC GGCGACCGTG
GCCGGCGGCG ACACTCCCGG TCCACCGGGG GCCCGGGCCG CCCCGGCGCC CGGCTAG
 
Protein sequence
MATTSRRPVL TRTKVLVPVL LVIVLAITII AIFTRLYTDL LFYRSIDFSR VFTTVLYTRI 
LLFVLFGVVM AIIVGTNIVL AYRLRPPLRP LSTEQQNLER YRSVIEPYML LILLAVTTLF
GLAAGLSAAG QWRTWLLWVN GESFGTPDLQ FHRDISYYAF SYPFQRFLLG FLLTAVLLSL
LVTLLTHYLF GGIRMQTTGE RVTPAAKAHI SVLLGLLALL KAWAYYLDRF GSVFSSRGVG
TGASYTDVHA VLPAKLILLF ISLACAVLFI YNIFQRGWTL PLLGAGILVL SSVVIGGIYP
AIVQQFQVRP NEATREEPYI ARNIAATRAA YGIQDVEPQD YAASTDVTAQ QVAADTGTVP
NIRLLDPSKL SRTFQQLQQF RGYYGFPPTL DVDRYTVTGA DKKSTTQDYV VSVRELNQAG
LGTQQRNWIN EHLTYTHGKG FVAAPSNTVD VGRPDFDHGV SGFPQDGTFG IKENRVYFGE
MSPSYSIVGT RQMEIDGPGP GETQVTTTYQ GDGGVSIGST FRRALFALRF GEKNILLSGD
ITKNSRILDE RNPRDRVSKA APWLTLDGDP YPAIVNGRVT WILDGYTTSD GYPYSARRTL
GDVTADSVTT QSGNRTRQAS NQVNYIRNSV KATVDAYNGT VTLYAWDESD PVLRTWMKAF
PDTVKPKSDI PPSLRAHLRY PEDLFKVQRD LIGQYHISNP RDFYSQEDFW DVSDSPDGSG
QPQPPFYVYS QLPGRKDPSY NLTSPLISAR SSKLAAYMAV SSDPANYGRF TLLKLPAGNT
INGPVQVQNA IEGNGDVAKQ LSLWRSGGAT TIEGNLLTLP LAGGLLYVEP YYVQARGSTG
YPTLQGVATA FGDRIGFGAS LGEALDKVFG AGAGAAAAGA GIGATTTDAG QDGTPAPRSG
QGGAGVPPPG QTALQDAVGD ADRAYQAGQD ALRKSPPDFT AYGKAQSELA DALGRLRTLS
SPAATPPAAT ATRAGASVPA SPVPASPAAK PPAPSPSATV AGGDTPGPPG ARAAPAPG
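
The gene and protein sequences above are related through translation table 11, and the GC content field can be recomputed from the gene sequence. The sketch below shows both steps in plain Python. It is a simplified illustration, not the database's own pipeline: the helper names are hypothetical, the amino acid assignments are those of the standard code (which table 11 shares), and only ATG, GTG and TTG are treated as start codons read as methionine, which is a deliberate simplification of the full table 11 start set.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Simplified start-codon set (assumption): read as Met at position 0 only.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        codon = s[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with Met
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("GTGGCAACCA CCAGTCGCCG CCCGGTGCTG"))  # MATTSRRPVL
```

Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and protein sequence fields to be cross-checked against the record.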