Gene Francci3_0194 details


Gene Information

Locus tag: Francci3_0194
Symbol:
ID: 3903221
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 228556
End bp: 231681
Gene Length: 3126 bp
Protein Length: 1041 aa
Translation table: 11
GC content: 67%
IMG OID: 637877525
Product: hypothetical protein
Protein accession: YP_479314
Protein GI: 86738914
COG category: [V] Defense mechanisms
COG ID: [COG0610] Type I site-specific restriction-modification system, R (restriction) subunit and related helicases
TIGRFAM ID:
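
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a CDS length that includes the stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Amino-acid count for a CDS whose length includes the stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length = gene_length(228556, 231681)
print(length)                  # 3126
print(protein_length(length))  # 1041
```

Both values agree with the Gene Length and Protein Length fields listed in the record.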


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.394359
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACT ACCACGAGGT CGTCTTCGAG TCCGAGATCT GCGCGTACCT GGAGGCTCAT 
GGGTGGCTGT ACTCGGCGGG CGACTCCGGG TATGACCGTG AGCGGGCGCT CTTCCCCGCG
GATGTGTTCG GCTGGTTGGA GGAGACTCAG CCGGCGGCGT ACGGGAAGGC GTTGAAGGCG
GCCGGGTCGG CGGCGAAGTT CCTCGATGTG CTGGCCACGG CGCTCGACAG GCCGCTGGAG
CACGGCGGCG GGACGTTGAA CATCCTGCGC AACGGGGTCG TCTACATCGG TGGTGGCCGG
TTGAAGCTGG CGCAGTTCCG GCCGGAGACC AGCCTGAACG CGACGACGGT GGCGCAGTAT
GCGGCGATGC GGGTGCGGGT GATGCGGCAG GTGCGCTTCT CCACCGCCGA TCAGCGCAGC
ATCGACCTGG TGTTCTTCGT CAACGGGCTG CCGGTGGCGA CGGTGGAGCT GAAGACGGAC
TTCACGCAGT CGCTGGACGA GGCGATCAGC CAGTACCGCA AGGACCGGCG CCCGGTCACC
AACGGCCGGG CGGAGCCGTT GCTGTCGTTC GGGCATCGGG CGCTGGTGCA CTTCGCGGTC
TCCAACGACC TGGCGGCGAT GACCACCAGG TTGGAGGGGG AGAAGACGCA CTTTTTGCCG
TTCAACATCG GCCACGACGG CGGGGCGGGG AACCCGCCAG GTGCCGAGGG GCGGTCGGCG
ACGGCGTACC TGTGGGAGCG GGTCTGGGAG AAGGACGCCT GGCTCACCAT CGTCGGGCGG
CTGATGATCG TGGAGACCCG GGAGGAGTGG GACGTCGCGA CGGGGACGTC GGTGCGACGT
ACCAGCATGC TCTTCCCGCG GTTCCACCAG TGGGAGGCCG TGACGACCAT CGTCGACGCC
GTACGCGAGG AGGGCGTCGG CCACCGGTAC CTGATCGAGC ACTCGGCCGG GTCGGGGAAG
ACGAACACGA TCGCCTGGAC CGCACACCGG CTGGCGCGGC TGCACGTGGA CGACGAGAAG
GTCTTCGACA CGGTCCTCGT GGTCGTGGAC CGGACGGTGC TTGACGGGCA GCTTCAGGAG
GCGATCCGGC AGATCGACGG GTCCGGCAGG ATCGTGGCCA CGATCAGCCC GGAGGACGTC
CGCAAGGCCG GCGCGACGTC GAAGTCCGGG CTGCTGGCCG CCGCGCTGCG GAACGGCGAG
CTGATCATCG CGGTGACGGT GCAGACGTTC CCGTTCGCGA TGGACGAGAT CCAGGCGGAC
AAGGGGCTGA CGGGCAGGAA GTTCGCCGTG ATCGCCGACG AGGCGCACTC GTCGCAGTCC
GGCAGGATCT CCTCCCAGCT GAAAGCCGTG CTCACCGCCG AGGAGATCAA GGACCTCGCC
GACGGCGGCG AGGTCGCCCT GGAATCGCTT CTGGCGGCGC GGATGAGCGA GCGGGCCGAC
TCGCCGAACA TCTCCTACTT CGCGTTCACC GCGACCCCGA AGGCCAAGAC GCTGGAGATG
TTCGGGCGGA AGGGGCCGGA CGGGAAGCCG GTCGAGTTCC ACCTGTACTC GATGCGGCAG
GCGATCGAGG AGGGGTACAT CCTCGACGTT CTGCGTGGCT ATCAGTCGTA CGACACCGCG
CTGAAGATCG CCGGTAAGGC CACCGCCGAC AGCGAAGTCG AGGAAAGCGC CGCGCGTAAG
GGGCTGATGC GGTGGGTGAA GCTGCACCCG ACCAACATCA GCCAGAAAGT CCAGATCATC
GTCGAGCACT TCCACGCCAA CGTCGCCCAC CTGCTGGAGG GCAGGGCGAA GGCGATGGTC
GTCACCGACT CGCGCAAGGC CGCGGTGAAG TACAAGAAGG CGATCGACGC CTACATCGCT
CGCCGGGTCG CGGAGGATCC GTCGTACACC TACCGCACGC TGGTCGCCTT CTCCGGGTCG
GTGACGATGG ACGAGAACGA GGTGTGGACC TCGGAGTGGG GGCCGGTGCC GGCGGAGGAC
GTCGAGTTCA CCGAGACCAA CCTCAACCCC GGCGCCGGCG CGGACCTGGC CGCGGCGTTC
AAGGGCGGGA CCTACACGAT CATGCTGGTG GCCAACAAGT TCCAGACCGG CTTCGACCAG
CCGCTGCTCT CGGCGATGTA CGTCGACAAG AAGCTCTCCG GGGTCACCGC CGTGCAGACG
CTCTCCCGGC TCAACCGCAC CCACCGCACC GCGGGCGGGG AGATCAAGCG CACGACGTTC
GTCCTCGACT TCGTGAACAC GCCCGACGAC ATCCGGGCCG CGTTCGAGCC GTACTTCACC
GGTGCGACCC TGGAGACCGA GACCGACCCG TACGTCGTCG CCCACCTCGC CGCCAAGCTC
GCCCAGACGG GGATCTACAC CGCCGACCAG GTACGGAACG TCGCCGAGTT GTGGGTGAAG
CGGAAGGGTA ACAACGCGCT CTCGGCCGCG ATCGCTCCGG CGAAGCACGA GTTCGCGAGC
CGCTACGCCG CCGCGATCGA GGCCGATGAC AAGGTCACGC TCAGCACCCT CGACCTGTTC
CGTCAGGACG TCTCCACCTA CGTCCGGCTC TACGACTTCA TGAGCCAGAT CGTCGACTAC
GGCGACCCGC ACCTGGAGAT GCTCTCCATC TTCCTACGCC TCCTGGAGAA GGTCATCGCC
GACTCCTCCT GGGCCGCCGA GGTCGACCTC TCCGACGTCG TCCTGGTCGG GGTCAGACAC
GAGAAGCGGA TCGCCGTCGA CATCTCGCTG ACCGGCGACG GCGAGCTCAA GGGAATCAGC
GCCGCCGGAA CCGGCGCCCG CAAGGAGCCC AGGTACGTCG CGCTCCAGGT CGTGATCGAC
AAGATGAACG ACCTCTTCGG CGCCGAGTCC TTCACCGAGT CGCAGATCCG CGAGTTCGTC
GACGGCCTGG TCCAGCGACT CCTCGCCTAC CCCGACCTCG TCAGGCAGAC CCAGGTCAAC
TCGAAGAAGC AGTTCATGGA CTCCGACGAC TTCAAGGCCG TCGTCACCGA GGCCGTCCTC
GACAACCAGG AAGCCCACAA CACCATGGCC GACTACTTCT TCAGCGACGG CCCTGGGATC
AACAGCGTCA TCCTTGCCCT CGCGGACGCC TTCTACGAGG TCGCCACGTC ACAGGAGACC
GACTGA
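
The gene sequence is printed in space-separated blocks of ten bases, so the whitespace has to be stripped before any computation such as GC content. The following is a minimal sketch under that assumption; the helper names are hypothetical, and only the first row of the sequence is used as sample input, so the result is not expected to match the 67% reported for the full gene.

```python
def clean_sequence(blocks: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(blocks.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First row of the gene sequence, used here only as sample input.
first_row = "ATGGCCGACT ACCACGAGGT CGTCTTCGAG TCCGAGATCT GCGCGTACCT GGAGGCTCAT"
seq = clean_sequence(first_row)
print(len(seq))                    # 60
print(f"{gc_fraction(seq):.2f}")   # 0.60
```

Passing the complete gene sequence through `clean_sequence` and `gc_fraction` would give the gene-wide value to compare against the GC content field.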
 
Protein sequence
MADYHEVVFE SEICAYLEAH GWLYSAGDSG YDRERALFPA DVFGWLEETQ PAAYGKALKA 
AGSAAKFLDV LATALDRPLE HGGGTLNILR NGVVYIGGGR LKLAQFRPET SLNATTVAQY
AAMRVRVMRQ VRFSTADQRS IDLVFFVNGL PVATVELKTD FTQSLDEAIS QYRKDRRPVT
NGRAEPLLSF GHRALVHFAV SNDLAAMTTR LEGEKTHFLP FNIGHDGGAG NPPGAEGRSA
TAYLWERVWE KDAWLTIVGR LMIVETREEW DVATGTSVRR TSMLFPRFHQ WEAVTTIVDA
VREEGVGHRY LIEHSAGSGK TNTIAWTAHR LARLHVDDEK VFDTVLVVVD RTVLDGQLQE
AIRQIDGSGR IVATISPEDV RKAGATSKSG LLAAALRNGE LIIAVTVQTF PFAMDEIQAD
KGLTGRKFAV IADEAHSSQS GRISSQLKAV LTAEEIKDLA DGGEVALESL LAARMSERAD
SPNISYFAFT ATPKAKTLEM FGRKGPDGKP VEFHLYSMRQ AIEEGYILDV LRGYQSYDTA
LKIAGKATAD SEVEESAARK GLMRWVKLHP TNISQKVQII VEHFHANVAH LLEGRAKAMV
VTDSRKAAVK YKKAIDAYIA RRVAEDPSYT YRTLVAFSGS VTMDENEVWT SEWGPVPAED
VEFTETNLNP GAGADLAAAF KGGTYTIMLV ANKFQTGFDQ PLLSAMYVDK KLSGVTAVQT
LSRLNRTHRT AGGEIKRTTF VLDFVNTPDD IRAAFEPYFT GATLETETDP YVVAHLAAKL
AQTGIYTADQ VRNVAELWVK RKGNNALSAA IAPAKHEFAS RYAAAIEADD KVTLSTLDLF
RQDVSTYVRL YDFMSQIVDY GDPHLEMLSI FLRLLEKVIA DSSWAAEVDL SDVVLVGVRH
EKRIAVDISL TGDGELKGIS AAGTGARKEP RYVALQVVID KMNDLFGAES FTESQIREFV
DGLVQRLLAY PDLVRQTQVN SKKQFMDSDD FKAVVTEAVL DNQEAHNTMA DYFFSDGPGI
NSVILALADA FYEVATSQET D