Gene Francci3_0017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0017 
Symbol 
ID3903592 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp22428 
End bp25316 
Gene Length2889 bp 
Protein Length962 aa 
Translation table11 
GC content68% 
IMG OID637877347 
ProductCRISPR-associated helicase Cas3 family protein protein 
Protein accessionYP_479140 
Protein GI86738740 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGCTGT CCATTATATC TGGGATGGGA GTCTTCATGG ATGATGAGTT CTTCGGTATG 
ATATGGGGGA AGTCGGCGGA GAAGGCGGGG GGATCGATGC ACCTGCTCCT CGGCCATCTT
CTGGATACGG CGGCGGTGGG CGAACTTGTC TGGGATCGTT TCCTCGCATC CACCATTCGG
GATCGTCTGG ATGATTGCAG CGATGGTCGC GGGCGGTCTC TGTTCGCGCT GTTGTGCGGC
CTGCATGATG TCGGTAAGGC CACGCCGGCC TTTCAGATGA AGGACGAGGG TTTGGCGCAG
CGCGTTCGGG CTGCCGGGCT TGGCTGGCGA GGGGTGACTC CTCAGCAGGG CCGGCAGTGG
CATCACGCGC GGGCGGGGGC TGTCATCGTC CGCAAGTATC TCCCGCAGGT CGGTTGGAGT
CGGCCGGGGT GTGACTGGGT GTGGCCGCTG GTCGCGGGCC ATCACGGCCT GATACCGGAC
CGTGGCCGTC TTGTGCACAA GCCCGCCGTC CACGGCGCCG GACCGTGGCT CGACGTCCAA
CGTGCGTTCG TCGACCGGGT GGCCGGCGAC CTCAACGTTG ACCTGGCTTC CTTTTCGGAG
CTGCGAACAC CATCCCGCGG AGGTCAGCTT GCCCTATCCG GGATGATCAT CATGGCAGAC
TGGGTCGCGA GCGACAAAGA GCATTTCGGT GGGCTGTCGG ATCTGGCGGA GATCTCGATG
AAGGGCTCGC GGGAGCGTGC ACAGCGAGCC TGGGCACAGT TGGGTCTGCG TGGCGGGTGG
CGCTCCGACC GGCCCGCACC CGGCAACCAG TCTGACCTGG TGCACCACAG ATTCGGTAAG
CCAGCCCGGC CGGCGCAGCG CGCCGCGGTC CAGGCCGTGC GGGAAATGCC CGGCCCCGGG
CTACTGATCC TGGAAGCGCC GATGGGTGAG GGGAAGACGG AGGCGGCGCT CGCCGCCGCG
GAGGTACTGG CCGGCAAGGT CGGTGCGGAT GGGGTGTTCG TCGGAATGCC GACGCAGGCG
ACGAGCGATC CGATGTTCAG CCGGGTCCGC GGCTGGCTCA CCGCGGTGGA TCCCGAGGTT
CCGATCGGCC TACTGCACGG CCGGGCTCGT TTCAACAAGG AGTGGGCGGC ACTTCGGTCG
CAGGTCCGGT TCGTCGACGT GCATGACGAT CTCGACGAGT ATGGGATGGC CGACGATTTC
GGTACGGGTA CGAGCGGCCC GGGTCGTCGC GACGCACCGC TGGCGGGCGC CGCGGCGGCC
GCGGAGTGGT TCTTCGGATC GAAACGGGGG TTGCTCGCTG CGGTGACGGT CGGCACGGTG
GATCACCTGC TGCAGGCGGC GACCCGGACG AAGCATGTGA TGCTACGGCA CGCCGGCCTG
GCCGGCCGGG TCGTCATCCT TGACGAGGTA CACGCCTACG ACGTCTACAT GGCGCAGTTC
CTGTTCGAGG CGTTGCGCTG GCTCGCCGAC ACCGGCGTGC CGGTCATCGT TCTGTCCGCG
ACGCTGCCGC CGGTACTGCG TGCACAGCTC GCCGGCGCCT ATCTGCAGGG CGCCCTGCAG
CGCCCCGATG TCGACCTCGC CGATCTGCCG CGGCCCACCG GCTACCCGAG CACCACCGCG
GTGTATATGG CCGACGGGAA GCCCCGGTTT TCCGTGACGT CCCAGTCACC CTGGCGTGAG
TCCGTGCCCG TTGCGGTAGA GATCCTGGCA GAGCCCACCG ACTTCGCCCC GGTGAGCATC
GCCGCCGCTG TCTCGGCGGA GGTGAGCGAG GGCGGCTGCG CTCTCGTCGT GTGTAACACC
GTCCAGCGGG CCCAGGTCGT CTACGCCGAG CTGCGGACGA CGTTCGGTGA TGACGTGGTG
CTGCTGCACT CCCGGTTCAC CGCAGCCGAA CGCGCGAGCC GTACGGAAGA CGTCGTGGAC
AAGCTCGGCC CCCCCGGACG GGAGAACGCC CAGGACCGGC CCGCGCGGCT CGTCGTCGTG
GCCACCCAGG TGGCCGAACA GTCGTTCGAC GTCGATGTGG ACATCCTCGT CACCGATCTC
GCCCCCATCG ACCTTCTGCT GCAACGGGTC GGCCGGCTGC ATCGGCATGA CCGGCCGACC
GGGCAGCGAC CGGCCCGGCT TCGCGACCCG AAAGTGATCG TCAGCGGTCT GCGGCTGCCC
GACGGGGCTG TGCCACTCTG GCCCGCCGGT AGCCGGGCCG TGTACGGGGA TCATCTACTG
CTGCGCTCCG CCGCTCTCGT CGCCGAAGCC GCCACGGGTG GTGGATGGTC GATTCCCGCC
GACGTGCCCG GCCTCGTCGC TCTCGGCTAC GGTGATGAGC CTCTCGGTCC GGCGCGCTGG
GCTGATGCCG CGGCCGAGGC CAGGCAGGAA TGGGAGGCCA AGGAGCGTCG TCGCCGGGTG
AACGCCACCG GATTTCTGCT TTCCGGCGAG GACGGTCTGG GCCTGACCAC GCTCGAGGGT
CTGCACGACA AGGCGACCGC GCCGCTGGAA ACCGAGGAAC GCCTCACCGC GGTCGTCCGT
GACAGCGACG AGTCCGTGGA GGTCGTCCTC GTCCGCCGCG GACCGGCCGG GTATCTCACG
CTCGCGGGCC GATCGCTCGG GCACATCGGC GAAGTTGCCG TGTCTGACGA CGCGGTCCTC
GAAGAGGTCG TCGGAGCCAC CATCCGCCTG CCCGCGAACA AGGAGATCAC CGCCGCGGCG
CATGCCGAAC TGACTCCCCT GGCCGGCTGG GACGGCGACT CCTGGCTCCG CCGCACCCGC
GCCCTGATTC TCGACGATCG CATGTCGGCG ATCCTCGGGG GGCGACACCT GACCTATGAC
AACAAGCTCG GGCTGAGCCT CGCGCCTCGA CCGGCTGCTG ATACTCGACC GGCTGCTGGT
AAAGGGTGA
 
Protein sequence
MGLSIISGMG VFMDDEFFGM IWGKSAEKAG GSMHLLLGHL LDTAAVGELV WDRFLASTIR 
DRLDDCSDGR GRSLFALLCG LHDVGKATPA FQMKDEGLAQ RVRAAGLGWR GVTPQQGRQW
HHARAGAVIV RKYLPQVGWS RPGCDWVWPL VAGHHGLIPD RGRLVHKPAV HGAGPWLDVQ
RAFVDRVAGD LNVDLASFSE LRTPSRGGQL ALSGMIIMAD WVASDKEHFG GLSDLAEISM
KGSRERAQRA WAQLGLRGGW RSDRPAPGNQ SDLVHHRFGK PARPAQRAAV QAVREMPGPG
LLILEAPMGE GKTEAALAAA EVLAGKVGAD GVFVGMPTQA TSDPMFSRVR GWLTAVDPEV
PIGLLHGRAR FNKEWAALRS QVRFVDVHDD LDEYGMADDF GTGTSGPGRR DAPLAGAAAA
AEWFFGSKRG LLAAVTVGTV DHLLQAATRT KHVMLRHAGL AGRVVILDEV HAYDVYMAQF
LFEALRWLAD TGVPVIVLSA TLPPVLRAQL AGAYLQGALQ RPDVDLADLP RPTGYPSTTA
VYMADGKPRF SVTSQSPWRE SVPVAVEILA EPTDFAPVSI AAAVSAEVSE GGCALVVCNT
VQRAQVVYAE LRTTFGDDVV LLHSRFTAAE RASRTEDVVD KLGPPGRENA QDRPARLVVV
ATQVAEQSFD VDVDILVTDL APIDLLLQRV GRLHRHDRPT GQRPARLRDP KVIVSGLRLP
DGAVPLWPAG SRAVYGDHLL LRSAALVAEA ATGGGWSIPA DVPGLVALGY GDEPLGPARW
ADAAAEARQE WEAKERRRRV NATGFLLSGE DGLGLTTLEG LHDKATAPLE TEERLTAVVR
DSDESVEVVL VRRGPAGYLT LAGRSLGHIG EVAVSDDAVL EEVVGATIRL PANKEITAAA
HAELTPLAGW DGDSWLRRTR ALILDDRMSA ILGGRHLTYD NKLGLSLAPR PAADTRPAAG
KG