Gene Francci3_3341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3341 
Symbol 
ID3904127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3963334 
End bp3965589 
Gene Length2256 bp 
Protein Length751 aa 
Translation table11 
GC content66% 
IMG OID637880666 
ProductCRISPR-associated helicase Cas3 family protein protein 
Protein accessionYP_482427 
Protein GI86742027 
COG category[R] General function prediction only 
COG ID[COG1203] Predicted helicases 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01587] CRISPR-associated helicase Cas3 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.127091 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.741402 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGGCAC ACAGTATGAA CACGTCCGGC ATGCGCCACC GACTTGTCGA TCATCTTGAG 
GGCACCGCAC TGCTCGCTGA GCGTTTCGCC GCGGTGTTCG GTGCCGGTGA GGTGGGCCGC
TTCGCAGGAT TGGCGCATGA CGTGGGCAAG GCATCCTGTG CCTGGCAGGA CGGTCTGTTG
CGGGCGGAGG CGACGGGCGT CCGCGTAGGT GAAGATCATA AAACGTTGGG CGTTCTTCTC
GCGGAGCGGC GTGGGGCCTT CCCGGTGCTC GGTTTGCCTT TGCATGGTCA TCACGGCGGG
CTGACCAATC CGTCCGAGGT ACGCTCGATG ATCAAAGATC GAAGCAAACC GCGAGATGTA
CAGGACCGGG CGGAGGCGGA GGCAGCGCTA CGTCCGCTGC TTCCTGGGTT GTTCGACGGG
CCGCCGGTGC GTCTCCCGGT CGGGTTCGAT GAGCCGGGTA CCAGCGAGAT GCTGATGCGC
TTCCTGTTCA GCGCGTTGGT CGACGCGGAC GGGCTGGACA CGGCGGCTCA TCGGTCCGGT
GGCCAGCCGC AGGTTGCGGC TCCCGCGGAC ATGGCGATGT TGTGGGATCG GTTCGTCGGT
CGGCGTAAGG AGATGCTGGC TGGTCGGCCC CCCGCCCCGG CTGCGGACCG GTTGCGCGGT
GAGGTGTATG AAGCGTGCCT GGCCGCGGCG GCCGGTCCGA CGGGAATCTA CCGGCTGCCG
GCGCCGACGG GATCGGGGAA GACGATCGCG GCGGCCGGGT TCGGCCTCCG GCATGCCCTC
GAGCAGGGAA AGTCCAGGGT GATCGTCGCG GTGCCCTTCA TCACGATCAC TGAGCAGAAC
CATGCAGTGT ACCGGCGTCT GTTGGACCCG GTCGGGGCGG GCAAGGGCGC TCCGGTGGTG
TTGGAGCATC ACTCGAACGT CACTGTCGAT GACGACGGTC CGGCGCAACG GTGGCGCAGG
CTGGCGGCGG AGAACTGGGA TGCGCCATTC GTCGTGACGA CCACCGTGCA GCTGTTCGAC
TCGCTTTTCG GCCGCAAGCC GGCGGCAATG CGCAAGGTGC ACCGTCTCGC CAACTCCGTG
ATCGTCCTGG ATGAGGTGCA GGCGTTGCCG GCGCCGCTGC TTCTCCCGAT CGTGGACGGG
TTGCGGACGC TGACGGCGCG GTTCGGGACG ACGGTGTTAC TGGCATCGGC GACACAGCCG
GAGCTGTGGG CGCTGGATCC GTTGGAGTCG ATCACGCCGA TCGAGGTGAT CTCCAATCCG
GAATCCTTGT ATAAGACGTC ATGTCGGGTT CGGTACGAGT GGTGGCCCGA TCCGAAACCG
ACGTTGGAGC AGGTCGCTGA TCGGGTTGCC GATCCGGCGG CTGCCGGTGC AGACCAGTCG
CTGACGATCG TGAATACGGT CGCGAATGCC CGCCAGATGC GGGACCTGGT TGCTGCCCGT
GTACCGGCGA CCACCCCGGT GCTGCATCTG TCGACGGCAA TGTGCCCGGC GCACCGTCGA
GAGGTACTCG CGGAGGTGAA GGAACGGCTC ACCGCGCGGC TACCCGTCCG GCTGGTGAGC
ACCCAGCTGA TCGAGGCCGG CGTCGACCTG GATTTCCCAG TGGTGTTCCG GGCGATGGCC
CCGGCGGACT CGTTGCAGCA GGCCGCGGGC AGGGCGAACC GGGAGGGGCA TCTCGGCCCG
AAAGGCGGCT TGGTGGTCGT ATTCGACCCG GCCGATGGCG GCCGTCCACG CTCCTACGAC
CTTCCCGTCG GCATCACCGC CCGGTACATC GGACTGGGCC GCGCCGACCC AGATGATCTC
AAGGCACTAC GCCTCTACTA CCACAGCTAC CTGCAAAATC CGCGGGTGTC AGGTAGGGAT
AGCAGGGGAG CGGCGGTGCA GATCAGCCGA GCGGCCCTCG ACTTCCGCGC GGTAGCGGAA
GGTCCGAAGC GCACAGGGGA AGACCCGAGA CCGGACAGAT CGAAGGCATT CCGAATGATC
GATGAGGACA CCGTCCCGGT CGCCGTGCCT TACCAGGGTG AGGAGGAACG GGTCCGTCAG
CTCGTCGAGC GGATTCGCAT GGTCCCGCTT CCCGAGCCTC GCCTCTTCCG CGATCTTCAA
CCCTACTTGG TCATGATCCA ACGCCGCACC CGGGACCGCG CCGACGTCGC CACGCTGTGC
CAGCCCGTCT TCGGGGACCT CGTCGAATGG GCCGGCAGCT ATGACAAAAA CACCGGCATC
GTTCTTGAAC CGTCAGGAGA GGAGTTCATC GCATGA
 
Protein sequence
MWAHSMNTSG MRHRLVDHLE GTALLAERFA AVFGAGEVGR FAGLAHDVGK ASCAWQDGLL 
RAEATGVRVG EDHKTLGVLL AERRGAFPVL GLPLHGHHGG LTNPSEVRSM IKDRSKPRDV
QDRAEAEAAL RPLLPGLFDG PPVRLPVGFD EPGTSEMLMR FLFSALVDAD GLDTAAHRSG
GQPQVAAPAD MAMLWDRFVG RRKEMLAGRP PAPAADRLRG EVYEACLAAA AGPTGIYRLP
APTGSGKTIA AAGFGLRHAL EQGKSRVIVA VPFITITEQN HAVYRRLLDP VGAGKGAPVV
LEHHSNVTVD DDGPAQRWRR LAAENWDAPF VVTTTVQLFD SLFGRKPAAM RKVHRLANSV
IVLDEVQALP APLLLPIVDG LRTLTARFGT TVLLASATQP ELWALDPLES ITPIEVISNP
ESLYKTSCRV RYEWWPDPKP TLEQVADRVA DPAAAGADQS LTIVNTVANA RQMRDLVAAR
VPATTPVLHL STAMCPAHRR EVLAEVKERL TARLPVRLVS TQLIEAGVDL DFPVVFRAMA
PADSLQQAAG RANREGHLGP KGGLVVVFDP ADGGRPRSYD LPVGITARYI GLGRADPDDL
KALRLYYHSY LQNPRVSGRD SRGAAVQISR AALDFRAVAE GPKRTGEDPR PDRSKAFRMI
DEDTVPVAVP YQGEEERVRQ LVERIRMVPL PEPRLFRDLQ PYLVMIQRRT RDRADVATLC
QPVFGDLVEW AGSYDKNTGI VLEPSGEEFI A