Gene Francci3_3344 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3344 
Symbol 
ID3904130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3968225 
End bp3969112 
Gene Length888 bp 
Protein Length295 aa 
Translation table11 
GC content67% 
IMG OID637880669 
ProductCRISPR-associated Csh2 family protein 
Protein accessionYP_482430 
Protein GI86742030 
COG category[L] Replication, recombination and repair 
COG ID[COG3649] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR01595] CRISPR-associated protein, CT1132 family
[TIGR02589] CRISPR-associated protein, Csd2 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00748339 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.380655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTTCA ACCTTGATCC CGAGAAGAAG CACGACATGG TGCTGCTGTT CGACGTCACC 
GACGGCAACC CCAACGGTGA CCCGGACAAC GGAAACCGGC CCCGCACCGA CGACGAAACC
GGCCATGGCC TGGTCACCGA CGTCGCGATC AAGAGGAAGG TCCGCGACAC CATCGGCCTG
GCCGCCGAAG CCGAAGGCCT CGACCTGACC CGCTACCAGA TCTTCGTCGA AGCCGGCCAC
GCGCTGAACA CCCGACTGGA AGAGTCCTAC CTCGTCAAGG GACTCGAACT CGGCAAGAAG
ATCGACGATG CGAAAGCCGC GAAGGCCCGG GAATGGCTCG CCAACCGGTA CGTCGACATC
CGCCTGTTCG GCGCGGTCCT GTCCACCGGC AAGACCCAGT CGCTGGGGCA GATCCGCGGA
CCGATCCAGG TCGGCATGGC CCGGTCCCTC GACCCGGTCC TGCCCGTCGA CCATGCGATC
ACCCGGGTCA CCCAGACCAC CCAGGCCGAC ATCGACAAGG GCGAACGCAC CGAGATGGGC
GGCAAGTGGA CCGTCCCCTA CGGCCTGTAC CGGGCAGAGA TCCACTACTC GGCGCCCCGA
GGCCGCCAGA CCGGTGTCAG CGCCGCCGAC CTCGACCTGT TCCTGTGCAC CCTGGTCAAC
ATGTTCGACC ACGACCGGTC CGCGACCCGC GGCGAGATGG CCACCCGTGG CCTGTACGTG
TTCAGCCACC ACAACGCCTT CGGCGTCGCA CCGGCCCACA CCCTCTCCGC CCGCATCACC
GCCCGGAAGA TCTCCGCGGG TGAACCGCGC AGCTTCGGCG ATTACAAGAT CGACGTCGAT
GACGCCGACC TGCCCGACGA CGTGGCCCTC ACCCGCGTGC TGGGATGA
 
Protein sequence
MAFNLDPEKK HDMVLLFDVT DGNPNGDPDN GNRPRTDDET GHGLVTDVAI KRKVRDTIGL 
AAEAEGLDLT RYQIFVEAGH ALNTRLEESY LVKGLELGKK IDDAKAAKAR EWLANRYVDI
RLFGAVLSTG KTQSLGQIRG PIQVGMARSL DPVLPVDHAI TRVTQTTQAD IDKGERTEMG
GKWTVPYGLY RAEIHYSAPR GRQTGVSAAD LDLFLCTLVN MFDHDRSATR GEMATRGLYV
FSHHNAFGVA PAHTLSARIT ARKISAGEPR SFGDYKIDVD DADLPDDVAL TRVLG