Gene Francci3_3346 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3346 
Symbol 
ID3904132 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3969767 
End bp3970801 
Gene Length1035 bp 
Protein Length344 aa 
Translation table11 
GC content70% 
IMG OID637880671 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_482432 
Protein GI86742032 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03640] CRISPR-associated endonuclease Cas1, DVULG subtype 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0114588 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.257048 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTGAGC TCCTCAACAC CCTCTACGCC ACAACACCCG GAACCAGCCT GCACCTCGAC 
GGCGACGCCG TACGCATCTG GCATCCCGAC AACGACAAAG GCCGCCGCCT TCTCCCCCTC
GTCCGCGTCG ATCACATCGT CGTCTTCGGC GGCGTCACCA TCACCGACGA TCTCCTACAA
CGCTGCGCCA CCGACCGCCG CTCCGTCACC TGGCTCACCG GCAACGGCCG CTTCCGCGCC
CGCGTCGAAG GACCCACCGG CGGCAACCCC CACCTACGCA TCGCCCAACA CGATCACTTC
CGCGACGACG AACGACGCCT CACTCTCGCC ATGTCATACA TAGCCGGGAA ACTCCAGAAC
AGCCGCCAAC TCCTCCTCCG CGCCGCCCGC GACGCCACCG GCACCCGCCA AACCGCACTC
CGCGACACCG CCGCCCACCT CGCCGACGCC CTCCCCACCC TGCGTGACAC CACCAACGTC
GCCGAGGCCA TGGGCGTCGA AGGACAGGCA GCCCGCCGCT ACATCGCCAC ATGGCCGCAC
CTGCTCACCC CGCACGCGAC CGTCACCGCC CCCGCCGGAC GCACCAGCCG ACCCGCCACC
GACCCGGTCA ACGCCGCCCT GTCCTTCGGC TACGGCATCC TGCGCATCGC CGTCCACGGC
GCCCTCGACC ACGTCGGCCT CGACCCCCAC ATCGGCTACC TCCACGGCAT CCGCCCCGGC
AAACCCGCCC TCGCCCTCGA CCTCATGGAA GAATTCCGCG CCCTGCTCGT CGACCGCCTC
GTCTTCACCG CCTTCAACCA GCGCCAGCTC ACCGATGCCG ACTTCGAACA CCACCCCGGC
GGCTCCTGCC AGCTCACCGA GTCCGGCCGG AAAAACTACC TCACCCTGTG GAGCCAGGCA
CGCGCCCGAA CCTGGCCCCA CACCCTCCTC ACCCACGACA CCCCCGCCGC CACCCTTCCC
CTGCTCCAGG CCAGGATCCT CGCCCGACAC CTCCGCGGCG ACATCCCCCG GTACATCCCC
TGGAGCCCTA CCTGA
 
Protein sequence
MAELLNTLYA TTPGTSLHLD GDAVRIWHPD NDKGRRLLPL VRVDHIVVFG GVTITDDLLQ 
RCATDRRSVT WLTGNGRFRA RVEGPTGGNP HLRIAQHDHF RDDERRLTLA MSYIAGKLQN
SRQLLLRAAR DATGTRQTAL RDTAAHLADA LPTLRDTTNV AEAMGVEGQA ARRYIATWPH
LLTPHATVTA PAGRTSRPAT DPVNAALSFG YGILRIAVHG ALDHVGLDPH IGYLHGIRPG
KPALALDLME EFRALLVDRL VFTAFNQRQL TDADFEHHPG GSCQLTESGR KNYLTLWSQA
RARTWPHTLL THDTPAATLP LLQARILARH LRGDIPRYIP WSPT