Gene Francci3_0020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0020 
Symbol 
ID3903595 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp28057 
End bp29211 
Gene Length1155 bp 
Protein Length384 aa 
Translation table11 
GC content67% 
IMG OID637877350 
ProductCRISPR-associated Cse4 family protein 
Protein accessionYP_479143 
Protein GI86738743 
COG category 
COG ID 
TIGRFAM ID[TIGR01869] CRISPR system CASCADE complex protein CasC/Cse4 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.451411 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCTGCT ACATCGACGT CCACATCCTG CAGACCGTCC CTCCGTCCAA CCTCAACCGA 
GATGACGCCG GTACCCCGAA ACAAGCCGTC TATGGCGGGG TGAAGCGGGC CCGGGTGTCG
TCCCAGGCCT GGAAGCGGGC GACGCGAACC GCCTTCGCCG ACCACATCGA TCAGGCCCAG
CTCGGAACAC GCACCAAGCG GATCTCCGCG CTGCTCGCGG AACGGCTCGC AACCCGCTGC
GCGCTCGACG CGGAAACCAG CACCCGGATC GCCACCAGCC TTCTGACCGC TCTGAAGATC
AGTGCGGGGA AGAAGGCGGC GGAGACCGCC TATCTGCTGT TCTTTGGCCG TCCCCAGCTC
GAACGGCTCA TCGACCTCAT TGTCGAGGAT GTGCCGCGCC TCGCCGATCT CAGCGACGGC
GATCTGCTCG CCGCGGTCAA GGATGTGCCT GTCCTGGCTA CTCTCGGCAG CGACCATCCG
ATCGACGTCG CGCTGTTCGG GCGGATGGTC GCCGACCTGG CGTCGTTGAA CGTCGACGCG
GCCACCCAGG TCGCGCATGC CCTGTCCACC CATGCCGTCG ACGTCGAGTT CGACTACTAC
ACCGCCGTTG ACGACCAGAA CGCCAAGGAC GAGACCGGCG CCGGGATGAT CGGCACGGTC
GAGTTCCAGT CCGCGACGCT GTACCGGTTC GCCACCGTCG GCCTGCACCA GCTCGCCGAG
AACCTCGGCG GTGACATCGA GGCGACCGTC GAGGCGCTAC GGGTGTTCCT CACCGCGTTC
ACCACCTCCA TGCCGACCGG CCATCAGAAC TCCTTCGCCC ACCGCACCGT GCCGAACCTG
CTCACCATCG CGATCCGCCC CGACCAGCCG GTCAACCTTG TCTCCGCGTT CGAGAAGCCG
GTACTGCCCC GTGGCCGGGG CGTCCTCACC GGATCCCTCG AGCAGTTCGC CATCGAACTC
AACAGCGCGT CGACGCTGTG GGGCCTCCAG CCCGACATCC TCGCCTCCAC CTACCGCGCC
CCCGACGACA CCAACACCAA CACCGACACC ACGGCGATGA TCGTCAAGGC GCTCGGCGAG
CCGAAGCCGT TCGACGAGGT TCTCGACACA GTGGTGGCTG CCGCCCGCGA CCGGCTCATG
AGCAGCGTCC GATGA
 
Protein sequence
MRCYIDVHIL QTVPPSNLNR DDAGTPKQAV YGGVKRARVS SQAWKRATRT AFADHIDQAQ 
LGTRTKRISA LLAERLATRC ALDAETSTRI ATSLLTALKI SAGKKAAETA YLLFFGRPQL
ERLIDLIVED VPRLADLSDG DLLAAVKDVP VLATLGSDHP IDVALFGRMV ADLASLNVDA
ATQVAHALST HAVDVEFDYY TAVDDQNAKD ETGAGMIGTV EFQSATLYRF ATVGLHQLAE
NLGGDIEATV EALRVFLTAF TTSMPTGHQN SFAHRTVPNL LTIAIRPDQP VNLVSAFEKP
VLPRGRGVLT GSLEQFAIEL NSASTLWGLQ PDILASTYRA PDDTNTNTDT TAMIVKALGE
PKPFDEVLDT VVAAARDRLM SSVR