Gene Francci3_0018 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0018 
Symbol 
ID3903593 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp25679 
End bp27331 
Gene Length1653 bp 
Protein Length550 aa 
Translation table11 
GC content68% 
IMG OID637877348 
Producthypothetical protein 
Protein accessionYP_479141 
Protein GI86738741 
COG category 
COG ID 
TIGRFAM ID[TIGR02547] CRISPR system CASCADE complex protein CasA/Cse1 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.830051 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGGTT TCAACCTGAT CGATGGGCAA TGGATCCCAG TGATCAAGCG GGGGCGGCGC 
CTGGAGGTGG GGATCAGGAA GGCGCTCGTC GACGCTCACA CGATCGACGG GCTGGCCCTT
GATGATCCGC TCGAAGCCGT GGCGGTGCTG CGTCAGGTCC TGCTGCCCGT CGTCCTGGAC
GTGTTCGGCG CCCCCCGTAC CGACGAGGAG TGGTCACAGC GGTGGGAGGC TGGCTGTTTC
GACCGGATTA TCAGGAAAGA TCGGGCTGAG GATGAGGAGG GCATCGAGTC GTATCTGATC
CGACAGGCGG CACGATTTCA TCTGTTTCAT CCGACGGCTC CGTTTGCGCA GGTCGCCGGG
TTGCGGACCG CGAAGGACGA GACGAAGCCG GTGTCGCTGC TGGTTCCGCG GCTGGCCTCG
GGGAACAATG TTCCGCTGTT CAGTTCCCGG ACCGAGAACG ACCCGCCGAG CCTGACGCCC
GCTGCGGCGG CACGGGCGTT GCTCGCAGCC CATTGCTGGG ACACCGCCGC GATCAAGACC
GGAGCGGCGG ATGACCCGAA GGTCAAGACG GGGAAGACGA TGGGGAACCC GACCGGGCCG
CTCGGGCAGT TCGGGATCGT CCTGCCGCTG GGTGAGACGC TCTTCCATAC GCTGATGCTC
AGCATCCCCG TCCTGCGGCA CGGTCTGCGG CAGAAGGACC GGCCGCAATG GAGATCCGAA
AGCAGCGCCA CCTCCCGCTG GGAGACTCGC GCCCCGGAAG GGCTACTGGA TCTGCTGACC
TGGCAGTCGC GGCGGATCCG GCTCGTGCCC GAGGCAGACC CGACGGCTGT CGAGGATGTC
TCGGTCCGAC GGGTGGTGCT CACCGCCGGT GACCGGCTCA CCGGATCGGT GCACGCACTC
GAACCGCACA CGGCGTGGCG GCAGGTTGAC AAGCCCAAAG CCGACGAGCC GCCGGTCCGT
CCGGTGCGGC ATCAGCCGGG CCGGTCGGCC TGGCGCGGGC TGGAGGCTCT GCTGACCACA
ACGCCCTTGT CGAGTGACAA AGTGTTCGCG CCGACGGCGC TGTCGCAGCT CGCCCGGCTG
CGTGACGACG GCTATGTCCC CGATGATCTG CCACTGCAGG TACTGACCGT CGGGGTGAAG
TACGGCACGC AGTCGGCGGT GATCGACGAG GTAATGGCCG ATGAGATTCC CCTCCCGGTC
ACCGCGCTGG CACGGGACTC CGCGGTCCGT GAGACCGTGC TGGCGGTGGC GGCCCAGGCC
GAGAGCCTGC GGATCGCCGC CAACCGTCTC GGCGACGATC TTCGTGAGGC TGCCGGTGCG
ACGGACAAGC TGCCGTGGGA CAAGGGCCAG CGGCTCGGAG AAATACTGAT CCACAGCTTC
AATCCGACGG TGCACCGGCT GCTAGCCGGG TTGCAGCAGC ATCCGGAGGA CGCCAAGCGG
GCCGAACTTG CGTGGCGGAT CCTCGCTCGG CGGTTGGCGT GGGAGGTCGT CGACCCGGTG
TTGTCCGCTG CCGGTCCTGA GACGTTCCTG GGCCGCGATC CCGGTGAGCC GTTCGGCGCC
CGCCTCGCCG GCGCCGAGAT GTCATTCCGG CGCACGCTCA ACGACGTGCT CGGCAAGGAC
GAGGACAACC GGCTTGCTCT GGCCGCCGCC TGA
 
Protein sequence
MNGFNLIDGQ WIPVIKRGRR LEVGIRKALV DAHTIDGLAL DDPLEAVAVL RQVLLPVVLD 
VFGAPRTDEE WSQRWEAGCF DRIIRKDRAE DEEGIESYLI RQAARFHLFH PTAPFAQVAG
LRTAKDETKP VSLLVPRLAS GNNVPLFSSR TENDPPSLTP AAAARALLAA HCWDTAAIKT
GAADDPKVKT GKTMGNPTGP LGQFGIVLPL GETLFHTLML SIPVLRHGLR QKDRPQWRSE
SSATSRWETR APEGLLDLLT WQSRRIRLVP EADPTAVEDV SVRRVVLTAG DRLTGSVHAL
EPHTAWRQVD KPKADEPPVR PVRHQPGRSA WRGLEALLTT TPLSSDKVFA PTALSQLARL
RDDGYVPDDL PLQVLTVGVK YGTQSAVIDE VMADEIPLPV TALARDSAVR ETVLAVAAQA
ESLRIAANRL GDDLREAAGA TDKLPWDKGQ RLGEILIHSF NPTVHRLLAG LQQHPEDAKR
AELAWRILAR RLAWEVVDPV LSAAGPETFL GRDPGEPFGA RLAGAEMSFR RTLNDVLGKD
EDNRLALAAA