Gene Francci3_1380 details


Gene Information

Locus tag: Francci3_1380
Symbol:
ID: 3906582
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1658205
End bp: 1660010
Gene Length: 1806 bp
Protein Length: 601 aa
Translation table: 11
GC content: 73%
IMG OID: 637878717
Product: type III restriction enzyme, res subunit
Protein accession: YP_480486
Protein GI: 86740086
COG category: [K] Transcription; [L] Replication, recombination and repair
COG ID: [COG1061] DNA or RNA helicases of superfamily II
TIGRFAM ID:
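
The coordinate and length fields above are related by simple arithmetic, which can serve as a consistency check on the record. The sketch below assumes 1-based inclusive coordinates and a CDS whose final codon is a stop codon; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length(1658205, 1660010)
print(length)                  # 1806
print(protein_length(length))  # 601
```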


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.942893
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0185831
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGAGCCG GCCCCCTCGA CCGAAGCCCT CGTCCACGAC GTCCCGTTCC ACCAGCCCGA 
CCGCTGCGCG CGTGGCAGCG TGCGGCCCTG GAGATCTACC GGTCACGGTC GTCGTCGGGC
GCCCGGGACT TCATGGCGGT CGCCACCCCG GGCGCCGGGA AGACGACCTT CGCCCTGCAG
GTGGCCGCCG ACCTTCTCAC TGCCGGCGAG ATCAGTCGGA TCACCGTCGT CGCGCCCACC
GAGCACCTCA AACGCCAGTG GGCCGTGGCC GCGGCGGAGG TTGGTGTCGA CCTCGATCCG
GACTTCCGCA ACTCGGCCGG GGCGACATCC TCCGACTACA CCGGTGTCGC CGTGACCTAC
GCCCAGGTGG CGGCGCATCC GGCCCTGCAC CGTGCCCGGA CCGCCGCGCG TCGCACCCTG
ATCGTCCTGG ACGAGATCCA CCACGCCGGC GACGCGCTGT CATGGGGTGA GGCGATCCGG
GAGGCGTTCA CCCCGGCCGC GCGTCGCCTC GCCCTGACTG GTACTCCGTT TCGCTCCGAC
GTCAACCCGA TCCCTTTCGT CACCTACCTG CCGGGCCCCG ACGGGGTGAT GCGCAGCATC
GCGGACTCCT CCTACGGCTA TGCCGAGGCG CTGCGCGACG GGGTTGTCCG CCCGGTGCTC
TTCCTCGCCT ACTCGGGCGA GATGACCTGG CGGACCAGTG CCGGCGCGGA ACTGAGCGCC
CGACTCGGGG AGCCGCTCAC CAACGAGCAG ACCGCCGCCG CGTGGCGCAC CGCGCTCGAC
CCGGGCGGGG ACTGGATGCC GGCGGTGCTC GCCTCCGCCG ACACCCGGCT GTCCCAGGTG
CGGGCCGGCG GCATGCCCGA CGCGGGCGGG CTGGTCATCG CCACGGATCA CGCGGCCGCG
CGCGCCTACG CCGCGCTGCT CACCCGGATC ACCGGCACCT CGCCCGTGCT GATCCTCTCC
GACGACCCGA CCGCGTCGAC CAAGATCGAT GACTTTCGCC GGTCGGCCGA CCGGTGGATG
GTGGCGGTAC GGATGGTCAG TGAGGGCGTG GACGTGCCTC GCCTGGCGGT CGGGGTCTAT
GCGACCTCGG CGTCAACCCC GCTGTTCTTC GCCCAGGCCG TCGGGCGGTT CGTTCGGGGC
CGCGGTCGCG GGGAGACGGC GTCGGTCTTC CTCCCGAGCG TGCCGTCCCT GCTGGCGCTG
GCTGGCGAGA TGGAGGTCCA GCGGGACCAC GCCCTGGAGA AGTCCCCGCG CGATCCGGAC
GCCTTCGACG ACGACGCGCT GCGGGACGCC AACCGGCGCA AGGACATCCC CGACAAGCCG
GACACCTTGT TCACGGCGTT GGGATCCTCG GCCCATCTCG ACCGAGTGAT CTTCGATGGC
GGCGAGTTCG GCACCCCGGC GGCCCCCGGG TCCATCGAGG AGGAGGACTT CCTGGGCCTG
CCGGGCCTGC TGGAGCCCGA TCAGGTCGCG GTTCTGCTGC GCCAGCGGGA GGCCGCCCAG
CAGGCCGCCC AGCAGGCCGC CACGCGCCGT GCCGCTGCCC GTGCCGCGGA TTCGCCGGTC
GTCCCGCCGG CCCGCGAGGG CGTGCAGCCG GCGGTCGAGG CGGCCACCCG GCCCGTGCAC
GAGGTGATCG GCGAGCTGCG CAAGGAGCTG AACCGGCTTG TCGGCGCACA CTTCCACCGC
ACGGGCAAAC CGCACGGGAT GATCCACGCC GAGCTGCGCC GGACCTGTGG CGGTCCGCCG
AGCGGCCAGG CCACCGCCGC GCAGCTCCAG GCCCGCATCG ACACCATCCG CCGCTGGGCC
GGTTGA
 
Protein sequence
MRAGPLDRSP RPRRPVPPAR PLRAWQRAAL EIYRSRSSSG ARDFMAVATP GAGKTTFALQ 
VAADLLTAGE ISRITVVAPT EHLKRQWAVA AAEVGVDLDP DFRNSAGATS SDYTGVAVTY
AQVAAHPALH RARTAARRTL IVLDEIHHAG DALSWGEAIR EAFTPAARRL ALTGTPFRSD
VNPIPFVTYL PGPDGVMRSI ADSSYGYAEA LRDGVVRPVL FLAYSGEMTW RTSAGAELSA
RLGEPLTNEQ TAAAWRTALD PGGDWMPAVL ASADTRLSQV RAGGMPDAGG LVIATDHAAA
RAYAALLTRI TGTSPVLILS DDPTASTKID DFRRSADRWM VAVRMVSEGV DVPRLAVGVY
ATSASTPLFF AQAVGRFVRG RGRGETASVF LPSVPSLLAL AGEMEVQRDH ALEKSPRDPD
AFDDDALRDA NRRKDIPDKP DTLFTALGSS AHLDRVIFDG GEFGTPAAPG SIEEEDFLGL
PGLLEPDQVA VLLRQREAAQ QAAQQAATRR AAARAADSPV VPPAREGVQP AVEAATRPVH
EVIGELRKEL NRLVGAHFHR TGKPHGMIHA ELRRTCGGPP SGQATAAQLQ ARIDTIRRWA
G
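
As a cross-check between the gene and protein sequences above, the following sketch translates a nucleotide string written in the same space-grouped layout. It assumes the amino-acid assignments of the standard genetic code (which translation table 11 shares) and that an initial ATG, GTG, or TTG is read as Met; this is only a subset of the initiators table 11 allows. The helper name `translate_cds` is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (first base varies slowest, bases TCAG).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Assumption: a subset of the start codons accepted under translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(seq: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiator codon is read as Met at position 1
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt of the gene sequence above.
print(translate_cds("GTGAGAGCCG GCCCCCTCGA CCGAAGCCCT"))  # MRAGPLDRSP
```

The output matches the first ten residues of the listed protein sequence, including the Met read from the GTG start codon.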