Gene Francci3_1127 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1127 
Symbol 
ID3906606 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1340702 
End bp1343791 
Gene Length3090 bp 
Protein Length1029 aa 
Translation table11 
GC content73% 
IMG OID637878459 
Productlantibiotic dehydratase-like 
Protein accessionYP_480236 
Protein GI86739836 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACCGCA GCCTTGACGT TCCGCTGCTA CGGATGGCCG TGGTCCCCAC CGATGTCGAC 
CTGCCCCCAT GGCCAGACCT CGACGACACC ACCGACGGCG CCGTAGCCGG CTGGACGGAT
TGGCTGCGGG ACATCTGGGC ACGGCCCGTG TTCGCCGACG CGATGAGGCT CGCCAGCCCG
TCGCTGGCCG CGCGGGTCGC GTCGGTCTTC GACGGGCAGT GCCACGATCG GCGGACCGCC
CGGCGGGCTG TCACTGCGAC AGCCCGCTAC GTACTGCGGA TGACCGGGCG CGCCACTCCA
TTCGGCCTGT TCGCCGGTGT CGGGCCCGCG GATTTCGGCC CGCGGGCCCG CGTCCGGCTC
GGCGACGACC ATCGGGTTCA CGCCCGGCCG GGCGGCGCCT GGCTCGCCGC GCTCGCCGAC
GACCTGGAAG CGTTCGACAC GCTGCTGCGA CGGCTGCCGG TCACCGCCGA CGGAATACCA
GTCGCCCGAG GCGACCGACT GTTCATCGGC CCTGTCGCCC GGTCCGGCGA CGACGCCGGG
CCGGAGTCGG TGGTCGAGGT GTCGGTGCGC CGCACCCGGG CTGTGGAGGC GGCGTTGCTC
CACGCGGTCA GCTCGCTGCC CGTCCCCGAG CTGGTCGCTC GCCTCGCCGC CGACAAACCC
GCTCCCGCCG CCGGCCAGAT CGAGACTATG GTGATCGAGT TGGTCCGCCG CCGGTTCCTG
CACACGCCCC TACGCCCGGC GACAACCATC ACCGATCCAC TCAGTGAAGT CGCCGACCAG
CTGGCCGCAG CCCGGATTGA CGACCTGCCC CCACTCGCCG GCCGTCTCAA CGTGATCCGC
GACCTACAAG ACCTGCTCGC CCACCACGAC AACACGCGCT CCGCCTCGCA CCGTCGCACG
CTGCGCACGC AGGCAGAGAC GCTTCAACGG CGCGTTGTTC CCAGCGCTGA CCCGGACCTC
GACGTAGGCC TCGTGTTGAA CGCGGACGTG GTCCTGCCGG CGGCTGTCGC CGAGGAGGCT
GGGGCCGCCG CTCGGGCGCT CGCCCGGCTC ACAGGGCCGA CCGCCGTTCC CCCGGAGTGG
GCGGACTGGC ACGAAGCCTT CCTGGAACGC TACGGTCCCG CCGCCCTCGT CCCACTGCTG
GAAGCCGTCA ACCCTGACAC CGGCCTCGGT CTGCCCGCCA CCTTCCGCGG CTCCACACGC
TCCCACCGCC CCCGCGGGAA CGCGGCCCGC GATCAGCGCC TGGCCGCCCT CACGCAGCGG
GCGGCCCTGG ACGGCGCGCT CGAGGCACGC CTCGGCGACG CCGCCATCGC GGCTCTGGCC
GGCGACACCG CACCGGACCG GGTTCCACCG CACGCCGAAC TCATCGCCGA GGTCCACGCC
AACGACCTGC CGGCCTTGTC CCGCGGCCTG TTCACCCTGC GTGTGACCGG AGGCGGACGC
GCCGCCGCAA CCACCACTGG CCGCTTCCTG CACCTGCTCG ACCCCGCGTC CTTCGACCGG
TTCCGCGACA TCTACGCCGC CCTACCCACC ATGCGGGCCG GCGCGACAGC CGCGCAGGTG
TCCTGTCCGC CGCTTCCCGC AGCCACCGTC GGCGTGGCCC GATCACCGCG GCTCACCGAC
ACGGTGATCA CGCTTGCCGA GCATCACGAC GCCGGTCCGT GCGTGCTGCG TCCAGCCGAT
CTGGCTGTCG GTGCCGACGC TGAGGGGCTG TATCTGGTGT CGCTGCCCGA CCGGCGTCTC
GTCGAGCCGT CCGCCGCGCA CGCAGTGGAG TTCCAGTACC ACACCCACCC GCTGACACGG
TTCCTGTGCG AGCTACCCAC CGCCCGCGTC GCCGTCTCCA TCCCCTTCTC CTGGGGCGCT
GCCGGTGGGC TGCCGTTCCT TCCCCGACTC GTCCATCGGC GCACCGTGCT CGCCCCGGCC
CACTGGAATC TCACAGCTGC CGACCTGCCC AGCCGCGCCG AGTCGGCGCA GGAATGGCGC
GAGGCGCTGG CCCGCTGGGG CGGCACGATC CCAGTGCCCA CGCGGGTCCA GCTCGTCGAG
ACCGACAACC GATTGCGCCT CGATCTGCAC GTCGACGCCC ATCTCGCCCT GCTACGCGAT
CACCTGCACC GGCACGGCTC CGCCCGGCTC GACGAGGACG CCGCTCCAGA AGCCTTCGGC
TGGCTCGCCG GCCACGCCCA CGAACTGGTC ATCCCGCTGG CCACCACCGC CGCACCCCTG
CCGGGGCCGG CACCGGAGCG CCTCGCGGCA ACCGTGCCCG CTGCTCGGTC GGATGCCCAG
CTACCCGGTG TGTCGCCGTG GCTGTCGGCC CGGCTCTACG GCCACCCCGA CCGGGCCACC
GATCTCCTCG CCCGCCTCCA CGACCTCTTG GACGCGTGGC CCGACCCGCC CGCGTGGTGG
TTCCTGCCGT ACCGGGACAC CGAGCCCCAC CTGCGGCTAC GCCTCGCCGT CAACGAACCT
GGCGGCTACG GGCTGGCGGC CACGCGGCTG GGAACCTGGG CTGCGGTACA GCGCGAGGCC
GGACTGCTGT CGCACCTTCA CCTCGACACC TACCAACCGG AGACCGGCCG CTACGGCTAT
GGCCCGGCGA TGACCGCCGC CGAAGGCGTC TTCTCCGCCG ACTCCGCCGC TGCCCTCGCC
GCCCGCCGTC TCGCCGCCAG CGCCCGCCTG TCCGTCGGAG CGCTCACCGT CGCCGGCCTG
GTCGATATGG CCATCGCCTT CACCGGCGCG TTGTCCACCG GCATGCGGTG GCTTCGTGAC
GTGCTGCCCC ACGAGCCTGC CCGCGTCCCA CGGGCGGAAC GCACCACCAC GTTGCACCTG
GCCAACCCCG CAGATAGCTT CGCTCATCTG CGGGCCGTTC CCGGGGCGGA CGCCGTGCTG
TCCGCCTGGC AGCCGCGGGC CACAGCACTG GCCGCCTACC GCGACGCCCT CGCCGCGCAG
CGCCCGCCTG ACCAGGCCTT GGCCTCGCTG CTGCACCTGC ACGTCGTCCG AACCCTCGGC
CTGAACCCCG CCGCCGAACA GACCTGCATC GCCCTCGCCC GAGCCACAGC CGCACGCGCC
CTCGCGACCA CGGAAGCGAC CGTCTCATGA
 
Protein sequence
MYRSLDVPLL RMAVVPTDVD LPPWPDLDDT TDGAVAGWTD WLRDIWARPV FADAMRLASP 
SLAARVASVF DGQCHDRRTA RRAVTATARY VLRMTGRATP FGLFAGVGPA DFGPRARVRL
GDDHRVHARP GGAWLAALAD DLEAFDTLLR RLPVTADGIP VARGDRLFIG PVARSGDDAG
PESVVEVSVR RTRAVEAALL HAVSSLPVPE LVARLAADKP APAAGQIETM VIELVRRRFL
HTPLRPATTI TDPLSEVADQ LAAARIDDLP PLAGRLNVIR DLQDLLAHHD NTRSASHRRT
LRTQAETLQR RVVPSADPDL DVGLVLNADV VLPAAVAEEA GAAARALARL TGPTAVPPEW
ADWHEAFLER YGPAALVPLL EAVNPDTGLG LPATFRGSTR SHRPRGNAAR DQRLAALTQR
AALDGALEAR LGDAAIAALA GDTAPDRVPP HAELIAEVHA NDLPALSRGL FTLRVTGGGR
AAATTTGRFL HLLDPASFDR FRDIYAALPT MRAGATAAQV SCPPLPAATV GVARSPRLTD
TVITLAEHHD AGPCVLRPAD LAVGADAEGL YLVSLPDRRL VEPSAAHAVE FQYHTHPLTR
FLCELPTARV AVSIPFSWGA AGGLPFLPRL VHRRTVLAPA HWNLTAADLP SRAESAQEWR
EALARWGGTI PVPTRVQLVE TDNRLRLDLH VDAHLALLRD HLHRHGSARL DEDAAPEAFG
WLAGHAHELV IPLATTAAPL PGPAPERLAA TVPAARSDAQ LPGVSPWLSA RLYGHPDRAT
DLLARLHDLL DAWPDPPAWW FLPYRDTEPH LRLRLAVNEP GGYGLAATRL GTWAAVQREA
GLLSHLHLDT YQPETGRYGY GPAMTAAEGV FSADSAAALA ARRLAASARL SVGALTVAGL
VDMAIAFTGA LSTGMRWLRD VLPHEPARVP RAERTTTLHL ANPADSFAHL RAVPGADAVL
SAWQPRATAL AAYRDALAAQ RPPDQALASL LHLHVVRTLG LNPAAEQTCI ALARATAARA
LATTEATVS