Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3975 |
Symbol | |
ID | 5672336 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 4757148 |
End bp | 4761383 |
Gene Length | 4236 bp |
Protein Length | 1411 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641242854 |
Product | hypothetical protein |
Protein accession | YP_001508271 |
Protein GI | 158315763 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | [TIGR02680] conserved hypothetical protein TIGR02680 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.225862 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAGC CGACGACCGA ACGCTGGCAG CCGCTGCGCG CCGGGCTGGT CGACGTCTTC CTCTACGACG AGGAGGAATT CCACTTCCAC GAGGGCAACC TGCTGCTGCG CGGCAACAAC GGCACCGGTA AGTCCAAGGT CCTTGCGATG CTGCTGCCGT TCCTGCTCGA CGGTGAGGTG TCGGCCCACC GCGTCGAACC CGACGGCGAC CCGAAGAAGC GGATGGACTG GAACCTGCTC ATGGGCGGGC GCCATCCCTA CCCGGAGCGC CTCGGCTACA GCTGGCTGGA GTTCGGCCGG CTCGACACGT CCGGCGCCGC CGAGTACCGC ACGATCGGGG CCGGGCTGAA GGCGGTCGCC GGCCGCGGGT CGGTCACCAC CTGGTTCTTC ATCGCCGACG CCCGTGTCGG CGTCGATCTC GACCTGGTCG GCCCGACCCG GACCGCGCTG ACGAAGGAAC GGCTGCGCGA TGCCCTCGGC CCGCGCGGCA ACGTCTACGA ACAGGCGTCG CGCTACCGGC GCGCCCTGGA CGAGGCGTTC TTCGGCCTCG GCGAGGAACG GTACGACGCG CTGGTCAATC TGCTCATCGC GCTCCGCCAG CCGCAGCTGT CGAAAAAGCC GGACGAGCGG GCCCTGTCGG CGGCCCTGAC CGAGGCGCTG ACCCCGCTGG ACCAGGCGGT GATCGCGGAC GCGGCCGAGG CGTTCCGGGG TCTCGAGCAG GAACAGTCGG ACCTGCAGGG GTTCGAGGAG GCGCACGAGG CCGCCACCGC CTTCCTCGGA CACTATCGCC GCTACGCCCG CATCAGCGCG CGGCGCCGGG CCCGCGAGCC CCGCGCCACC CAGGCCGCCT ACGAGTCGGC GTCCCGCCAG CTGAACGACG TCCGGGCGGA GCGGGAGCGG GTCGAGGCCG AGGCGATCCG GGTGGACGGG GAGGACAAGG AGAACGAGGA GCGCATCACC GCGCTCACCG CCGAACGGGA CACCCTCGCC GCCCGCCCGG AGATGGACCA GGCCCGCGAA CTCGTCCGGC TGCAGCAGCG GATGTCCGAC CTGGAGCGCG TGGCCAACCG GGCCGGGGCC GAACTTGCGG CCGTGACGAC CCGGCGCGAC GGCCTCGCCC GCCGGACCGC CGACGCCCGC TGGCGGGTCG AGGCGACCGT AGCCGCGCTG GAGCGGGCGG CCGATGCCGC CGGCACCGGT GCCGAGCATG CTGGCGTCGG CGCCGAGCAC CAGGCGCTAC TCGCCCCCCT GCACATACCC GACGGTCCTC CCGTTGAGCG GGTTGAGGAG GCCTCCTCGG CCGCCCGGCG CGCGGCCGAG GATGTCACCC GACGCCGCGA GGACGCGGTC AAGCTGGTCG CCGGCCTGAT AACGGCCGAC CGCCGAGCCC AGGAGCGGGC GGCCGACCGG CGGCGGGAGC GAGACGAGGC CCGGTCGGTC CTCGATGGGG CGACCGAGGC CGTAGCCGGC GCCGAGGCGG CCCGGGAGAC CGCCGCCGCC GCGTTCGTCG ACGCGGCGCG TCGCCACCTC GAGGGCGCCG AACAACTCCG CCTGACCACC CTCGACGACG TCCTCGCCGC GGTGGCCTAC TGGCTGGACG GCTTCGACGG GGACAGTCCG CTGGTCGCCG CGGTCACGGG GCGCGCCCGC GAGCTGGAGA ACGAGTTCGG CGCCGCCGAC CAGCGGCTGC GTACCGCCGT CGGCGAGGCA CAGGCGCGGG TGGGCATCCT GACCACCGAG CGCCGCGCCC TGGCCGAGGG TGCACGGCTG TCCCCGCCGG CCCCGCCGAC CCGCGCTGCC GACCGGGACG GACGGCCGGG AGCGCCGCTG TGGCAGCTCA TCGAGCCGCG CGAGGCAGGT CCGACGCCCG CGCCGGCCCT GTCAGCTTCC CCGCCGCTGG GTCCGGCGGA GATCGCCGGG CTGGAGGCGG CGCTGGAGGC GTCTGGGCTG CTGGACGCCT GGCTCTGCCC CGACGGGCGG CTGCTCGCCC CCGGCACACA CGACGTGGCC ATCGTCGCCG ACGGCGCCGG TGCGGGCCGT GTCGGCGACC ACCATCTCGG CGCGGTGTTG TGTCCAGCGG TCGACCGCTC CGACCCGGCC GCGGCTGCCG TCCCCGACGC GATCGTGGCC GCCGTCCTAG CCCGCGTCGG GTACGTCCCT GCGGATACGT CCGGCGTGGT GGATCCGGAT GCCGTCGATC GTGTCGGCGA CACCTGGGTC GCCGCGGACG GGCGGTTCCG CGTCGGGCGG CTGCGCGGCT CCTGGACCAA ACCGGCGGCC TGCTACCTCG GGCACGCGGC GCGCGAGCTG GCCCGCCGCA ACCGCCTCAC CGACATCGAC ACCGAGCTGG CCGCCCTCGA CCGCGAGATC GCCGAGCACG GCGCGGCCCG CACGGCCCTG CGCCAGCGCC GGGCCACGCT CAACGACGAG GTGGCCGGCC TCCCGGCGGA CACCCCGGCC CGCCACACGT TCGCCACCGC CCGCCAGGCC GACCGCGACC GACGCATGGC CGACGCCTCC CACCAGACGG CCGTGCGGGC CGCCGAGCAG GCCGAGGACG CGTTCCGGCA GGCCCGGGCC GAACGCGACG AGACCGCCGC CCAGCTCGGC CTGCCCGTCG CGACCGACGA GCTCGAGGAG ACCCGCGAGG CGCTCGTCGC CTACCGCGTC GCCCTGTCCG CGCTGTGGCC CGCCGTGCGG GAACGCATCA GCGCCGCGGC CCAGCTCGCG CAGACCGAGC AGGACCTGGC CGACGCCCAG GAGCAGGTCG AGCAGCGCGA GATCCAGCAC TCCGGCGCCC GCAGCGAGTA CCTGTCGGCC CGCACGGCGT TCGAGACGAT GGAACGCACG GTCGGCACCG CCGTCGCCGA GGTGCAGCGC CAGCTCGCCG CCGCCCAGCG CGGTCTGGAC TCCGCCACCG TCCGCCGCCG GGAACTCGGC CAGCTCCGGC TGACGATCGC CGAGGCGGTC GGCCAGGCCC GCGGGCAGGA GAGCGGCCTG ACCAGCCGGG TCGCGGACCT CGGCGAGCAG CGCCGGGACG CGGCCGGCCG GCTGCGCCGC TTCGCCGCGG GCGGACTGCT GGCCATCGCC CTGCCCGAGC TGACGTTCCC CGACACCGGC CGGGAATGGG CGCCCGACCC GGCAGTGCGC CTCGCCCGCC AGATCGAGGC GGCGGTCGGT GAGGTCGACG ACTCCGACGC CGGCTGGGAC CGGGTGCAGA AGCTCATCGC GGCCGAGCAC GGCACCCTGC GGGAGGTGAT GAGCCGCCAC GGCCACCACG CCGCCGCGAC GCTCACCGAC GACGGCTGGG TGGTCGAGGC GACGTTCCGC ACCCGCGTGA TGAGCCCCGC GGAGCTGGCC GACGCGCTCG GCGTCGAGGT CACCGACCGG CGTCGCCTGT TGACGGAACG GGAACGGGCC ATCCTGGAGA ACCACCTCGT CAGCGAGGTC GCCAGCCATC TCCAGGACCT GATCAGCGAC GCCGAGCACC AGGTCGAGCG GATGAACGCC GAGCTCGACG AGCGGCCCAA CAGCACCGGC ATGAAACTGC GCTTCCAGTG GCGCCCCGGC CCGGACGCAC CACCCGGCCT GGGCCCCGCC CGGGACCGCC TGCTGCGCCA GGTCTCGGAC GCCTGGTCAC CCGCCGACCG GGAAGCCGTC GGCGCGTTCC TGCAGGCGCA GATCAAGGCC GAGCGCGACC AGGACGCCGG CGGGACGTGG CTGGAACACC TCGGCCGTGC CCTCGACTAT CGCCGCTGGC ACGTGTTCGC CGTGCAGCGC TTCTCCGACG GCCAGTGGCG CTCGGCGTCC GGGCCGGCCT CGGGCGGCGA ACGGGCCCTG GCCGTCACCA TTCCGCTGTT CGCGGCCGCC TCCGGGCACT ACCGCTCGGC CGGCAACCCA CACGCGCCCC GCCTCGTCAT GCTCGACGAG GCGTTCGCCG GCGTCGACGA CGACGCCCGC GCGTCCTGCC TTGGCCTGCT CGCCGTCTTC GACATGGACG TCGTCATGAC CAGCGAGCGG GAATGGGGCT GTTACCCGAC CGTCCCGGGC CTCGCCATCA GCCAGCTGTC CCGGCGGGAC GGCATCGACG CGGTCCTGGT CACCCGCTGG CGCTGGGACG GCCGCCAGCG CCTCCCCGCC GGCGTCAAGC GGCCCGCCGG TGTTGCCACT GTTGCCGGGG CGAGGGAACC CGGCGCCACC GCACCTGCCG AGAACACTCT GTGGGACGAG CCGTGA
|
Protein sequence | MTEPTTERWQ PLRAGLVDVF LYDEEEFHFH EGNLLLRGNN GTGKSKVLAM LLPFLLDGEV SAHRVEPDGD PKKRMDWNLL MGGRHPYPER LGYSWLEFGR LDTSGAAEYR TIGAGLKAVA GRGSVTTWFF IADARVGVDL DLVGPTRTAL TKERLRDALG PRGNVYEQAS RYRRALDEAF FGLGEERYDA LVNLLIALRQ PQLSKKPDER ALSAALTEAL TPLDQAVIAD AAEAFRGLEQ EQSDLQGFEE AHEAATAFLG HYRRYARISA RRRAREPRAT QAAYESASRQ LNDVRAERER VEAEAIRVDG EDKENEERIT ALTAERDTLA ARPEMDQARE LVRLQQRMSD LERVANRAGA ELAAVTTRRD GLARRTADAR WRVEATVAAL ERAADAAGTG AEHAGVGAEH QALLAPLHIP DGPPVERVEE ASSAARRAAE DVTRRREDAV KLVAGLITAD RRAQERAADR RRERDEARSV LDGATEAVAG AEAARETAAA AFVDAARRHL EGAEQLRLTT LDDVLAAVAY WLDGFDGDSP LVAAVTGRAR ELENEFGAAD QRLRTAVGEA QARVGILTTE RRALAEGARL SPPAPPTRAA DRDGRPGAPL WQLIEPREAG PTPAPALSAS PPLGPAEIAG LEAALEASGL LDAWLCPDGR LLAPGTHDVA IVADGAGAGR VGDHHLGAVL CPAVDRSDPA AAAVPDAIVA AVLARVGYVP ADTSGVVDPD AVDRVGDTWV AADGRFRVGR LRGSWTKPAA CYLGHAAREL ARRNRLTDID TELAALDREI AEHGAARTAL RQRRATLNDE VAGLPADTPA RHTFATARQA DRDRRMADAS HQTAVRAAEQ AEDAFRQARA ERDETAAQLG LPVATDELEE TREALVAYRV ALSALWPAVR ERISAAAQLA QTEQDLADAQ EQVEQREIQH SGARSEYLSA RTAFETMERT VGTAVAEVQR QLAAAQRGLD SATVRRRELG QLRLTIAEAV GQARGQESGL TSRVADLGEQ RRDAAGRLRR FAAGGLLAIA LPELTFPDTG REWAPDPAVR LARQIEAAVG EVDDSDAGWD RVQKLIAAEH GTLREVMSRH GHHAAATLTD DGWVVEATFR TRVMSPAELA DALGVEVTDR RRLLTERERA ILENHLVSEV ASHLQDLISD AEHQVERMNA ELDERPNSTG MKLRFQWRPG PDAPPGLGPA RDRLLRQVSD AWSPADREAV GAFLQAQIKA ERDQDAGGTW LEHLGRALDY RRWHVFAVQR FSDGQWRSAS GPASGGERAL AVTIPLFAAA SGHYRSAGNP HAPRLVMLDE AFAGVDDDAR ASCLGLLAVF DMDVVMTSER EWGCYPTVPG LAISQLSRRD GIDAVLVTRW RWDGRQRLPA GVKRPAGVAT VAGAREPGAT APAENTLWDE P
|
| |