Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4793 |
Symbol | |
ID | 5673134 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5720144 |
End bp | 5724547 |
Gene Length | 4404 bp |
Protein Length | 1467 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 641243649 |
Product | hypothetical protein |
Protein accession | YP_001509065 |
Protein GI | 158316557 |
COG category | [D] Cell cycle control, cell division, chromosome partitioning |
COG ID | [COG1196] Chromosome segregation ATPases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.422119 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTACAAGC TTCGCCGTCT GCACCTGGAC ACCGTCGGCG TGCGTGAGAA CCGGTTCGCC GCGCTGACCC TGGACACCGT CGACGTGGAT GGGAACCCGG CCGACACCCT CATCTGGATG CGCAACGGCT CCGGCAAGAC GACGATCATG TCGCTGCTGG CGGCGCACAT CCTTCCCGCC CGCCGCGACT TCCTCGCGGC GCGCAAGAAG CGCCGCCGTG ACGACGTCAC GAGAACGCTC GAGGACCTCG TGCTCACCGA CGACACCTCC CACGTCGTCG CCGAATGGGA GGCACCGGAC GGAGCCCTGC TGCTGACCGG AGCGGTGTGG GAGTGGCCGA ACAGGCGGCG GCCGGTTGAC CACAACGGCG CGGGTGCGCA GAAGCTCAGG AAGCTGTTCT GGACGATGCG CCCGGACCCG GCGGTCGAGG GCACGATGTT CGAGGATCTG CCGCTGACGC ACCGCACCCG GGGCGAGGTG GATCTCGCCG GCTTCCATGC GTGGATCAAG AGCCGGGCGG TGGCGGGAGC GGACGCCGCG GCCGTGGACC AGATCGACCC GTGGCACCGC CTGCTGGACG ACCGTGGCTT CGACGCAGGT CTGTACCGGT ACTTCGTCAC GATCAACGCG ACCGAGGGCG GGATCGACGC TCTCTTCGCG AACATCCGCA GCGAGAACGA CTTTGCCCGC TACCTGCTCG GCTTCGTCGC CGACGCGGGG CGGGCCGCGA GCGTCCGCGA CCTGCTCCGT CAGGTGGCGA CCGAGCTCGC GCTGCGCCCC AGGTACCGGA CGGAGCAGGA GTTCTGCGTC GACGCGCAGC CCCGGCTCGC CACGTTGGCC ACCGCGCACG AGGACGTGGG GGGCGCCTCC CGGGAGCGCG CCGAGACGGC CCGCGCGGCG GCGGCCTTCC GGCTCGGCCT CCTCGACGCG GCGCACACGG CGGAGGCGGG TGCCGAGCAG GCGGCGCAGG AGCGCCGCGA CGTCGATGAG CAGCGGCTGG AGACGCAACG CCAGGCTGAC GGTGCCCGCC GCCGTCGTGA CGAGCTGATG CGCCTGGCCG CCGGGTACTG GTTCGCCGAC GCCACGATCG AGGAGGAGGC CGCGCGCGAG GCGCTCGCCG CGGCGCGGAC CACCGAGCGG GCCTGGGAGG CCGTGCTTGC CCTGCAGGCC GTCAACCGCG CGAGGATCGA TGTCGACGCC CACCGCCGGG CGATCGACGC GGCGCGGACA CAGGCCGCCC CGCTGCTCGT CCAGGCGGAC CGGGCCGACG CGGACCTCGC CGCCGCGCTC GACGCGGAGC TGCGAGCCGT CGCCGTGGAG ATCGAACATC TCAGCCGCCG GATCGACGAG ACCGGGACTC GTGCCGAGGC GGCAGAGGAC CGGCGGGATC AGGCGGCCGA CGAGATGGGC GGGCTGACCG CGGAAGCTGA TCAGCTCGCC CAGCGCGCCG ACGAGCTGGA GCGGAAGAGA ACGCGGGCCG TCGGCGCGGG TCTGCTCGTC GACGGAGTCC CGGTGGAGAC GGCCGTCGCC GAGGCGGAGA CCGTCCTGGA GTCGCTGCGC GCCGACCTAG ACGCCTGTGT CGTGGAGCTG GACGGGGCGG ACAGGGCGGT GCGCGGCACC GCCACCGCCT ACGACGAGGC CCGCGGGCGA CACCGGGACG CACAGCACGC GGCGGACACG GCGGCGACGA CTCTGCATGG GCTCGAGGAG CGGGCCGCCC AGCTGAACAG GTCCGATGCC GCGGCCCGGC TGGCGCAGAC CTCCGACCTC GATCTGCTCG CCGACGCGGG GCGCGTCCTC GCGCTGGGCG AGGAGGCGAT CGCGACCTCG CAGACGAGGC TGCTGGAACG CGGAGTCGAC GCGAGCAGCC TGCGGCTCGC TGTGGCCGGC CTGGAGGAGA CCTCCCTGCT GCCGCCTCGC CCCGCGCTCC GGCAGGTGCT CGAGGTGCTG GACGATGCCG GGATCCCCGC CGTTCCCGGT TGGGTCTACC TGGCGGAGCA CCTGCCGCTG ACCGCGGCGG CGGCGCTCGT CCAGGAACGG CCGGAGATCT GTGACGGCGC CGTCGTCTAC GGTGACCTCG CGGCCGCCGT CCAACTCGCC GCCGAGCTCG TTCTCGACGA CCCGGTCGTC CTCGCGCCGG TGACGGTCTT CGGTGAGACG CCCGTCGTTC GCGACGGCCT GGTCGTGGTC GGGCCGGTCG CCGCCCGCTA CGACCACGCG GCCGGGGCCG TCGAGCTCGA GGCTCGTCGT GGCAGGCTGA GTGAGCATGA CCAGGAGAGC GGTCGGCTGC GTACCGACCT GGACGAGCTC TCCCGCCTCA CCGCGGATCT GCGCACGCTC GCGGCCGACG TCGCGACGCG CGGCGGTGCC GCGGTGGTGC GGGAGCGGGC AAGCCACACC ACCGCGGTCC TCGAACAGGC CGAAGACGCG CAGACGGCGG CGGCCGACGC CCACCGCGAG GCGCAGGTGG TCCAGCGCGA GCTCGCCGCG AGGTGCGAGG AGCTGCGTGG CGCGGTCACC GGAGCACAGG GGCGACTGGA CCGGCTCAAC GACCTCGCCG AGGACCTGGC GGCTCTCCCG CCTGTGCGGG CACGGCTGGA GGAGATCCCC GCCCTGCGCG CGAAGGCGGT TGCCGCGCGG GAGCGGGCCC ACAACGAGGC CCGCGCGGCC CGCCGGGAGC AGCTTGACCT GAGCGACACC CGGGCCCGTC TCAGCACCAG GAGCGGCGAT CTCACCCTCG AGAGGGGATC GCTCACCGTG CACGGAGAGC ACCGGGGCGA AGTGCCAGAA GGCGGCCTGG ACGGGGCGCG CCGAGCCCGG GAGCAGGCAC ACCGGCTGGC CCGGGAGGCG TTCGACGAAC CCGAGCTGGT CCGCCGGCTC GACGACCTGG TCGACATCGC CGCCCAGGCA GGTGGGGCGT GGGGCGAGTA TGCCGTGGAG GTACAGGAAC GGGCCCGGGA ACTGGCCGGG GCGCCTGGCG CCGCCGACCC GGCCCTACGG TCCCGCTACC GAGACGAAGC CGCCCAGGCC GCGGACCAGG CAGCCACCCG TCATGCCCTG GCCAAAGCCG ACCTCGACGA CGCCGGCCAT GCCGTGCGGG AGCGCCGGCC GGTGGACCGG CCGCGGCATG CCGAGCTCCC GCACGAACCG GTGAACCGCG CCGACGCGGA GGCCGCCGCG GCGGAGGAGG ACCGCGCCGC GGCCAGGATC CAGGATGATC TCCGTCGGGT GGAGCAGCGG CTGGAAGCCC TGGAGAAACA CCATGACGCG TTGGCGTCAC GGGCTGTGCA ACTGCGGCTG CGCGCCGACC GGTTCGACCA GTGGGCCGAC GTCGCCGTCA CCGCGACCGC GGTGTCCACC GATCCGGCCG CGCTGGACGA GTCCACCACA CGGATCCTCG ACTGGGCTCG GCGTGCGGTG GCCCGCCACA CCGAAGCCGT CGACGCACGA GGCCGCGCGT TCGAGGCCCT GCGCCGCTGG GCCGCGGACG AACGGTTCGC GCTGACCACC GGCGACACCG AAGCCACCCG GCGGCTCCGG AAGATGTTCC TCGACCAGCC CGTGGAAGGA GTCGCTGGCC GGGCCGGTGC GCTGAGCGCC GAACTCGACC AGCGCCGGGC CCGTATCGAC CAGCACCTCG AGCATCTCGA CCAGTCGCAG GAGAACATCG TCACCCGGCT TGTCGATCTC GTCGACGCGG CGCTGGCGGA CCTCGCCCGG TTCACCCGGC TGTCCCGGCT GCCCGAGAAC ATCGGTCCCT GGGGTGGTCA CGAGTTCGTA CAGGTCCGAC CACGCGGGGA ACGGCCGAGC CAGGAGCAGG CGCGGGTCCG GGTCTCCGAC CTCGTCGACC AGCTGGTGGC GCCCGGTGCG TCCCTCGACA TGGAACCGGT GGACCTGGTG TGGCGGGCGA CCGAGGCGGC CGTCGGCGGC TCGGGGTTCC GGGCCCGCAT CCTCAAGCCC GACCCGGCGC AGCCGACGCA GAAGGTCGAC GTGACCGAGA TGAACAAGTG GTCCGGCGGC GAGAACCTGA CCGCCTGCCT GGTGCTGTTC TGCGTCATGG TGAAACTGCG GGCGGAGAAC CGAGGCGGCC GGGTCGAGGG CGGTGTCGCC GGCGGTCTCG TCCCGCTGGA CAACCCGCTC GGCAAGGCGA ACTACGTGCC GTTCCTCCAG CTGCAGCGCA CGGTCGCGGC TGCGTCCGGG GTGCAGCTGC TGTTCTTCAC CGGGATTGGC GATCTCCCGG CCATTCGCGC CTTCGACGCC GTCCTCGCCT GCTCCAAGCG TCGGTCGCGA ACCGGCCCGG GAACGTACGT GACCCTGGAC GAAGCCGACC GCCCATCTCG AAGCGAGGAG ATTGACGGCA TCCGCCTGGT ACGGCAGCGG ACGACGAGCG AACCCTCGCC GTGA
|
Protein sequence | MYKLRRLHLD TVGVRENRFA ALTLDTVDVD GNPADTLIWM RNGSGKTTIM SLLAAHILPA RRDFLAARKK RRRDDVTRTL EDLVLTDDTS HVVAEWEAPD GALLLTGAVW EWPNRRRPVD HNGAGAQKLR KLFWTMRPDP AVEGTMFEDL PLTHRTRGEV DLAGFHAWIK SRAVAGADAA AVDQIDPWHR LLDDRGFDAG LYRYFVTINA TEGGIDALFA NIRSENDFAR YLLGFVADAG RAASVRDLLR QVATELALRP RYRTEQEFCV DAQPRLATLA TAHEDVGGAS RERAETARAA AAFRLGLLDA AHTAEAGAEQ AAQERRDVDE QRLETQRQAD GARRRRDELM RLAAGYWFAD ATIEEEAARE ALAAARTTER AWEAVLALQA VNRARIDVDA HRRAIDAART QAAPLLVQAD RADADLAAAL DAELRAVAVE IEHLSRRIDE TGTRAEAAED RRDQAADEMG GLTAEADQLA QRADELERKR TRAVGAGLLV DGVPVETAVA EAETVLESLR ADLDACVVEL DGADRAVRGT ATAYDEARGR HRDAQHAADT AATTLHGLEE RAAQLNRSDA AARLAQTSDL DLLADAGRVL ALGEEAIATS QTRLLERGVD ASSLRLAVAG LEETSLLPPR PALRQVLEVL DDAGIPAVPG WVYLAEHLPL TAAAALVQER PEICDGAVVY GDLAAAVQLA AELVLDDPVV LAPVTVFGET PVVRDGLVVV GPVAARYDHA AGAVELEARR GRLSEHDQES GRLRTDLDEL SRLTADLRTL AADVATRGGA AVVRERASHT TAVLEQAEDA QTAAADAHRE AQVVQRELAA RCEELRGAVT GAQGRLDRLN DLAEDLAALP PVRARLEEIP ALRAKAVAAR ERAHNEARAA RREQLDLSDT RARLSTRSGD LTLERGSLTV HGEHRGEVPE GGLDGARRAR EQAHRLAREA FDEPELVRRL DDLVDIAAQA GGAWGEYAVE VQERARELAG APGAADPALR SRYRDEAAQA ADQAATRHAL AKADLDDAGH AVRERRPVDR PRHAELPHEP VNRADAEAAA AEEDRAAARI QDDLRRVEQR LEALEKHHDA LASRAVQLRL RADRFDQWAD VAVTATAVST DPAALDESTT RILDWARRAV ARHTEAVDAR GRAFEALRRW AADERFALTT GDTEATRRLR KMFLDQPVEG VAGRAGALSA ELDQRRARID QHLEHLDQSQ ENIVTRLVDL VDAALADLAR FTRLSRLPEN IGPWGGHEFV QVRPRGERPS QEQARVRVSD LVDQLVAPGA SLDMEPVDLV WRATEAAVGG SGFRARILKP DPAQPTQKVD VTEMNKWSGG ENLTACLVLF CVMVKLRAEN RGGRVEGGVA GGLVPLDNPL GKANYVPFLQ LQRTVAAASG VQLLFFTGIG DLPAIRAFDA VLACSKRRSR TGPGTYVTLD EADRPSRSEE IDGIRLVRQR TTSEPSP
|
| |